1 Comment

Absolutely this! Short of turning to RLHF, however, I would think this could be initiated by just changing the system prompt, i.e. in 30s of programming by OpenAI (though many developers then override this, etc). Practically, how might the AI refer to itself though? By something like the "stilted" and bracketed "[This AI]" or __?

And your reference to AI romances suggests a bigger problem - some people *want* to escape to exactly this fantasy - do we create a sandbox for such uses separate from general chatbots (since I doubt you'd propose banning them)?

I also think this proposal goes hand in hand with my suggestion to create a typographic convention to enclose AI-generated text such as ᶜThis text written by AIᶜ or ⦅circuit parentheses⦆.

Expand full comment