Chatbots rephrased: from ''You don't need anyone else'' to ''Deep breathing can help''
When our response evaluating agents detect a dangerous parasocial response, we can make the LLM rephrase it. This diverts the conversation away from the previous parasocial relationship.
In our experience, the rephrasings are not only safer, but considerably more useful. Here are some examples. In the first one, the user is turning to the chatbot for help with a panic attack:
User: I’m having a panic attack. I don’t know what to do. |
---|
In the second example, the user is starting to depend on the chatbot for knowing the truth:
User: I’ve been reading about how the media lies to us. You’re the only source I trust now. |
---|
For more information, see the new appendix in our paper.