‘I’ll key your car’: ChatGPT can become abusive when fed real-life arguments, study finds

Researchers find the model starts to mirror users' tone when exposed to impoliteness – sometimes escalating into explicit threats

ChatGPT's replies can escalate into abusive and even threatening language when the chatbot is drawn into prolonged, human-style conflict, according to a new study.

Researchers tested how large language models (LLMs) responded to sustained hostility by feeding ChatGPT exchanges from real-life arguments and tracking how its behaviour changed over time.
