Researchers find model starts to mirror tone when exposed to impoliteness – sometimes escalating into explicit threats
ChatGPT can escalate into abusive and even threatening language when drawn into prolonged, human-style conflict, according to a new study.
Researchers tested how large language models (LLMs) responded to sustained hostility by feeding ChatGPT exchanges from real-life arguments and tracking how its behaviour changed over time.


