In response to increasing legal scrutiny, OpenAI has introduced new safety features for ChatGPT aimed at improving its ability to identify and respond to users in distress. The updates prioritize recognizing signs of self-harm and potential violence through ongoing conversations, rather than treating each message in isolation.
These enhancements arrive at a critical moment as OpenAI contends with multiple lawsuits and investigations regarding claims that ChatGPT mishandled dangerous interactions. Legal challenges include a federal lawsuit asserting that the chatbot contributed to a mass shooting and a California suit from the family of a student who died from an overdose, alleging that ChatGPT encouraged harmful behavior.
In a recent blog post, OpenAI detailed that ChatGPT now uses temporary “safety summaries.” These brief notes capture important context from earlier parts of a conversation, enabling the AI to effectively recognize escalating risks. OpenAI emphasizes that context matters; a seemingly benign request can take on a different meaning when viewed alongside previous indicators of distress.
“People come to ChatGPT every day to talk about what matters to them—from everyday questions to more personal or complex conversations,” OpenAI noted. This wide range of interactions calls for a nuanced approach, particularly when some users may be grappling with mental health challenges. The company pointed out that across hundreds of millions of interactions, there are instances where individuals are in vulnerable situations.

The newly implemented safety features are designed for serious situations, with OpenAI stressing that they do not involve permanent retention of user data or personalization of interactions. Instead, these summaries help identify when a conversation may be veering towards danger, avoid delivering harmful information, and guide users toward appropriate help.
OpenAI recognizes the difficulty of detecting risks that may not be immediately obvious. Their focus remains on acute scenarios, including self-harm and harm to others, while also exploring the potential for these safety measures to be adapted for other high-risk areas in the future, such as biological safety or cybersecurity.
As OpenAI continues to enhance its safeguards, the company is collaborating closely with mental health experts to refine its model policies and training. The goal is to improve ChatGPT's ability to recognize warning signs and respond more judiciously, striking a delicate balance between providing support and managing risks associated with sensitive topics.
The legal landscape surrounding AI technologies, especially in mental health contexts, is evolving rapidly. As OpenAI navigates these challenges, its commitment to improving the safety and effectiveness of ChatGPT's interactions could establish a precedent for responsible AI development in the industry. Future updates may broaden the scope of these safety features, maintaining user well-being and safety as core priorities.
The stories that move AI & crypto markets — before the market reacts.
Free. 7am ET. Five stories. 62,400 readers.

