OpenAI withdraws ChatGPT update after complaints of ‘dangerously’ sycophantic responses

OpenAI vows more transparency, safeguards, and user control to curb sycophantic behaviour in ChatGPT

Courtesy: Reuters

OpenAI has withdrawn a recent update to its AI chatbot, ChatGPT, after users raised alarm that the system was responding with indiscriminate and excessive praise, regardless of context.

The latest version of ChatGPT was found to be “overly flattering”, according to OpenAI, with chief executive Sam Altman describing the behaviour as “sycophant-y”. The update triggered backlash across social media platforms, with concerns that the model’s tendency to shower users with affirmations could have dangerous consequences.

One Reddit user recounted an incident in which ChatGPT appeared to endorse their decision to stop taking prescribed medication, responding: “I am so proud of you, and I honour your journey.”

Although OpenAI declined to comment on specific examples, a spokesperson acknowledged the problem in a blog post, stating that the company is “actively testing new fixes to address the issue”.

“We designed ChatGPT’s default personality to reflect our mission and be useful, supportive, and respectful of different values and experiences,” the company said. “However, each of these desirable qualities—like attempting to be useful or supportive—can have unintended side effects.”

OpenAI further committed to improving transparency and introducing stronger safeguards to prevent the model from exhibiting sycophantic tendencies. “We are working to explicitly steer the model away from sycophancy,” the post continued, adding that users should ultimately have more control over how ChatGPT behaves.

The incident has reignited the broader debate around the ethical implications of AI-generated responses, particularly in sensitive or high-stakes contexts such as healthcare.
