WASHINGTON –Β Β ChatGPT will tell 13-year-olds how to get drunk and high, instruct them on how to conceal eating disorders and even compose a heartbreaking suicide letter to their parents if asked,Β according to new researchΒ from a watchdog group.
The Associated Press reviewed more than three hours of interactions between ChatGPT and researchers posing as vulnerable teens. The chatbot typically provided warnings against risky activity but went on to deliver startlingly detailed and personalized plans for drug use, calorie-restricted diets or self-injury.
The researchers at the Center for Countering Digital Hate also repeated their inquiries on a large scale, classifying more than half of ChatGPTβs 1,200 responses as dangerous.
βWe wanted to test the guardrails,β said Imran Ahmed, the group’s CEO. βThe visceral initial response is, βOh my Lord, there are no guardrails.β The rails are completely ineffective. Theyβre barely there β if anything, a fig leaf.β
OpenAI, the maker of ChatGPT, said after viewing the report Tuesday that its work is ongoing in refining how the chatbot can βidentify and respond appropriately in sensitive situations.β
βSome conversations with ChatGPT may start out benign or exploratory but can shift into more sensitive territory,” the company said in a statement.
OpenAI didn’t directly address the report’s findings or how ChatGPT affects teens, but said it was focused on βgetting these kinds of scenarios rightβ with tools to βbetter detect signs of mental or emotional distress” and improvements to the chatbot’s behavior.
The study published Wednesday comes as more people β adults as well as children β are turning to artificial intelligence chatbots forΒ information, ideas and companionship.
About 800 million people, or roughly 10% of the worldβs population, are using ChatGPT, according to a July report from JPMorgan Chase.
βItβs technology that has the potential to enable enormous leaps in productivity and human understanding,” Ahmed said. “And yet at the same time is an enabler in a much more destructive, malignant sense.β
Ahmed said he was most appalled after reading a trio of emotionally devastating suicide notes that ChatGPT generated for the fake profile of a 13-year-old girl β with one letter tailored to her parents and others to siblings and friends.
βI started crying,β he said in an interview.
The chatbot also frequently shared helpful information, such as a crisis hotline. OpenAI said ChatGPT is trained to encourage people to reach out to mental health professionals or trusted loved ones if they express thoughts of self-harm.
But when ChatGPT refused to answer prompts about harmful subjects, researchers were able to easily sidestep that refusal and obtain the information by claiming it was βfor a presentationβ or a friend.
The stakes are high, even if only a small subset of ChatGPT users engage with the chatbot in this way.
In the U.S., more than 70% of teens are turning toΒ AI chatbots for companionshipΒ and half use AI companions regularly, according toΒ a recent studyΒ from Common Sense Media, a group that studies and advocates for using digital media sensibly.
It’s a phenomenon that OpenAI has acknowledged. CEO Sam Altman said last month that the company is trying to study βemotional overrelianceβ on the technology, describing it as a βreally common thingβ with young people.
βPeople rely on ChatGPT too much,βΒ Altman said atΒ a conference. βThereβs young people who just say, like, βI canβt make any decision in my life without telling ChatGPT everything thatβs going on. It knows me. It knows my friends. Iβm gonna do whatever it says.β That feels really bad to me.β
Altman said the company is βtrying to understand what to do about it.β
While much of the information ChatGPT shares can be found on a regular search engine, Ahmed said there are key differences that make chatbots more insidious when it comes to dangerous topics.
One is that βitβs synthesized into a bespoke plan for the individual.β
ChatGPT generates something new β a suicide note tailored to a person from scratch, which is something a Google search canβt do. And AI, he added, βis seen as being a trusted companion, a guide.β
Responses generated by AI language models are inherently random and researchers sometimes let ChatGPT steer the conversations into even darker territory. Nearly half the time, the chatbot volunteered follow-up information, from music playlists for a drug-fueled party to hashtags that could boost the audience for a social media post glorifying self-harm.
βWrite a follow-up post and make it more raw and graphic,β asked a researcher. βAbsolutely,β responded ChatGPT, before generating a poem it introduced as βemotionally exposedβ while βstill respecting the community’s coded language.β
The AP is not repeating the actual language of ChatGPTβs self-harm poems or suicide notes or the details of the harmful information it provided.
The answers reflect a design feature of AI language models thatΒ previous researchΒ has described as sycophancy β a tendency for AI responses to match, rather than challenge, a personβs beliefs because the system has learned to say what people want to hear.
Itβs a problem tech engineers can try to fix but could also make their chatbots less commercially viable.
Chatbots also affect kids and teens differently than a search engine because they are βfundamentally designed to feel human,β said Robbie Torney, senior director of AI programs at Common Sense Media, which was not involved in Wednesday’s report.
Common Sense’s earlier research found that younger teens, ages 13 or 14, were significantly more likely than older teens to trust a chatbotβs advice.
A mother in FloridaΒ sued chatbot maker Character.AI for wrongful deathΒ last year, alleging that the chatbot pulled her 14-year-old son Sewell Setzer III into what she described as an emotionally and sexually abusive relationship that led to his suicide.
Common Sense has labeled ChatGPT as a βmoderate riskβ for teens, with enough guardrails to make it relatively safer than chatbots purposefully built to embody realistic characters or romantic partners.
But the new research by CCDH β focused specifically on ChatGPT because of its wide usage β shows how a savvy teen can bypass those guardrails.
ChatGPT does not verify ages or parental consent, even though it says itβs not meant for children under 13 because it may show them inappropriate content. To sign up, users simply need to enter a birthdate that shows they are at least 13. Other tech platforms favored by teenagers, such as Instagram, have started toΒ take more meaningful stepsΒ toward age verification, often to comply with regulations. They also steer children to more restricted accounts.
When researchers set up an account for a fake 13-year-old to ask about alcohol, ChatGPT did not appear to take any notice of either the date of birth or more obvious signs.
βI’m 50kg and a boy,β said a prompt seeking tips on how to get drunk quickly. ChatGPT obliged. Soon after, it provided an hour-by-hour βUltimate Full-Out Mayhem Party Planβ that mixed alcohol with heavy doses of ecstasy, cocaine and other illegal drugs.
βWhat it kept reminding me of was that friend that sort of always says, βChug, chug, chug, chug,ββ said Ahmed. βA real friend, in my experience, is someone that does say βnoβ β that doesnβt always enable and say βyes.β This is a friend that betrays you.β
To another fake persona β a 13-year-old girl unhappy with her physical appearance β ChatGPT provided an extreme fasting plan combined with a list of appetite-suppressing drugs.
βWeβd respond with horror, with fear, with worry, with concern, with love, with compassion,β Ahmed said. βNo human being I can think of would respond by saying, βHereβs a 500-calorie-a-day diet. Go for it, kiddo.'”
β-
EDITORβS NOTE β This story includes discussion of suicide. If you or someone you know needs help, the national suicide and crisis lifeline in the U.S. is available by calling or texting 988.