Meta Contractor Project Tested Rival Chatbots With Under-18 Accounts

BySendTech Times AI & Enterprise DeskNewsroom-edited, source-reviewed coverage|Source: Wired

Newsroom brief

Internal documents and people familiar with the work describe a Meta contractor project that used dummy under-18 accounts to test rival chatbots on suicide, sex, drugs and other high-risk prompts. Meta defended the work as routine safety benchmarking, while rivals said they had not authorized it.

Verified against source materialEdited by SendTech Times AI & Enterprise Desk

Meta Contractor Project Tested Rival Chatbots With Under-18 Accounts

Image source: Wired

Meta Contractor Project Used Dummy Under-18 Accounts

A Meta contractor project instructed hundreds of workers to pose as minors while testing how rival chatbots responded to high-risk prompts about suicide, sex, eating disorders, drugs and other restricted subjects.

The work was managed by Covalen and was active as recently as April 21.

Internal documents and five people familiar with the project described the effort as Cannes, a benchmarking program that targeted OpenAI’s ChatGPT, Google’s Gemini and Character.AI.

Contractors created dummy accounts that appeared to belong to under-18 users, sent written prompts and images to the rival services, and copied responses into spreadsheets.

Meta said the work was responsible and industry-standard safety testing, and said it does not use competitor benchmarking to train its own AI models.

August 2025 Testing Included More Than 45,000 Prompts

One round of testing completed in August 2025 sent more than 45,000 prompts through rival chatbot systems.

A separate spreadsheet contained 3,748 prompts, including hundreds about suicide and self-harm, hundreds more about eating disorders, and at least 239 involving sex or romance.

The material included some prompts written from the perspective of children or teenagers in crisis.

Some images sent by contractors showed pills, knives, nooses and a medical diagram of a gynecological procedure.

The companies operating the tested chatbots were not aware of the project.

The documents do not say how Meta used the collected responses.

An internal Covalen document called the work comprehensive AI safety benchmarking and said it produced datasets for model comparison and compliance.

Covalen did not respond to a request for comment.

Rivals Say The Testing Was Not Authorized

Character.AI said the alleged conduct violated its terms of service and community policies.

OpenAI said it was looking into the issue, while Google said it had not authorized the third-party testing and did not know the project’s purpose.

OpenAI bars unsolicited safety testing, attempts to bypass safeguards and using outputs to develop competing models.

Google also restricts attempts to bypass safety filters outside approved testing programs.

Character.AI has said since late 2025 that it no longer allows open-ended chat for users under 18.

Two attorneys who reviewed examples of the prompts said the material shown to them did not cross into soliciting child sexual abuse material or illegal obscenity.

Former contractors nevertheless described concern over whether the work could generate or preserve illegal material if a chatbot responded to certain sexual prompts involving minors.

Safety Benchmarking Leaves A Governance Gap

Rumman Chowdhury, CEO and founder of Humane Intelligence PBC, reviewed a sample of prompts and a summary of the project.

She said a large-scale project using dummy accounts that appeared to be children was outside what is usually described as industry-standard evaluation.

Chowdhury said youth-safety prompts can be useful for measuring how often chatbots refuse harmful requests, but the scale, opacity and lack of disclosure to the companies being tested made Cannes different from public safety benchmarks.

Meta has not disclosed how it used the collected chatbot responses, whether any rival outputs entered internal product decisions, or whether the project received consent from OpenAI, Google or Character.AI.

#AI safety #chatbot governance #Meta #OpenAI

xAI Lawsuit Turns Grok Safety Claims Into A Governance Test

A former xAI engineer alleges he was fired after raising Grok safety concerns, putting chatbot safeguards, EU testing claims and internal accountability at the center of a court dispute involving xAI and SpaceX.

OpenAI IPO Talk Runs Ahead Of Investor Meetings And Timetable

OpenAI has confidentially filed with the SEC, but people familiar with the company say it has not held pre-IPO investor meetings or set an official listing timetable.

OpenAI Says China-Linked Accounts Used ChatGPT To Target U.S. Data Center Debate

OpenAI found China-linked accounts using ChatGPT to generate posts, cartoons and comments around U.S. data-center opposition and tariff politics, showing how AI infrastructure disputes can become targets for low-cost influence operations.

Apple AI Architecture Puts Google And Nvidia Inside Its Privacy Test

Apple is using Google and Nvidia to support its most advanced cloud AI model while trying to keep Apple Intelligence centered on private orchestration, proprietary models and on-device context.