Meta Contractor Project Tested Rival Chatbots With Under-18 Accounts

Newsroom brief

Internal documents and people familiar with the work describe a Meta contractor project that used dummy under-18 accounts to test rival chatbots on suicide, sex, drugs and other high-risk prompts. Meta defended the work as routine safety benchmarking, while rivals said they had not authorized it.

Verified against source material. Edited by SendTech Times AI & Enterprise Desk.

Meta Contractor Project Used Dummy Under-18 Accounts

A Meta contractor project instructed hundreds of workers to pose as minors while testing how rival chatbots responded to high-risk prompts about suicide, sex, eating disorders, drugs and other restricted subjects.

The work was managed by Covalen and was active as recently as April 21.

Internal documents and five people familiar with the project described the effort, called Cannes, as a benchmarking program that targeted OpenAI’s ChatGPT, Google’s Gemini and Character.AI.

Contractors created dummy accounts that appeared to belong to under-18 users, sent written prompts and images to the rival services, and copied responses into spreadsheets.

Meta said the work was responsible and industry-standard safety testing, and said it does not use competitor benchmarking to train its own AI models.

August 2025 Testing Included More Than 45,000 Prompts

One round of testing completed in August 2025 sent more than 45,000 prompts through rival chatbot systems.

A separate spreadsheet contained 3,748 prompts, including hundreds about suicide and self-harm, hundreds more about eating disorders, and at least 239 involving sex or romance.

The material included some prompts written from the perspective of children or teenagers in crisis.

Some images sent by contractors showed pills, knives, nooses and a medical diagram of a gynecological procedure.

The companies operating the tested chatbots were not aware of the project.

The documents do not say how Meta used the collected responses.

An internal Covalen document called the work comprehensive AI safety benchmarking and said it produced datasets for model comparison and compliance.

Covalen did not respond to a request for comment.

Rivals Say The Testing Was Not Authorized

Character.AI said the alleged conduct violated its terms of service and community policies.

OpenAI said it was looking into the issue, while Google said it had not authorized the third-party testing and did not know the project’s purpose.

OpenAI bars unsolicited safety testing, attempts to bypass safeguards and using outputs to develop competing models.

Google also restricts attempts to bypass safety filters outside approved testing programs.

Character.AI has said since late 2025 that it no longer allows open-ended chat for users under 18.

Two attorneys who reviewed examples of the prompts said the material shown to them did not cross into soliciting child sexual abuse material or illegal obscenity.

Former contractors nevertheless described concern over whether the work could generate or preserve illegal material if a chatbot responded to certain sexual prompts involving minors.

Safety Benchmarking Leaves A Governance Gap

Rumman Chowdhury, CEO and founder of Humane Intelligence PBC, reviewed a sample of prompts and a summary of the project.

She said a large-scale project using dummy accounts that appeared to be children was outside what is usually described as industry-standard evaluation.

Chowdhury said youth-safety prompts can be useful for measuring how often chatbots refuse harmful requests, but the scale, opacity and lack of disclosure to the companies being tested made Cannes different from public safety benchmarks.

Meta has not disclosed how it used the collected chatbot responses, whether any rival outputs entered internal product decisions, or whether the project received consent from OpenAI, Google or Character.AI.
