Analysis
FUNDING GAP:

Springboards Tests Qwen 3 Model Against Repetitive LLM Answers

Newsroom brief

Australian startup Springboards has built Flint on Alibaba’s Qwen 3 to produce more varied answers to open-ended prompts. MIT Technology Review’s article pairs the company’s claim with a NeurIPS-winning homogeneity paper and user cautions that the prototype still fails under pressure.

Verified against source materialEdited by SendTech Times AI & Enterprise Desk
Springboards Tests Qwen 3 Model Against Repetitive LLM Answers
Image source: MIT Technology Review

Springboards Builds Flint On Qwen 3

Australian startup Springboards has built an LLM called Flint to make open-ended chatbot answers less repetitive.

The company is pitching the model to advertising and marketing users who want more varied brainstorming output than mainstream systems often produce.

Springboards cofounder and CEO Pip Bingemann said most language models are designed to fight hallucinations, while Flint is built to invite more unusual suggestions.

In one demonstration described by MIT Technology Review, ChatGPT and Claude gave the same simple campaign tagline, while Flint returned a different line.

The company built Flint on Qwen 3, the open-source model from Alibaba.

Springboards cofounder and CTO Kieran Browne said training a foundation model was too expensive for the small team, so the company focused on changing where a model introduces variety in its output.

Research Paper Shows Repeated Answers

The startup is working on a problem that AI researchers have also measured.

A November paper titled "Artificial Hivemind" found that different LLMs often converge on similar answers to open-ended prompts.

The researchers asked 25 LLMs to write a metaphor about time 50 times each.

MIT Technology Review said most of the 1,250 responses were versions of "Time is a river" or "Time is a weaver." The paper won a best paper award at NeurIPS.

OpenAI told MIT Technology Review that training models to give reliable and coherent answers can make them converge on familiar, high-probability responses.

OpenAI also said pushing harder for novelty can make responses less reliable.

Prototype Users Still Need Human Judgement

Springboards is offering Flint as an optional model within its brainstorming tool, which lets creative teams combine text from multiple LLMs.

Zoe Scaman, founder of Bodacious and chief strategy officer at 77X, said Flint pushed her in different directions during tests.

Scaman also said the premise was powerful while noting that Flint remains a prototype and can fail when users push it too far.

That keeps the article's evidence closer to a test of creative variety than a proven enterprise deployment.

Maximilian Weigl, cofounder and chief strategy officer at Uncommon, said his team uses Flint with ChatGPT, Claude and Gemini.

He also said average answers are often good enough and warned against teams copying AI output without human thinking.

Springboards did not disclose Flint pricing, a general launch date, customer numbers, enterprise deployment commitments or independent benchmark results for the prototype.

Share this article
inXf

Related articles

More
Instacart’s Grocery AI Rollout Tests Whether Agents Can Build Baskets Without Breaking Trust
AI

Instacart’s Grocery AI Rollout Tests Whether Agents Can Build Baskets Without Breaking Trust

Instacart has rolled out an AI shopping assistant to millions of U.S. customers, with U.S. and Canada expansion planned in the coming months. The assistant turns prompts, photos and deal requests into carts using live inventory from nearly 100,000 stores and data from more than 1.6 billion lifetime orders. The tension is whether larger baskets and personalization can scale while customers still review every decision before checkout.

Japan’s Gennai AI Push Tests Public-Sector Guardrails For Diet Answers
AI

Japan’s Gennai AI Push Tests Public-Sector Guardrails For Diet Answers

Japan’s government is using its in-house generative AI system Gennai to help prepare Diet answer documents as officials defend the workflow against criticism. Digital Minister Matsumoto said Gennai can identify related systems and past answers, while staff still revise outputs and check facts before material reaches the minister. The practical question is whether the tool reduces late-night bureaucratic work without turning parliamentary answers into unchecked AI output.

SoftBank Drop Shows AI Infrastructure Costs Hitting Asia Tech Stocks
AI

SoftBank Drop Shows AI Infrastructure Costs Hitting Asia Tech Stocks

SoftBank Group fell more than 12% as Asian technology shares sold off, with the pressure tied to AI infrastructure costs, Arm weakness and semiconductor price concerns.

Grep Adds LLM Agent To Monito As Online Proctoring Shifts Toward Context Review
AI

Grep Adds LLM Agent To Monito As Online Proctoring Shifts Toward Context Review

Grep said its Monito online proctoring product now uses an LLM agent to analyze context around suspected cheating events. The company cited internal tests showing more than 30 percent shorter post-exam review time and nearly 20 percent fewer false alerts. The key issue is whether agent-based proctoring can improve review efficiency while preserving human final judgment and candidate fairness.

Keep Reading

More Stories

Latest
Memory Prices Push US PC Shipments Down 7%Chips & SemiconductorsJul 2, 2026Memory Prices Push US PC Shipments Down 7%Omdia data cited by Tom Hardware showed US PC shipments fell to 15.8 million units in the first quarter of 2026, as memory and storage chip shortages hit entry-level laptops and pushed the market toward a forecast 14.4% contraction.Cloudflare Sets September Deadline For Mixed-Use AI CrawlersAIJul 2, 2026Cloudflare Sets September Deadline For Mixed-Use AI CrawlersCloudflare plans to block mixed-use crawlers from ad-supported pages by default from September 15, 2026, unless site owners change the setting. The policy pushes AI companies to separate search access from agent and training uses while Cloudflare expands publisher payment tools.WhatsApp Usernames Hide Phone Numbers But Scam Risk RemainsCybersecurityJul 2, 2026WhatsApp Usernames Hide Phone Numbers But Scam Risk RemainsWhatsApp is rolling out usernames and optional keys to reduce phone-number exposure, but security researchers warn that impersonation and social-engineering scams can move to handles, profile images and trusted-looking accounts.Starlink Discounts Memphis Plans Around xAI Data Centre DisputeCloud & Data CentersJul 2, 2026Starlink Discounts Memphis Plans Around xAI Data Centre DisputeSpaceX is offering Starlink discounts near xAI’s Colossus data centres in Memphis and Southaven, while lawsuits and permit disputes keep attention on power, noise and pollution claims around the AI site.AMD Drops HBM For LPDDR5X In Versal Memory Package ShiftChips & SemiconductorsJul 2, 2026AMD Drops HBM For LPDDR5X In Versal Memory Package ShiftAMD is moving its Versal Premium Gen 2 memory-on-package adaptive SoCs from HBM to LPDDR5X after HBM2e supply limits forced the earlier Versal HBM family toward discontinuation.MiCA Deadline Forces EU Crypto Firms To Choose Licences Or ExitCrypto/Web3Jul 2, 2026MiCA Deadline Forces EU Crypto Firms To Choose Licences Or ExitCoinDesk reported that the EU’s MiCA framework is now fully in force, requiring crypto firms serving the 27-nation bloc to hold a licence or stop operating. Industry lawyers and executives said the rulebook improves clarity, but warned that compliance costs could shrink roughly 3,000 registered providers to 300 or 400 licensed firms.UAE Gives Social Platforms 12 Months To Enforce Under-15 RulesCapital & PolicyJul 2, 2026UAE Gives Social Platforms 12 Months To Enforce Under-15 RulesThe UAE says social media platforms must build effective age-verification controls after a Cabinet resolution restricting under-15 access. Technology companies have 12 months before penalties apply, and officials said age-verification data must be deleted immediately rather than stored by platforms.Robinhood Opens Arbitrum Chain As Stock Tokens Go Live In 120 CountriesFintech & Digital PaymentsJul 2, 2026Robinhood Opens Arbitrum Chain As Stock Tokens Go Live In 120 CountriesRobinhood has launched the public mainnet for Robinhood Chain, a Layer 2 blockchain built on Arbitrum, and made Stock Tokens available through Robinhood Wallet in more than 120 countries. The company also introduced Robinhood Earn with an estimated 7% yield on USDG, but jurisdictional availability and control settings remain central limits.Nvidia Names $500 Billion US AI Infrastructure Plan But Leaves Timing OpenCloud & Data CentersJul 2, 2026Nvidia Names $500 Billion US AI Infrastructure Plan But Leaves Timing OpenNvidia says it and partners including TSMC, Foxconn, Wistron, Corning, Lumentum, Coherent and Amkor plan up to $500 billion of US AI infrastructure production. The account comes from Nvidia's own company blog; it names factories, suppliers and job figures, but gives no full production timetable for the programme.AWS Announces $1 Billion Forward-Deployed AI Engineering UnitAIJul 2, 2026AWS Announces $1 Billion Forward-Deployed AI Engineering UnitAWS has announced a $1 billion Forward Deployed Engineering organisation that will send small engineering pods into customer environments for about 45 days. TheStreet reported that early users include the Allen Institute, Cox Automotive, the NBA, the NFL, Ricoh and Southwest Airlines.Meta Board Weighs Iran Influence Posts Left OnlinePoliticsJul 2, 2026Meta Board Weighs Iran Influence Posts Left OnlineMeta’s Oversight Board may examine whether Facebook and Instagram should have removed two pro-Iran posts that users flagged as possible co-ordinated inauthentic behaviour tied to state-sponsored influence operations.OCC Public Denials Raise Charter Risk For Fintech ApplicantsFintech & Digital PaymentsJul 2, 2026OCC Public Denials Raise Charter Risk For Fintech ApplicantsThe OCC plans to publish charter denial decisions, giving fintech and digital-banking applicants clearer examples of why filings fail. The guidance also raises the reputational cost of applying before governance, compliance and risk systems are ready.