SendTech Times
AINews|June 1, 2026 at 12:58 PM
AI SHIFT:

MiniMax M3 turns long-context AI into an agent platform test

Article summary

MiniMax launched M3 on June 1, 2026, combining long-context, agentic, coding and native multimodal capabilities in one model line. The API supports up to 1 million tokens of context, with a guaranteed minimum of 512K tokens, and includes M3 and M3-highspeed versions. MiniMax plans to open-source M3 on HuggingFace and GitHub, while early pricing offers a 50% discount for the first seven days.

MiniMax M3 turns long-context AI into an agent platform test
Image source: Pandaily

MiniMax has introduced M3, a new flagship AI model that puts long-context reasoning, agentic workflows, coding performance and multimodal processing into one product line.

The Shanghai-based company, also known as Shanghai Hixi Technology, launched the model on June 1, 2026.

What happened

M3 is being positioned as a domestic AI model built for demanding agent and coding tasks.

Its API supports a context window of up to 1 million tokens, with at least 512K tokens guaranteed, and the model is trained as natively multimodal rather than as a text-only system with later visual additions.

MiniMax says the model uses its proprietary Sparse Attention architecture.

The company also says it rebuilt its data pipeline, expanded pre-training data to hundreds of terabytes, and worked on alignment between text and visual semantic spaces.

The launch includes two API versions: M3 and M3-highspeed.

Both are described as producing identical results, while M3-highspeed is designed for faster inference.

Automatic caching is enabled by default.

Why it matters

The announcement is a signal that Chinese AI model developers are moving beyond headline benchmark competition and into agent infrastructure.

A model with a million-token window could matter for developers building long-running coding sessions, research assistants, document-heavy workflows or video-understanding tools.

The key commercial question is whether these capabilities translate into reliable enterprise use.

Long context, tool use and autonomous task execution may reduce friction for teams experimenting with AI agents, but adoption would depend on stability, cost and the quality of downstream applications.

Performance signals

MiniMax highlighted several benchmark and demonstration results.

In BrowseComp, M3 scored 83.5, compared with 79.3 for OpenAI Opus 4.7.

In one autonomous experiment, M3 spent nearly 12 hours reproducing an ICLR 2025 outstanding paper on LLM fine-tuning dynamics, producing 18 commits and 23 experimental charts.

The company also tested M3 as an AI research assistant.

In that task, the model was given four pre-trained base models and asked to carry out data synthesis, training, evaluation and iteration within 12 hours without human intervention.

M3 scored 37.1, behind Opus 4.7 at 42.4 and GPT-5.5 at 39.3.

What to watch next

MiniMax plans to open-source M3 on HuggingFace and GitHub, with support for private cluster deployment and fine-tuning.

That could make the model more relevant to teams that want more control over infrastructure and customization.

Pricing will also shape market response.

For the first seven days, MiniMax is offering a 50% discount for M3 API usage at contexts up to 512K tokens, with input, output and cache-read pricing set across standard and priority tiers.

Readers should watch whether developers treat M3 as a practical agent platform rather than only a benchmark announcement.

Share this article
inXf

Related articles

More
Tencent Takes WorkBuddy AI Agent Global In Enterprise Productivity Push
AI

Tencent Takes WorkBuddy AI Agent Global In Enterprise Productivity Push

Tencent Cloud launched WorkBuddy for overseas users after an earlier China rollout. The agent can run tasks through messaging apps and connect with GitHub, Jira, Google Drive, Gmail, Notion, and Slack. Miora and TokenHub show Tencent building a wider enterprise AI stack around agents, creative work, and model access.

Anthropic’s Conway Points Claude Toward Always-On AI Agents
AI

Anthropic’s Conway Points Claude Toward Always-On AI Agents

Anthropic is preparing a Claude expansion that includes Conway, Orbit, Operon, memory upgrades and multilingual voice mode. The move signals a shift from chat-based AI toward persistent assistants that can connect with external services and manage workspaces. Enterprises, developers and research teams could be affected if Claude becomes a broader agent platform.

Grep Adds LLM Agent To Monito As Online Proctoring Shifts Toward Context Review
AI

Grep Adds LLM Agent To Monito As Online Proctoring Shifts Toward Context Review

Grep said its Monito online proctoring product now uses an LLM agent to analyze context around suspected cheating events. The company cited internal tests showing more than 30 percent shorter post-exam review time and nearly 20 percent fewer false alerts. The key issue is whether agent-based proctoring can improve review efficiency while preserving human final judgment and candidate fairness.

Korea Privacy Regulator Reviews Naver AI Tab Search Agent
AI

Korea Privacy Regulator Reviews Naver AI Tab Search Agent

South Korea’s privacy regulator approved the result of a prior adequacy review for Naver’s AI Tab search agent. AI Tab will provide conversational search answers and may use user activity, age group, gender and interests for personalization. The case offers an early Korean reference point for privacy controls around consumer AI agents.

Keep Reading

More Stories

Latest
Mercari Moves AI Leadership Into HR as It Tests AI-Native WorkflowsAIJun 1, 2026Mercari Moves AI Leadership Into HR as It Tests AI-Native WorkflowsMercari's Japan business CTO Toshiya Kimura is becoming CHRO and CAIO as the company links AI adoption with organizational redesign. The company has tested smaller AI Pods and found faster decisions, but also limits around design, compliance and cross-functional work. Mercari plans to make HR itself AI-first while using governance across legal, privacy, security, public policy and AI expertise.Nvidia and Foxconn Push Agentic AI Into Taiwan HospitalsAIJun 1, 2026Nvidia and Foxconn Push Agentic AI Into Taiwan HospitalsNvidia and Foxconn are working with Taiwanese medical centers on agentic AI systems for clinical and hospital operations. The effort is tied to Healthy Taiwan and a USD 1.5 billion sovereign AI healthcare investment. CoDoctor, CoDoClaw, Scrub Bot and Nurabot show healthcare AI moving toward multi-agent and physical AI workflows.Starlink Business Opportunity Narrows Around Coverage Gaps and Backup ConnectivityTelco & ConnectivityJun 1, 2026Starlink Business Opportunity Narrows Around Coverage Gaps and Backup ConnectivityStarlink strongest business role appears to be coverage gaps and backup connectivity. Recon Analytics found 72% of large businesses would consider ISP coverage extended with Starlink. Carrier offers from T-Mobile and Comcast Business may scale the model but keep Starlink in an infrastructure role.Japan’s AI Suitcase Turns Assistive Mobility Into a Robotics Test CaseAIJun 1, 2026Japan’s AI Suitcase Turns Assistive Mobility Into a Robotics Test CaseCAAMP, a consortium that includes university research institutes and IBM Japan, has developed the AI Suitcase to guide visually impaired users with sensors, cameras, motors and AI. The suitcase is being tested at locations including Miraikan, New Chitose Airport and Tokyo’s Nihonbashi district, where 39 visually impaired participants completed a monthlong indoor trial without collisions. An updated indoor-and-outdoor model is planned for Expo 2025 in Osaka, with CAAMP aiming to collect feedback from 2,000 to 3,000 people without visual disabilities.SoftBank’s €75 Billion France Plan Signals Europe’s AI Infrastructure RaceCloud & Data CentersJun 1, 2026SoftBank’s €75 Billion France Plan Signals Europe’s AI Infrastructure RaceSoftBank plans to invest up to €75 billion in AI data centres in France, with initial sites expected to come online in five years. The first phase includes €45 billion for 3.1 GW of capacity in Hauts-de-France by 2031, including locations in Dunkirk’s Loon-Plage, Bosquel and Bouchain. The plan could strengthen Europe’s AI infrastructure base, but questions remain over financing, regulatory approvals and what technological sovereignty means when a Japanese group leads the buildout.Huawei's Tau scaling puts architecture at the center of China's AI-chip pushChips & SemiconductorsJun 1, 2026Huawei's Tau scaling puts architecture at the center of China's AI-chip pushHuawei proposed Tau scaling, a framework focused on shorter signal paths rather than only smaller transistors. LogicFolding is planned for Kirin chips in fall and winter 2026, with Huawei claiming a density jump to 238 million transistors per square millimeter. The test is whether architecture-led gains can be validated in commercial chips amid export-control limits on advanced tools.FuriosaAI and Broadcom Target the Next Layer of AI Inference InfrastructureChips & SemiconductorsJun 1, 2026FuriosaAI and Broadcom Target the Next Layer of AI Inference InfrastructureFuriosaAI said it will work with Broadcom on a next-generation AI inference platform built around its TCP architecture and Broadcom networking and packaging technologies. The planned third-generation accelerator will use a 2-nanometer compute die, HBM4/HBM4E memory and multi-die packaging, with sampling planned for the first half of 2028. The deal points to AI infrastructure competition shifting from single-chip performance toward memory, networking, power efficiency and rack-level system design.AIVEX Brings Physical AI Into Korean Battery-Plant Packaging WorkAIJun 1, 2026AIVEX Brings Physical AI Into Korean Battery-Plant Packaging WorkAIVEX said its AIbot platform automated a crucible packaging-removal process at a leading Korean battery-materials company. The system combines AI vision, 3D optics, 6D pose estimation and automatic path planning to handle irregular ropes and wrapping film. The deployment points to physical AI moving into factory tasks that are repetitive but too variable for simple fixed automation.Dubai RTA Digital Revenue Shows Smart Mobility Moving Into Daily UseEconomyJun 1, 2026Dubai RTA Digital Revenue Shows Smart Mobility Moving Into Daily UseRTA said digital-channel revenue reached AED5.3 billion in 2025, up 20.6 percent from 2024. The authority reported more than 628 million digital transactions and 96 percent digital adoption. The next test is whether AI, WhatsApp and government-platform integrations make mobility services simpler.Nvidia Pushes AI PC Strategy With RTX Spark Superchip for Windows DevicesChips & SemiconductorsJun 1, 2026Nvidia Pushes AI PC Strategy With RTX Spark Superchip for Windows DevicesNvidia is entering the Windows PC market with the RTX Spark Superchip for laptops and desktops expected this autumn. The chip combines a microprocessor and graphics processor, was built with help from MediaTek, and will run Microsoft Windows for Arm. The key test is whether AI-ready PCs can deliver useful local model, creative software and gaming features without software or battery-life trade-offs.Om AI Bets on Edge Multimodal Models as China AI Startups Move Toward DeploymentAIJun 1, 2026Om AI Bets on Edge Multimodal Models as China AI Startups Move Toward DeploymentOm AI Technology is focusing on compact edge-side multimodal vision models for PCs, cameras, robots and other devices rather than very large cloud models. At BEYOND Expo 2026, the company showed OttoBox AI Studio, a local-AI content tool for video analysis, asset matching, script generation and fast production. The next test is whether its VLX edge multimodal model can improve video understanding and decision-making while keeping operating costs lower.AT&T and Comcast Frame AI as the Next Telecom Network TestTelco & ConnectivityJun 1, 2026AT&T and Comcast Frame AI as the Next Telecom Network TestAT&T and Comcast described AI as a network workload that changes traffic direction, operations and edge infrastructure. AT&T says AI handles about 700000 daily network changes, while Comcast points to 200 edge compute centers and automated fault handling. The commercial test is whether carriers can turn AI-enabled network capabilities into services households and small businesses understand.