MiniMax M3 turns long-context AI into an agent platform test

By SendTech Times AI & Enterprise Desk | Newsroom-edited, source-reviewed coverage | Source: Pandaily

Newsroom brief

MiniMax launched M3 on June 1, 2026, combining long-context, agentic, coding and native multimodal capabilities in one model line. The API supports up to 1 million tokens of context, with a guaranteed minimum of 512K tokens, and includes M3 and M3-highspeed versions. MiniMax plans to open-source M3 on HuggingFace and GitHub, while early pricing offers a 50% discount for the first seven days.

Verified against source material | Edited by SendTech Times AI & Enterprise Desk

Image source: Pandaily

MiniMax has introduced M3, a new flagship AI model that puts long-context reasoning, agentic workflows, coding performance and multimodal processing into one product line.

The Shanghai-based company, also known as Shanghai Hixi Technology, launched the model on June 1, 2026.

What happened

MiniMax is positioning M3 as a Chinese-developed AI model built for demanding agent and coding tasks.

Its API supports a context window of up to 1 million tokens, with at least 512K tokens guaranteed, and the model is trained as natively multimodal rather than as a text-only system with later visual additions.
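The two context figures imply a simple planning rule for developers: requests that fit within the guaranteed 512K tokens should always be accepted, while larger requests up to 1 million tokens depend on what the API can offer at the time. A minimal sketch of that check follows. The function name and tier labels are hypothetical, the decimal reading of "512K" and "1 million" is an assumption, and so is the idea that reserved output tokens count against the same window; the article does not specify any of these.

```python
# Assumption: "512K" and "1 million" are read as decimal token counts.
# The article does not say whether MiniMax uses binary units (e.g. 524,288).
GUARANTEED_TOKENS = 512_000
MAX_TOKENS = 1_000_000


def classify_request(prompt_tokens: int, reserved_output_tokens: int = 0) -> str:
    """Return a hypothetical tier label for a request of the given size.

    Assumption: prompt and reserved output tokens share one context window.
    """
    if prompt_tokens < 0 or reserved_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = prompt_tokens + reserved_output_tokens
    if total <= GUARANTEED_TOKENS:
        return "guaranteed"      # fits inside the guaranteed minimum
    if total <= MAX_TOKENS:
        return "best_effort"     # above the guarantee, within the stated maximum
    return "too_large"           # exceeds the stated maximum


if __name__ == "__main__":
    for size in (400_000, 600_000, 1_200_000):
        print(size, classify_request(size))
```

A team planning a long-running agent session could use a check like this to decide when to summarize or trim history before a request leaves the guaranteed range.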

MiniMax says the model uses its proprietary Sparse Attention architecture.

The company also says it rebuilt its data pipeline, expanded pre-training data to hundreds of terabytes, and worked on alignment between text and visual semantic spaces.

The launch includes two API versions: M3 and M3-highspeed.

Both are described as producing identical results, while M3-highspeed is designed for faster inference.

Automatic caching is enabled by default.

Why it matters

The announcement signals that Chinese AI model developers are moving beyond headline benchmark competition and into agent infrastructure.

A model with a million-token window could matter for developers building long-running coding sessions, research assistants, document-heavy workflows or video-understanding tools.

The key commercial question is whether these capabilities translate into reliable enterprise use.

Long context, tool use and autonomous task execution may reduce friction for teams experimenting with AI agents, but adoption would depend on stability, cost and the quality of downstream applications.

Performance signals

MiniMax highlighted several benchmark and demonstration results.

In BrowseComp, M3 scored 83.5, compared with 79.3 for Anthropic's Opus 4.7.

In one autonomous experiment, M3 spent nearly 12 hours reproducing an ICLR 2025 outstanding paper on LLM fine-tuning dynamics, producing 18 commits and 23 experimental charts.

The company also tested M3 as an AI research assistant.

In that task, the model was given four pre-trained base models and asked to carry out data synthesis, training, evaluation and iteration within 12 hours without human intervention.

M3 scored 37.1, behind Opus 4.7 at 42.4 and GPT-5.5 at 39.3.

What to watch next

MiniMax plans to open-source M3 on HuggingFace and GitHub, with support for private cluster deployment and fine-tuning.

That could make the model more relevant to teams that want more control over infrastructure and customization.

Pricing will also shape market response.

For the first seven days, MiniMax is offering a 50% discount for M3 API usage at contexts up to 512K tokens, with input, output and cache-read pricing set across standard and priority tiers.
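The launch discount can be expressed as a small cost estimate. The sketch below is illustrative only: the article gives no per-token rates, so the `Rates` fields are placeholders the caller must supply, and the rule that "context" means input plus cache-read tokens is an assumption. Only the 50% figure and the 512K threshold come from the article.

```python
from dataclasses import dataclass

DISCOUNT_CONTEXT_LIMIT = 512_000  # assumption: decimal reading of "512K"
LAUNCH_DISCOUNT = 0.5             # 50% off during the first seven days


@dataclass(frozen=True)
class Rates:
    """Hypothetical list prices per million tokens; not published figures."""
    input_per_million: float
    output_per_million: float
    cache_read_per_million: float


def estimate_cost(rates: Rates, input_tokens: int, output_tokens: int,
                  cache_read_tokens: int, in_launch_window: bool) -> float:
    """Estimate the cost of one request, applying the launch discount if eligible."""
    list_cost = (
        input_tokens * rates.input_per_million
        + output_tokens * rates.output_per_million
        + cache_read_tokens * rates.cache_read_per_million
    ) / 1_000_000
    # Assumption: the 512K threshold is measured on input plus cached tokens.
    context_tokens = input_tokens + cache_read_tokens
    if in_launch_window and context_tokens <= DISCOUNT_CONTEXT_LIMIT:
        return list_cost * (1 - LAUNCH_DISCOUNT)
    return list_cost
```

With real rates substituted for the placeholders, the same function shows how much of a workload stays eligible for the discount once requests grow past 512K tokens.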

Readers should watch whether developers treat M3 as a practical agent platform rather than only a benchmark announcement.
