SendTech Times
REGULATION WATCH:

South Korea Opens 25 High-Value Public Datasets for AI Companies

Article summary

South Korea will release 25 AI and high-value public datasets through the public data portal by December. The datasets were selected from more than 3,280 candidate projects identified through company visits and public-demand surveys. The program targets AI use cases in energy, culture, infrastructure safety, legal-risk checking and agricultural diagnosis.

Image source: 인공지능신문

What happened

South Korea's Ministry of the Interior and Safety has finalized detailed plans to open 25 datasets this year from its AI and high-value public data Top 100 program.

The ministry said the datasets will be released sequentially through the public data portal by December, with the goal of supporting domestic AI companies and new industries.

The selection came from more than 3,280 candidate projects identified through visits to about 800 companies and an online public-demand survey.

External experts reviewed the candidates based on economic impact, links to national policy tasks and AI suitability.

The government plans to open about 100 high-value datasets by 2028.

It opened 10 datasets in 2025 and plans to open 25 in 2026, 30 in 2027 and 35 in 2028, for a cumulative total of 100.

Why it matters

For AI developers, access to structured, lawful and domain-specific data can be as important as model choice.

The new releases focus on four areas tied to commercial and public-service use cases: new industries, K-culture, disaster and safety, and AI training data.

Examples include renewable-energy technical potential data from the Korea Institute of Energy Research, cultural AI training data from the Korea Culture Information Service, special-bridge inspection and management data from the Korea Authority of Land and Infrastructure Safety, Fair Trade Commission decision data structured for AI learning, and crop disease and pest diagnosis data from the Rural Development Administration.

The program also shows how South Korea is trying to balance AI training demand with privacy.

Some family, youth and transport-worker qualification data will be released as synthetic data, designed to preserve useful structure and distribution without exposing personal information.
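
The ministry has not said how its synthetic data is generated. As a rough illustration of the general idea only, the Python sketch below builds synthetic rows by sampling each column independently from the values observed in a small set of records. The field names, the sample values and the `synthesize` helper are all hypothetical and are not drawn from the program.

```python
import random
from collections import Counter


def synthesize(records, n, seed=0):
    """Draw n synthetic rows by sampling each column independently
    from its observed values (an illustrative approach, not the
    ministry's method)."""
    rng = random.Random(seed)
    columns = list(records[0].keys())
    # One pool per column; repeated values keep the observed frequencies.
    pools = {col: [row[col] for row in records] for col in columns}
    return [{col: rng.choice(pools[col]) for col in columns} for _ in range(n)]


# Hypothetical toy records; field names and values are invented.
real = [
    {"age_band": "20s", "license_type": "bus"},
    {"age_band": "30s", "license_type": "taxi"},
    {"age_band": "30s", "license_type": "freight"},
    {"age_band": "40s", "license_type": "taxi"},
]

synthetic = synthesize(real, n=1000)
# Per-column frequencies should roughly track the original records.
print(Counter(row["age_band"] for row in synthetic))
```

This simple approach keeps each column's distribution but discards relationships between columns, and a synthetic row can still match a real one by chance. Real synthetic-data releases would normally model those relationships and add explicit privacy checks.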

Who is affected

AI startups and service developers are the main target group.

The ministry expects the data to support business-model development in energy project analysis, cultural content, infrastructure maintenance, legal-risk checking, unfair-trade queries and agricultural diagnosis.

The opening could also affect public agencies that hold data but need to make it more AI-friendly.

The ministry said it will strengthen demand surveys for training data and shift public-data management toward an AI-friendly system.

What to watch next

The near-term test is whether the 25 datasets are released on schedule by December and whether their formats are usable for model training, retrieval systems and commercial applications.

Data quality, metadata, licensing terms and update cadence will determine how useful the releases are in practice.

Readers should also watch whether Korean AI companies can turn these public datasets into deployed services rather than experiments.

If adoption follows, the program could become part of South Korea's effort to support a stronger domestic AI ecosystem without relying only on private datasets.
