Nota Runs VLA Robotics Model in Real Time on Qualcomm Edge AI Hardware

BySendTech Times AI & Enterprise DeskNewsroom-edited, source-reviewed coverage|Source: Newstheai

Newsroom brief

Nota demonstrated real-time operation of a vision-language-action robotics model on Qualcomm Dragonwing edge AI hardware. The company reduced the model action-head processing time from 218 milliseconds to 31 milliseconds while keeping task success nearly unchanged. The demo points to a path for physical AI systems that can run closer to robots rather than relying mainly on GPU servers or cloud infrastructure.

Verified against source materialEdited by SendTech Times AI & Enterprise Desk

Nota Runs VLA Robotics Model in Real Time on Qualcomm Edge AI Hardware

Image source: 더에이아이

Nota runs a VLA robotics model in real time on Qualcomm edge AI hardware

South Korean AI optimization company Nota says it has demonstrated real-time operation of a vision-language-action model on an edge AI device, showing how physical AI workloads could move closer to robots and autonomous systems.

The company presented the work at Embedded Vision Summit 2026 in Santa Clara, California.

The demo used Qualcomm's Dragonwing IQ-9075 edge AI device to run SmolVLA 0.45B, a robotics VLA model that connects visual perception, language understanding and action generation.

VLA models are important for physical AI because they allow a system to see its environment, interpret natural-language instructions and turn those inputs into movement.

But that combined workload is computationally heavy, and many VLA deployments still depend on GPU servers or cloud infrastructure rather than embedded devices.

Selective optimization instead of full compression

Nota said its approach did not compress the entire model.

It kept the visual perception and language understanding stages intact and focused optimization on the action head, the part that generates robot movement commands.

The company used two techniques: real-time inference optimization to reduce repeated computation, and NPU-based graph optimization to redesign the computation flow for Qualcomm's edge AI execution environment.

The result was a sharp reduction in action-head processing time from 218 milliseconds to 31 milliseconds.

That is an 85.8% decrease and up to a sevenfold speed improvement.

Total inference time also fell from 505 milliseconds to 310 milliseconds, while the task success rate moved only slightly from 86% to 85%.

Why it matters for physical AI

At the summit, visitors selected items and the optimized VLA model recognized them in real time, then directed a robot arm to place them in a basket.

Nota said the demonstration was not a replayed video or fixed scenario; the AI made decisions on site according to each visitor's choice.

The result supports Nota's argument that industrial physical AI will need models that run quickly and reliably on edge devices.

The company links the work to demand across on-device AI, data centers and physical AI.

Nota reported first-quarter 2026 consolidated revenue of 3.58 billion won, compared with about 67 million won a year earlier.

Its order backlog stood at about 12.1 billion won at the end of the quarter.

Chief executive Chae Myung-soo said the VLA optimization shows that Nota's technology can become a core foundation for physical AI adoption in industrial settings.

#Nota #Qualcomm Dragonwing #VLA #physical AI

Om AI Bets on Edge Multimodal Models as China AI Startups Move Toward Deployment

Om AI Technology is focusing on compact edge-side multimodal vision models for PCs, cameras, robots and other devices rather than very large cloud models. At BEYOND Expo 2026, the company showed OttoBox AI Studio, a local-AI content tool for video analysis, asset matching, script generation and fast production. The next test is whether its VLX edge multimodal model can improve video understanding and decision-making while keeping operating costs lower.

Chips & Semiconductors

Nvidia's RTX Spark Turns AI PCs Into the Next Chip Battleground

Nvidia is entering the AI PC market with RTX Spark, a MediaTek-linked SoC that combines Blackwell GPU technology with a CPU on a single chip. The move shifts Nvidia's AI strategy closer to edge devices, where agentic AI could run locally instead of relying only on cloud infrastructure. Analysts cited in the source said the PC opportunity is still small compared with Nvidia's data center and networking businesses.

Chips & Semiconductors

Onsemi Synaptics Deal Adds Edge AI Compute With Approval Still Pending

Onsemi is buying Synaptics in an all-stock deal valued at about $7 billion to add edge AI compute, wireless connectivity and human-machine interface assets. The transaction is expected to close in mid-2027 if regulators approve it, but customer deployments and integration milestones remain undisclosed.

Apple AI Architecture Puts Google And Nvidia Inside Its Privacy Test

Apple is using Google and Nvidia to support its most advanced cloud AI model while trying to keep Apple Intelligence centered on private orchestration, proprietary models and on-device context.

liko.ai Funding Turns Edge AI Into a Smart-Home Hardware Test

liko.ai completed its first-round financing to fund edge-side vision-language models, AI-native hardware and multi-modal home terminals. The investor group includes Shangtang Guoxiang Capital, Orient Fortune Capital, iFlytek Venture Capital, Hongtai Fund, Zhengxuan Investment and Mianbi Intelligence. The practical question is whether the startup can turn camera-based edge AI into a consumer smart-home hub without relying on cloud processing.

Google Tests Local AI Demand With Gemma 4 12B Release

Google released Gemma 4 12B as an open-weights multimodal AI model designed to run locally on a standard enterprise laptop. The model is described as an 11.95-billion-parameter system with an Apache 2.0 license, 16GB memory target, 256K context window and immediate availability through Google AI Edge Gallery. The practical question is whether enterprises use local multimodal inference when cloud access, latency or data handling are constraints.