# Nemotron 3 Nano Omni unifies vision, audio, and language in a single architecture with 30B parameters but only 3B active per inference

*semiconductor · news · 2026-04-28 · TNW*

## Key points

- Nvidia's Nemotron 3 Nano Omni unifies vision, audio, and language into a single 30B-parameter model.
- The model uses only 3B active parameters per inference, enabling single-GPU edge deployment.
- Nvidia claims Nemotron 3 Nano Omni delivers 9x the throughput and 2.9x faster reasoning than comparable open multimodal models.
- It is the first open-weight multimodal model combining mixture-of-experts, unified architecture, and audio capabilities.
- It is released under Nvidia's Open Model Agreement, which allows full commercial use, and it runs on multiple frameworks.
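
The 30B-total versus 3B-active split can be made concrete with some rough arithmetic. The sketch below is illustrative only: the parameter counts come from the points above, while the bytes-per-parameter values and the assumption that all expert weights stay resident in memory are hypothetical choices, not figures from Nvidia or TNW. It also ignores activations and any cache memory.

```python
# Back-of-the-envelope arithmetic for a sparse mixture-of-experts model.
# Parameter counts are taken from the summary; the precisions below are
# illustrative assumptions, not published deployment settings.

TOTAL_PARAMS = 30e9   # total parameters (from the summary)
ACTIVE_PARAMS = 3e9   # parameters active per inference (from the summary)


def active_fraction(total: float, active: float) -> float:
    """Share of the model's parameters used for a single inference."""
    return active / total


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Approximate memory for the weights alone, in gigabytes (1e9 bytes)."""
    return params * bytes_per_param / 1e9


# Hypothetical storage precisions: 16-bit, 8-bit, and 4-bit weights.
ASSUMED_BYTES_PER_PARAM = {"16-bit": 2.0, "8-bit": 1.0, "4-bit": 0.5}

print(f"active share: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.0%}")
for label, width in ASSUMED_BYTES_PER_PARAM.items():
    size = weight_memory_gb(TOTAL_PARAMS, width)
    print(f"{label} weights: about {size:.0f} GB")
```

Under these assumptions, roughly a tenth of the parameters do work on any one inference, which is where the compute savings come from. Weight storage still scales with the full 30B (about 60 GB at 16-bit, 30 GB at 8-bit, 15 GB at 4-bit), so whether the model fits on one GPU depends on the precision used and the memory available.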

**Companies:** Nvidia, Amazon, Foxconn, Palantir, Dell, DocuSign, Infosys, Oracle, Aible, ASI, Eka Care, H Company
**Countries:** United States, Taiwan

[Read the full story on TNW](https://thenextweb.com/news/nvidia-nemotron-nano-omni-multimodal-agent-edge)

---

Canonical: https://newsio.io/n/9ae296fc-7d4f-43fd-9bb1-ef08bd1dbc24/nemotron-3-nano-omni-unifies-vision-audio-and-language-in-a-single-architecture
Summarized by Newsio from TNW. https://newsio.io/how-it-works
