NVIDIA
✓ VerifiedGPU maker and AI model provider, including the Nemotron series.
At a glance
Overview
NVIDIA is the dominant supplier of AI training and inference hardware globally. Its CUDA software stack and H100/Blackwell GPUs are the de-facto standard for frontier-model training, giving NVIDIA an effective monopoly that briefly made it the most valuable company in the world (>US$3.5 trillion market cap, 2024–2025).
AI software contributions
While best known for hardware, NVIDIA has been an active publisher of AI models and software:
- Megatron-LM — large-scale transformer training framework
- Nemotron family — open-weights LLMs (Nemotron-4 340B, etc.)
- NeMo — speech and LLM toolkit
- TensorRT-LLM — optimised inference runtime
- Omniverse — generative AI for 3D worlds
- NIM — packaged inference microservices
Hardware roadmap
- A100 (2020) — Ampere generation
- H100 (2022) — Hopper, the workhorse of the LLM training era
- H200 (2024) — memory-upgraded H100
- Blackwell B200/GB200 (2024–2025) — current generation, ~2× perf vs H100
- Rubin (announced 2025) — successor architecture, sampling 2026
Customer base
Every major AI lab — OpenAI, Anthropic, Google, Meta, xAI, Mistral, DeepSeek — runs primarily or partially on NVIDIA hardware.
Latest news from NVIDIA
8 most recent · auto-synced from RSS- ·GeForce NOW Community
No Need for Space Gear — Capcom’s ‘PRAGMATA’ Joins GeForce NOW on Launch Day
Head straight for orbit with GeForce NOW — no space helmet required. PRAGMATA, Capcom’s long-awaited sci-fi action adventure, touches down on GeForce NOW the same day it launches worldwide. The futuristic journey through a cold lunar station in the near future can be streamed instantly from the cloud to almost any device, no console or […]
- ·Shruti Koparkar
Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters
Traditional data centers only stored, retrieved and processed data. In the generative and agentic AI era, these facilities have evolved into AI token factories. With AI inference becoming their primary workload, their primary output is intelligence manufactured in the form of tokens. This transformation demands a corresponding shift in how the economics of AI infrastructure, […]
- ·Joel Pennington
New Adobe Premiere Color Grading Mode Accelerated on NVIDIA GPUs
The NAB Show 2026 trade show, running April 18-22 in Las Vegas, is set to showcase a wave of new features and optimizations for top video editing applications. Bringing together over 60,000 content professionals from across the broadcast and media and entertainment industries, the event highlights how video editors, livestreamers and professional creators are exploring […]
- ·NVIDIA Writers
National Robotics Week — Latest Physical AI Research, Breakthroughs and Resources
This National Robotics Week, NVIDIA is highlighting the breakthroughs that are bringing AI into the physical world — as well as the growing wave of robots transforming industries, from agricultural and manufacturing to energy and beyond. Advancements in robot learning, simulation and foundation models are accelerating development, enabling robots to move from training in virtual […]
- ·GeForce NOW Community
Strength and Destiny Collide: ‘Samson: A Tyndalston Story’ Arrives in the Cloud
A timeless story of grit, faith and rebellion takes center stage as Samson: A Tyndalston Story joins the GeForce NOW library today. The highly anticipated release from Liquid Swords can now be streamed on nearly any device with GeForce NOW bringing cinematic intensity and mythic storytelling to the cloud. Catch it as part of four […]
- ·Michael Fukuyama
From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI
Open models are driving a new wave of on-device AI, extending innovation beyond the cloud to everyday devices. As these models advance, their value increasingly depends on access to local, real-time context that can turn meaningful insights into action. Designed for this shift, Google’s latest additions to the Gemma 4 family introduce a class of small, fast and omni-capable models built for efficient local execution across a wide range […]
- ·GeForce NOW Community
Press Start on April: GeForce NOW Brings 10 Games to the Cloud
No joke — GFN Thursday is skipping the tricks and heading straight into the games. April kicks off with ten new titles, bringing fresh adventures to GeForce NOW, including the launch of Capcom’s highly anticipated PRAGMATA. A dozen new games are available to stream this week, including Arknights: Endfield, which expands the acclaimed series into a full […]
- ·Vladimir Troy
Efficiency at Scale: NVIDIA, Energy Leaders Accelerating Power‑Flexible AI Factories to Fortify the Grid
CERAWeek — dubbed the Davos of energy — is where policymakers, producers, technologists and financiers gather to discuss how the world powers itself next. NVIDIA and Emerald AI unveiled at the conference last week a new way forward — treating AI factories not as static power loads but as flexible, intelligent grid assets. This collaboration […]
Latest activity
Full history →- 0
NVIDIA: Llama 3.1 Nemotron 70B Instruct released
NVIDIA: Llama 3.1 Nemotron 70B Instruct discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Llama 3.1 Nemotron 70B Instruct - 0
NVIDIA: Nemotron Nano 9B V2 released
NVIDIA: Nemotron Nano 9B V2 discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron Nano 9B V2 - 0
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 released
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 - 0
NVIDIA: Nemotron Nano 12B 2 VL released
NVIDIA: Nemotron Nano 12B 2 VL discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron Nano 12B 2 VL - 0
NVIDIA: Nemotron 3 Nano 30B A3B released
NVIDIA: Nemotron 3 Nano 30B A3B discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron 3 Nano 30B A3B - 0
NVIDIA: Nemotron 3 Super released
NVIDIA: Nemotron 3 Super discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron 3 Super
Releases timeline
Showing 8 most recent- NVIDIA: Nemotron 3 Super
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
- NVIDIA: Nemotron 3 Nano 30B A3B
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
- NVIDIA: Nemotron Nano 12B 2 VL
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
- NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
- NVIDIA: Nemotron Nano 9B V2
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
- Llama 3.3 Nemotron Super 49B
49B parameter efficient model with frontier reasoning capability from NVIDIA.
- NVIDIA: Llama 3.1 Nemotron 70B Instruct
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...
- Llama 3.1 Nemotron 70B
NVIDIA-tuned Llama 3.1 70B with state-of-the-art alignment and helpfulness.
Active models
- Llama 3.1 Nemotron 70BActive
NVIDIA-tuned Llama 3.1 70B with state-of-the-art alignment and helpfulness.
128K ctx - Llama 3.3 Nemotron Super 49BActive
49B parameter efficient model with frontier reasoning capability from NVIDIA.
128K ctx - NVIDIA: Llama 3.1 Nemotron 70B InstructActive
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...
131K ctx - NVIDIA: Llama 3.3 Nemotron Super 49B V1.5Active
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
131K ctx - NVIDIA: Nemotron 3 Nano 30B A3BActive
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
262K ctx - NVIDIA: Nemotron 3 SuperActive
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
262K ctx - NVIDIA: Nemotron Nano 12B 2 VLActive
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
131K ctx - NVIDIA: Nemotron Nano 9B V2Active
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
131K ctx
Community ratings
Rate NVIDIA
Sign in to rate and review.
Comments
Sign in to leave a comment.