NVIDIA

✓ Verified

GPU maker and AI model provider, including the Nemotron series.

At a glance

Followers

Total models

8 (8 active)

Releases tracked

Largest context

262K

Flagship model

NVIDIA: Nemotron 3 Super

262K context

Most affordable

NVIDIA: Nemotron Nano 9B V2

$0.04/1M / 1M input tokens

Average pricing

$0.28/1M input · $0.5016666666666667/1M output

per 1M tokens, across all models

NVIDIA supportsimagetextvideo

Overview

NVIDIA is the dominant supplier of AI training and inference hardware globally. Its CUDA software stack and H100/Blackwell GPUs are the de-facto standard for frontier-model training, giving NVIDIA an effective monopoly that briefly made it the most valuable company in the world (>US$3.5 trillion market cap, 2024–2025).

AI software contributions

While best known for hardware, NVIDIA has been an active publisher of AI models and software:

Megatron-LM — large-scale transformer training framework
Nemotron family — open-weights LLMs (Nemotron-4 340B, etc.)
NeMo — speech and LLM toolkit
TensorRT-LLM — optimised inference runtime
Omniverse — generative AI for 3D worlds
NIM — packaged inference microservices

Hardware roadmap

A100 (2020) — Ampere generation
H100 (2022) — Hopper, the workhorse of the LLM training era
H200 (2024) — memory-upgraded H100
Blackwell B200/GB200 (2024–2025) — current generation, ~2× perf vs H100
Rubin (announced 2025) — successor architecture, sampling 2026

Customer base

Every major AI lab — OpenAI, Anthropic, Google, Meta, xAI, Mistral, DeepSeek — runs primarily or partially on NVIDIA hardware.

Latest news from NVIDIA

8 most recent · auto-synced from RSS

Apr 16, 2026·GeForce NOW Community
No Need for Space Gear — Capcom’s ‘PRAGMATA’ Joins GeForce NOW on Launch Day
Head straight for orbit with GeForce NOW — no space helmet required. PRAGMATA, Capcom’s long-awaited sci-fi action adventure, touches down on GeForce NOW the same day it launches worldwide. The futuristic journey through a cold lunar station in the near future can be streamed instantly from the cloud to almost any device, no console or […]
Apr 15, 2026·Shruti Koparkar
Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters
Traditional data centers only stored, retrieved and processed data. In the generative and agentic AI era, these facilities have evolved into AI token factories. With AI inference becoming their primary workload, their primary output is intelligence manufactured in the form of tokens. This transformation demands a corresponding shift in how the economics of AI infrastructure, […]
Apr 15, 2026·Joel Pennington
New Adobe Premiere Color Grading Mode Accelerated on NVIDIA GPUs
The NAB Show 2026 trade show, running April 18-22 in Las Vegas, is set to showcase a wave of new features and optimizations for top video editing applications. Bringing together over 60,000 content professionals from across the broadcast and media and entertainment industries, the event highlights how video editors, livestreamers and professional creators are exploring […]
Apr 10, 2026·NVIDIA Writers
National Robotics Week — Latest Physical AI Research, Breakthroughs and Resources
This National Robotics Week, NVIDIA is highlighting the breakthroughs that are bringing AI into the physical world — as well as the growing wave of robots transforming industries, from agricultural and manufacturing to energy and beyond. Advancements in robot learning, simulation and foundation models are accelerating development, enabling robots to move from training in virtual […]
Apr 9, 2026·GeForce NOW Community
Strength and Destiny Collide: ‘Samson: A Tyndalston Story’ Arrives in the Cloud
A timeless story of grit, faith and rebellion takes center stage as Samson: A Tyndalston Story joins the GeForce NOW library today. The highly anticipated release from Liquid Swords can now be streamed on nearly any device with GeForce NOW bringing cinematic intensity and mythic storytelling to the cloud. Catch it as part of four […]
Apr 2, 2026·Michael Fukuyama
From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI
Open models are driving a new wave of on-device AI, extending innovation beyond the cloud to everyday devices. As these models advance, their value increasingly depends on access to local, real-time context that can turn meaningful insights into action. Designed for this shift, Google’s latest additions to the Gemma 4 family introduce a class of small, fast and omni-capable models built for efficient local execution across a wide range […]
Apr 2, 2026·GeForce NOW Community
Press Start on April: GeForce NOW Brings 10 Games to the Cloud
No joke — GFN Thursday is skipping the tricks and heading straight into the games. April kicks off with ten new titles, bringing fresh adventures to GeForce NOW, including the launch of Capcom’s highly anticipated PRAGMATA. A dozen new games are available to stream this week, including Arknights: Endfield, which expands the acclaimed series into a full […]
Mar 31, 2026·Vladimir Troy
Efficiency at Scale: NVIDIA, Energy Leaders Accelerating Power‑Flexible AI Factories to Fortify the Grid
CERAWeek — dubbed the Davos of energy — is where policymakers, producers, technologists and financiers gather to discuss how the world powers itself next. NVIDIA and Emerald AI unveiled at the conference last week a new way forward — treating AI factories not as static power loads but as flexible, intelligent grid assets. This collaboration […]

Latest activity

Full history →

018 apr. 2026
NVIDIA: Llama 3.1 Nemotron 70B Instruct released
NVIDIA: Llama 3.1 Nemotron 70B Instruct discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Llama 3.1 Nemotron 70B Instruct
018 apr. 2026
NVIDIA: Nemotron Nano 9B V2 released
NVIDIA: Nemotron Nano 9B V2 discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron Nano 9B V2
018 apr. 2026
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 released
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
018 apr. 2026
NVIDIA: Nemotron Nano 12B 2 VL released
NVIDIA: Nemotron Nano 12B 2 VL discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron Nano 12B 2 VL
018 apr. 2026
NVIDIA: Nemotron 3 Nano 30B A3B released
NVIDIA: Nemotron 3 Nano 30B A3B discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron 3 Nano 30B A3B
018 apr. 2026
NVIDIA: Nemotron 3 Super released
NVIDIA: Nemotron 3 Super discovered via OpenRouter. Provider: NVIDIA.
NVIDIA: Nemotron 3 Super

Releases timeline

Showing 8 most recent

Mar 11, 2026NVIDIA: Nemotron 3 Super
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
Dec 14, 2025NVIDIA: Nemotron 3 Nano 30B A3B
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
Oct 28, 2025NVIDIA: Nemotron Nano 12B 2 VL
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
Oct 10, 2025NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
Sep 5, 2025NVIDIA: Nemotron Nano 9B V2
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
Mar 12, 2025Llama 3.3 Nemotron Super 49B
49B parameter efficient model with frontier reasoning capability from NVIDIA.
Oct 15, 2024NVIDIA: Llama 3.1 Nemotron 70B Instruct
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...
Oct 3, 2024Llama 3.1 Nemotron 70B
NVIDIA-tuned Llama 3.1 70B with state-of-the-art alignment and helpfulness.

NVIDIA

At a glance

Overview

AI software contributions

Hardware roadmap

Customer base

Latest news from NVIDIA

No Need for Space Gear — Capcom’s ‘PRAGMATA’ Joins GeForce NOW on Launch Day

Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters

New Adobe Premiere Color Grading Mode Accelerated on NVIDIA GPUs

National Robotics Week — Latest Physical AI Research, Breakthroughs and Resources

Strength and Destiny Collide: ‘Samson: A Tyndalston Story’ Arrives in the Cloud

From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI

Press Start on April: GeForce NOW Brings 10 Games to the Cloud

Efficiency at Scale: NVIDIA, Energy Leaders Accelerating Power‑Flexible AI Factories to Fortify the Grid

Latest activity

NVIDIA: Llama 3.1 Nemotron 70B Instruct released

NVIDIA: Nemotron Nano 9B V2 released

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 released

NVIDIA: Nemotron Nano 12B 2 VL released

NVIDIA: Nemotron 3 Nano 30B A3B released

NVIDIA: Nemotron 3 Super released

Releases timeline

Active models

Community ratings

Rate NVIDIA

Comments