NVIDIA logo

NVIDIA

✓ Verified

GPU maker and AI model provider, including the Nemotron series.

At a glance

Followers
0
Total models
8 (8 active)
Releases tracked
6
Largest context
262K
Flagship model
262K context
Most affordable
$0.04/1M / 1M input tokens
Average pricing
$0.28/1M input · $0.5016666666666667/1M output
per 1M tokens, across all models
NVIDIA supportsimagetextvideo

Overview

NVIDIA is the dominant supplier of AI training and inference hardware globally. Its CUDA software stack and H100/Blackwell GPUs are the de-facto standard for frontier-model training, giving NVIDIA an effective monopoly that briefly made it the most valuable company in the world (>US$3.5 trillion market cap, 2024–2025).

AI software contributions

While best known for hardware, NVIDIA has been an active publisher of AI models and software:

  • Megatron-LM — large-scale transformer training framework
  • Nemotron family — open-weights LLMs (Nemotron-4 340B, etc.)
  • NeMo — speech and LLM toolkit
  • TensorRT-LLM — optimised inference runtime
  • Omniverse — generative AI for 3D worlds
  • NIM — packaged inference microservices

Hardware roadmap

  • A100 (2020) — Ampere generation
  • H100 (2022) — Hopper, the workhorse of the LLM training era
  • H200 (2024) — memory-upgraded H100
  • Blackwell B200/GB200 (2024–2025) — current generation, ~2× perf vs H100
  • Rubin (announced 2025) — successor architecture, sampling 2026

Customer base

Every major AI lab — OpenAI, Anthropic, Google, Meta, xAI, Mistral, DeepSeek — runs primarily or partially on NVIDIA hardware.

Latest news from NVIDIA

8 most recent · auto-synced from RSS
  1. ·GeForce NOW Community

    No Need for Space Gear — Capcom’s ‘PRAGMATA’ Joins GeForce NOW on Launch Day

    Head straight for orbit with GeForce NOW — no space helmet required. PRAGMATA, Capcom’s long-awaited sci-fi action adventure, touches down on GeForce NOW the same day it launches worldwide. The futuristic journey through a cold lunar station in the near future can be streamed instantly from the cloud to almost any device, no console or […]

  2. ·Shruti Koparkar

    Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters

    Traditional data centers only stored, retrieved and processed data. In the generative and agentic AI era, these facilities have evolved into AI token factories. With AI inference becoming their primary workload, their primary output is intelligence manufactured in the form of tokens. This transformation demands a corresponding shift in how the economics of AI infrastructure, […]

  3. ·Joel Pennington

    New Adobe Premiere Color Grading Mode Accelerated on NVIDIA GPUs

    The NAB Show 2026 trade show, running April 18-22 in Las Vegas, is set to showcase a wave of new features and optimizations for top video editing applications. Bringing together over 60,000 content professionals from across the broadcast and media and entertainment industries, the event highlights how video editors, livestreamers and professional creators are exploring […]

  4. ·NVIDIA Writers

    National Robotics Week — Latest Physical AI Research, Breakthroughs and Resources

    This National Robotics Week, NVIDIA is highlighting the breakthroughs that are bringing AI into the physical world — as well as the growing wave of robots transforming industries, from agricultural and manufacturing to energy and beyond. Advancements in robot learning, simulation and foundation models are accelerating development, enabling robots to move from training in virtual […]

  5. ·GeForce NOW Community

    Strength and Destiny Collide: ‘Samson: A Tyndalston Story’ Arrives in the Cloud

    A timeless story of grit, faith and rebellion takes center stage as Samson: A Tyndalston Story joins the GeForce NOW library today. The highly anticipated release from Liquid Swords can now be streamed on nearly any device with GeForce NOW bringing cinematic intensity and mythic storytelling to the cloud. Catch it as part of four […]

  6. ·Michael Fukuyama

    From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI

    Open models are driving a new wave of on-device AI, extending innovation beyond the cloud to everyday devices. As these models advance, their value increasingly depends on access to local, real-time context that can turn meaningful insights into action. Designed for this shift, Google’s latest additions to the Gemma 4 family introduce a class of small, fast and omni-capable models built for efficient local execution across a wide range […]

  7. ·GeForce NOW Community

    Press Start on April: GeForce NOW Brings 10 Games to the Cloud

    No joke — GFN Thursday is skipping the tricks and heading straight into the games. April kicks off with ten new titles, bringing fresh adventures to GeForce NOW, including the launch of Capcom’s highly anticipated PRAGMATA. A dozen new games are available to stream this week, including Arknights: Endfield, which expands the acclaimed series into a full […]

  8. ·Vladimir Troy

    Efficiency at Scale: NVIDIA, Energy Leaders Accelerating Power‑Flexible AI Factories to Fortify the Grid

    CERAWeek — dubbed the Davos of energy — is where policymakers, producers, technologists and financiers gather to discuss how the world powers itself next. NVIDIA and Emerald AI unveiled at the conference last week a new way forward — treating AI factories not as static power loads but as flexible, intelligent grid assets. This collaboration […]

Latest activity

Full history →
  1. 0

    NVIDIA: Llama 3.1 Nemotron 70B Instruct released

    NVIDIA: Llama 3.1 Nemotron 70B Instruct discovered via OpenRouter. Provider: NVIDIA.

    NVIDIA: Llama 3.1 Nemotron 70B Instruct
  2. 0

    NVIDIA: Nemotron Nano 9B V2 released

    NVIDIA: Nemotron Nano 9B V2 discovered via OpenRouter. Provider: NVIDIA.

    NVIDIA: Nemotron Nano 9B V2
  3. 0

    NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 released

    NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 discovered via OpenRouter. Provider: NVIDIA.

    NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
  4. 0

    NVIDIA: Nemotron Nano 12B 2 VL released

    NVIDIA: Nemotron Nano 12B 2 VL discovered via OpenRouter. Provider: NVIDIA.

    NVIDIA: Nemotron Nano 12B 2 VL
  5. 0

    NVIDIA: Nemotron 3 Nano 30B A3B released

    NVIDIA: Nemotron 3 Nano 30B A3B discovered via OpenRouter. Provider: NVIDIA.

    NVIDIA: Nemotron 3 Nano 30B A3B
  6. 0

    NVIDIA: Nemotron 3 Super released

    NVIDIA: Nemotron 3 Super discovered via OpenRouter. Provider: NVIDIA.

    NVIDIA: Nemotron 3 Super

Releases timeline

Showing 8 most recent
  1. NVIDIA: Nemotron 3 Super

    NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

  2. NVIDIA: Nemotron 3 Nano 30B A3B

    NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

  3. NVIDIA: Nemotron Nano 12B 2 VL

    NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

  4. NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

    Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

  5. NVIDIA: Nemotron Nano 9B V2

    NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

  6. Llama 3.3 Nemotron Super 49B

    49B parameter efficient model with frontier reasoning capability from NVIDIA.

  7. NVIDIA: Llama 3.1 Nemotron 70B Instruct

    NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

  8. Llama 3.1 Nemotron 70B

    NVIDIA-tuned Llama 3.1 70B with state-of-the-art alignment and helpfulness.

Active models

Community ratings

No ratings yet. Be the first to rate NVIDIA.

Rate NVIDIA

Sign in to rate and review.

Comments

Sign in to leave a comment.