NVIDIA
Copyright © 2026 NVIDIA Corporation

118 results
NVIDIA
Downloadable

nemotron-3-nano-omni-30b-a3b-reasoning

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, and text.
Model
Image-to-Text
62.92K
3d
NVIDIA
Downloadable

NVIDIA AI for Media Relighting

Re-illuminate people in video to match target lighting from a 360 HDRI environment map.
Model
HDRI
335
2w
NVIDIA
Free Endpoint

nemotron-3-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.
Model
llm safety
29.12K
2w
NVIDIA
Downloadable, Free Endpoint

synthetic-video-detector

NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI‑generated (synthetic) videos.
Model
broadcast
20.16K
2w
NVIDIA
Downloadable, Free Endpoint

Active Speaker Detection

Detect and track speaker identities across video frames.
Model
localization
796
2w
NVIDIA
Downloadable

LipSync

Generative lip dubbing that syncs lips in a video to input audio.
Model
lipsync
2w
NVIDIA
Downloadable

ising-calibration-1-35b-a3b

Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
Model
Quantum
114K
2w
NVIDIA
Downloadable

llama-nemotron-rerank-vl-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
Model
nemo retriever
15.08K
1mo
NVIDIA
Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat
Model
English
2.96K
1mo
NVIDIA
Downloadable

nemotron-asr-streaming

Real-time speech recognition for English
Model
Automatic Speech Recognition
19.41K
1mo
NVIDIA
Downloadable

nemotron-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Model
Table Extraction
1.71M
1mo
NVIDIA
Downloadable

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
Model
MoE
42.51M
1mo
NVIDIA
Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
Model
nemo retriever
190K
1mo
NVIDIA
Downloadable

nemotron-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
15.92K
1mo
NVIDIA
Downloadable

nemotron-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
31.5K
1mo
NVIDIA
Downloadable

nemotron-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
17.96K
1mo
NVIDIA
Downloadable

llama-nemotron-embed-1b-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Model
Text-to-Embedding
2.01M
1mo
NVIDIA
Free Endpoint

gliner-pii

GLiNER PII detects Personally Identifiable Information in text.
Model
PII Detection
124K
1mo
NVIDIA
Free Endpoint

cosmos-transfer2.5-2b

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Model
Synthetic Data Generation
2mo
NVIDIA
Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
6.97M
2mo
NVIDIA
Free Endpoint

nemotron-content-safety-reasoning-4b

A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
Model
NeMo Guardrails
248K
3mo
NVIDIA
Downloadable

cosmos-reason2-8b

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Model
video understanding
275K
4mo
NVIDIA
Downloadable

nemoretriever-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
13.89K
4mo
NVIDIA
Downloadable

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
Model
MoE
9.28M
4mo
NVIDIA
Free Endpoint

riva-translate-4b-instruct-v1_1

Translation model for 12 languages with support for few-shot example prompts.
Model
nvidia nim
193K
4mo
NVIDIA
Free Endpoint

streampetr

StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.
Model
autonomous vehicles
11.96K
5mo
NVIDIA
Downloadable

nemotron-parse

Cutting-edge vision-language model excelling in retrieving text and metadata from images.
Model
text and table extraction
202K
6mo
NVIDIA
Downloadable

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
Model
language generation
4.8M
6mo
NVIDIA
Free Endpoint

llama-3.1-nemotron-safety-guard-8b-v3

Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
Model
content moderation
115K
6mo
NVIDIA
Downloadable

parakeet-ctc-0.6b-zh-tw

Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
Model
ASR
182
6mo
NVIDIA
Downloadable

llama-3_2-nemoretriever-300m-embed-v2

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Model
Text-to-Embedding
86
7mo
NVIDIA
Downloadable

parakeet-ctc-0.6b-zh-cn

Record-setting accuracy and performance for Mandarin-English transcriptions.
Model
ASR
183
7mo
NVIDIA
Downloadable

parakeet-ctc-0.6b-es

Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.
Model
ASR
81
7mo
NVIDIA
Downloadable

parakeet-ctc-0.6b-vi

Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
Model
ASR
66
7mo
NVIDIA
Downloadable

nvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
Model
thinking budget
377K
8mo
NVIDIA
Downloadable

nemoretriever-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Model
Table Extraction
1.67M
8mo
NVIDIA
Downloadable

parakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps
Model
ASR
2.53K
9mo
NVIDIA
Downloadable

llama-3.3-nemotron-super-49b-v1.5

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Model
math
2.34M
9mo
NVIDIA
Free Endpoint

llama-3_2-nemoretriever-300m-embed-v1

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Model
Text-to-Embedding
313K
9mo
NVIDIA
Downloadable

nemoretriever-ocr

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
Model
Table Extraction
8.48K
9mo
NVIDIA
Downloadable

riva-translate-1.6b

Enable smooth global interactions in 36 languages.
Model
Neural machine translation
27.24K
10mo
NVIDIA
Downloadable

llama-3.2-nemoretriever-500m-rerank-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
Model
nemo retriever
10.71K
10mo
NVIDIA
Free Endpoint

cosmos-transfer1-7b

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Model
Synthetic Data Generation
169
10mo
NVIDIA
Downloadable

Background Noise Removal

Removes unwanted noise from audio, improving speech intelligibility.
Model
Nvidia Maxine
465
10mo
NVIDIA
Downloadable

llama-3.1-nemotron-nano-vl-8b-v1

Multi-modal vision-language model that understands text and images and creates informative responses
Model
doc intelligence
6.93M
10mo
NVIDIA
Free Endpoint

magpie-tts-zeroshot

Expressive and engaging text-to-speech, generated from a short audio sample.
Model
TTS
1.82K
10mo
NVIDIA
Downloadable

parakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages
Model
Automatic Speech Recognition
12.1K
1y
NVIDIA
Free Endpoint

cosmos-predict1-5b

Generates future frames of a physics-aware world state from just an image or short video prompt for physical AI development.
Model
Synthetic Data Generation
472
1y
NVIDIA
Free Endpoint

sparsedrive

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.
Model
autonomous vehicles
56
9mo
NVIDIA
Free Endpoint

bevformer

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.
Model
autonomous vehicles
103
9mo
NVIDIA
Downloadable

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Model
math
1.58M
9mo
NVIDIA
Downloadable

llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.
Model
math
984K
10mo
NVIDIA
Downloadable

magpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
Model
TTS
44.73K
10mo
NVIDIA
Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
Model
nemo retriever
118K
11mo
NVIDIA
Downloadable

nemoretriever-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
27.85K
1y
NVIDIA
Downloadable

nemoretriever-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
5.04K
1y
NVIDIA
Downloadable

nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
75.29K
1y
NVIDIA
Downloadable

nemoretriever-parse

Cutting-edge vision-language model excelling in retrieving text and metadata from images.
Model
optical character recognition
114K
10mo
NVIDIA
Downloadable

canary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.
Model
Automatic Speech Recognition
3.78K
1y
NVIDIA
Downloadable

llama-3.1-nemoguard-8b-topic-control

Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
Model
nemo guardrails
130K
1y
NVIDIA
Downloadable

nemoguard-jailbreak-detect

Industry-leading jailbreak classification model for protection from adversarial attempts
Model
nemo guardrails
34.57K
10mo
NVIDIA
Downloadable

llama-3.1-nemoguard-8b-content-safety

Leading content safety model for enhancing the safety and moderation capabilities of LLMs
Model
nemo guardrails
130K
1y
NVIDIA
Downloadable

genmol

Fragment-Based Molecular Generation by Discrete Diffusion.
Model
Chemistry
6.43K
9mo
NVIDIA
Downloadable

llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
Model
nemo retriever
2.4M
9mo
NVIDIA
Downloadable

llama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
Model
nemo retriever
96.93K
9mo
NVIDIA
Free Endpoint

usdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
Model
Digital Twin
9mo
NVIDIA
Downloadable

nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model
Object Detection
1.2K
9mo
NVIDIA
Downloadable

conformer-ctc-asr

Automatic speech recognition model that transcribes speech in lowercase Spanish with record-setting accuracy and performance
Model
ASR
23
1y
NVIDIA
Downloadable

fourcastnet

FourCastNet predicts global atmospheric dynamics of various weather / climate variables.
Model
Weather Simulation
1.74K
1y
NVIDIA
Downloadable, Free Endpoint

studiovoice

Enhance input speech recorded with low-quality microphones in noisy or reverberant environments, producing studio-quality speech.
Model
communications
2.73K
10mo
NVIDIA
Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
Model
Chat
291K
1y
NVIDIA
Downloadable

megatron-1b-nmt

Enable smooth global interactions in 36 languages.
Model
Neural machine translation
2
1y
NVIDIA
Downloadable

parakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.
Model
ASR
61.45K
10mo
NVIDIA
Downloadable

parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.
Model
ASR
1.25K
10mo
NVIDIA
Deprecation in 15d, Downloadable

eyecontact

Estimate gaze angles of a person in a video and redirect to make it frontal.
Model
telepresence
878
1y
NVIDIA
Free Endpoint

usdvalidate

Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.
Model
Validation
512
1y
NVIDIA
Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.
Model
Embedding
9.22M
9mo
NVIDIA
Downloadable

nvclip

NV-CLIP is a multimodal embeddings model for image and text.
Model
Computer vision
48.78K
10mo
NVIDIA
Free Endpoint

nv-embed-v1

Generates high-quality numerical embeddings from text inputs.
Model
Non-Commercial Use Only
3.39M
9mo
NVIDIA
Free Endpoint

rerank-qa-mistral-4b

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
Model
Ranking
308K
1y
NVIDIA
Downloadable

vista-3d

VISTA-3D is a specialized interactive foundation model for segmenting and annotating human anatomies.
Model
Interactive Annotation
743
1y
NVIDIA
Downloadable

molmim

MolMIM performs controlled generation, finding molecules with the right properties.
Model
Chemistry
160K
9mo
NVIDIA
Downloadable

cuopt

World-record accuracy and performance for complex route optimization.
Model
Route Optimization
55.26K
11mo