A high-throughput and memory-efficient inference and serving engine for LLMs
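For orientation, a minimal offline-inference sketch against vLLM's Python API (the model id here is just a placeholder):

```python
# Minimal offline-inference sketch; the model id is a placeholder.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")          # any HF-compatible model id
params = SamplingParams(temperature=0.8, max_tokens=64)

for out in llm.generate(["What does paged attention do?"], params):
    print(out.outputs[0].text)
```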
SGLang is a fast serving framework for large language models and vision language models.
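A minimal sketch of SGLang's frontend DSL, assuming a server has already been launched locally on the default port:

```python
# Minimal frontend-DSL sketch; assumes a server is already running locally,
# e.g. launched with: python -m sglang.launch_server --model-path <model>
import sglang as sgl

@sgl.function
def qa(s, question):
    s += sgl.user(question)
    s += sgl.assistant(sgl.gen("answer", max_tokens=64))

sgl.set_default_backend(sgl.RuntimeEndpoint("http://localhost:30000"))
state = qa.run(question="What is RadixAttention?")
print(state["answer"])
```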
TensorRT-LLM provides an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations for efficient inference on NVIDIA GPUs. It also contains components to create Python and C++ runtimes that orchestrate inference execution in a performant way.
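A minimal sketch of the high-level Python `LLM` API (the model id is a placeholder; the TensorRT engine is compiled on first use):

```python
# Minimal sketch of the high-level Python LLM API; the model id is a
# placeholder, and the TensorRT engine is compiled on first use.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
params = SamplingParams(max_tokens=64)

for out in llm.generate(["Explain in-flight batching."], params):
    print(out.outputs[0].text)
```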
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere
QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning
Prebuilt DeepSpeed wheels for Windows with NVIDIA GPU support. Supports GTX 10 through RTX 50 series. Compiled with PyTorch 2.7/2.8 and CUDA 12.8.
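After installing one of the wheels, a quick sanity check that the advertised PyTorch/CUDA combination is actually in place:

```python
# Post-install sanity check: confirm the advertised PyTorch/CUDA combination.
import torch
import deepspeed

print("torch", torch.__version__, "built for CUDA", torch.version.cuda)  # expect 2.7/2.8 + 12.8
print("gpu  ", torch.cuda.get_device_name(0), torch.cuda.get_device_capability(0))
print("deepspeed", deepspeed.__version__)
```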
RTX 5090 & RTX 5060 Docker container with PyTorch + TensorFlow. First fully-tested Blackwell GPU support for ML/AI. CUDA 12.8, Python 3.11, Ubuntu 24.04. Works with RTX 50-series (5090/5080/5070/5060) and RTX 40-series.
One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)
📦 A fully automated method for installing NVIDIA drivers on Arch Linux
Repository for the Campbells-Luggs-Blackwells family history website
GEN3C: Generative Novel 3D Captions - Adapted for NVIDIA Blackwell GPU architecture (sm_120). Includes automatic GPU detection, CPU-based T5 text encoding for Blackwell compatibility, and full backward compatibility with older GPUs.
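An illustrative sketch of the detection-and-routing idea described above (the function name is hypothetical, not the repository's actual code):

```python
# Illustrative sketch of the routing idea, not the repository's actual code:
# keep the T5 text encoder on CPU when a Blackwell (sm_120+) GPU is detected.
import torch

def text_encoder_device() -> torch.device:
    if not torch.cuda.is_available():
        return torch.device("cpu")
    major, _ = torch.cuda.get_device_capability(0)
    # Blackwell consumer parts report compute capability 12.x; older GPUs
    # keep the encoder on-device, preserving backward compatibility.
    return torch.device("cpu") if major >= 12 else torch.device("cuda")
```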