Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 742 135

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 415 67

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.8k 1.6k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.8k 240

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 4.1k 488

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.8k 990

Repositories

Showing 10 of 679 repositories
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 2,119 Apache-2.0 286 74 111 Updated Mar 9, 2026
  • cccl Public

    CUDA Core Compute Libraries

    NVIDIA/cccl’s past year of commit activity
  • NVSentinel Public

    NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

    NVIDIA/NVSentinel’s past year of commit activity
    Go 199 Apache-2.0 52 43 24 Updated Mar 9, 2026
  • bare-metal-manager-core Public

    NVIDIA Bare Metal Manager - Hardware Lifecycle Management and multitenant networking

    NVIDIA/bare-metal-manager-core’s past year of commit activity
    Rust 86 Apache-2.0 58 72 (3 issues need help) 30 Updated Mar 9, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 13,035 2,154 539 570 Updated Mar 9, 2026
  • KAI-Scheduler Public

    KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

    NVIDIA/KAI-Scheduler’s past year of commit activity
    Go 1,172 Apache-2.0 161 28 (1 issue needs help) 94 Updated Mar 9, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 15,553 3,661 303 (1 issue needs help) 311 Updated Mar 9, 2026
  • k8s-launch-kit Public

    K8s Launch Kit (l8k) is a CLI tool for deploying and managing NVIDIA cloud-native solutions on Kubernetes. The tool helps provide flexible deployment workflows for optimal network performance with SR-IOV, RDMA, and other networking technologies.

    NVIDIA/k8s-launch-kit’s past year of commit activity
    Go 8 Apache-2.0 3 0 0 Updated Mar 9, 2026
  • TileGym Public

    Helpful kernel tutorials and examples for tile-based GPU programming

    NVIDIA/TileGym’s past year of commit activity
    Python 665 51 2 5 Updated Mar 9, 2026
  • nvmesh-upgrader Public

    NVMesh by NVIDIA provides remote shared storage facilities with in-server flash performance characteristics while using commodity off-the-shelf components.

    NVIDIA/nvmesh-upgrader’s past year of commit activity
    Python 5 0 0 0 Updated Mar 9, 2026