NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
-
Updated
Dec 11, 2025 - C++
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
CUDA Core Compute Libraries
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
A high performance anime upscaler
stdgpu: Efficient STL-like Data Structures on the GPU
Cross Platform Professional Procedural Terrain Generation & Texturing Tool
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
Deep learning toolkit-enabled VLSI placement
Node-based image editor with GPU-acceleration.
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Vulkan compute for people
Vahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production
Open-Source CUDA/OpenCL Speed Of Light Ray-tracer
Fast Neural Machine Translation in C++ - development repository
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
An efficient, extensible occupancy map supporting probabilistic occupancy, normal distribution transforms in CPU and GPU.
Xplace 3.0: An Extremely Fast, Extensible and Deterministic Placement Framework with Detailed-Routability and Timing Optimization
The Next-Gen Database for AI—an infrastructure designed for data and AI. As the MySQL of the AI era.
SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) systems. It is developed as part of the U.S. Department of Energy Exascale Computing Project (ECP).
Add a description, image, and links to the gpu-acceleration topic page so that developers can more easily learn about it.
To associate your repository with the gpu-acceleration topic, visit your repo's landing page and select "manage topics."