Tensors and Dynamic neural networks in Python with strong GPU acceleration
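A minimal sketch of what this tagline means in practice, assuming the `torch` package is installed: tensors are created on a GPU when one is available (falling back to the CPU otherwise), and gradients for "dynamic" networks are recorded on the fly by autograd.

```python
import torch

# Pick a CUDA device when one is available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Elementwise tensor math on the chosen device.
a = torch.ones(2, 3, device=device)
b = torch.full((2, 3), 2.0, device=device)
c = a * b  # each element is 2.0, so c.sum() is 12.0

# "Dynamic neural networks": the computation graph is built as code runs,
# so gradients can be taken through ordinary Python control flow.
x = torch.tensor([3.0], requires_grad=True, device=device)
y = (x ** 2).sum()
y.backward()  # dy/dx = 2x, so x.grad is [6.0]

print(float(c.sum()), float(x.grad[0]))
```

The same code runs unchanged on CPU and GPU; only the `device` chosen at the top differs.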
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Open deep learning compiler stack for CPU, GPU, and specialized accelerators
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access and manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
A Python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
A flexible framework of neural networks for deep learning
A Python framework for accelerated simulation, data generation and spatial computing.
High-performance TensorFlow library for quantitative finance.
Time series forecasting with PyTorch
📊 A simple command-line utility for querying and monitoring GPU status
On-device AI across mobile, embedded and edge for PyTorch
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.