-
AMD
- Richland, WA
- in/jeffrey-daily-87854a86
-
llm.c Public
Forked from karpathy/llm.cLLM training in simple, raw C/CUDA
Cuda MIT License UpdatedJun 12, 2026 -
moat Public
MOAT: a Claude-driven effort to port popular CUDA GitHub projects to ROCm/HIP across AMD targets (Linux gfx90a, gfx1100, Windows gfx1151)
-
sppark Public
Forked from supranational/spparkZero-knowledge template library
Cuda Apache License 2.0 UpdatedJun 12, 2026 -
FBGEMM Public
Forked from pytorch/FBGEMMFB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
C++ Other UpdatedJun 12, 2026 -
trt-shim-rocm Public
TensorRT-API-compatible inference shim for AMD ROCm, backed by MIGraphX
C++ Apache License 2.0 UpdatedJun 12, 2026 -
plvs Public
Forked from luigifreda/plvsPLVS is a real-time SLAM system with points, lines, volumetric mapping and 3D unsupervised incremental segmentation.
C++ GNU General Public License v3.0 UpdatedJun 12, 2026 -
nvdiffrast Public
Forked from NVlabs/nvdiffrastNvdiffrast - Modular Primitives for High-Performance Differentiable Rendering
C++ Other UpdatedJun 12, 2026 -
kaldi Public
Forked from kaldi-asr/kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Shell Other UpdatedJun 12, 2026 -
Monte Carlo eXtreme (MCX) - Physically accurate and validated GPU ray-tracer
Pascal Other UpdatedJun 12, 2026 -
kaldifeat Public
Forked from csukuangfj/kaldifeatKaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
C++ Other UpdatedJun 12, 2026 -
YarnBall Public
Forked from jerry060599/YarnBallA massively parallel high performance GPU Cosserat Rods Simulator
C++ GNU General Public License v3.0 UpdatedJun 12, 2026 -
CubbyFlow Public
Forked from utilForever/CubbyFlowVoxel-based fluid simulation engine for computer games
C++ MIT License UpdatedJun 12, 2026 -
DDN-SLAM Public
Forked from DrLi-Ming/DDN-SLAMDDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM (RA-L 2025)
Cuda GNU General Public License v3.0 UpdatedJun 12, 2026 -
instant-ngp-kf Public
Forked from MarvinChung/instant-ngp-kfInstant neural graphics primitives: lightning fast NeRF and more
Cuda Other UpdatedJun 12, 2026 -
tiny-rocm-nn Public
Forked from PhysicalAI-AIM/tiny-rocm-nnrocm based mlp tiny network based on tiny-cuda-nn
C++ Other UpdatedJun 12, 2026 -
Pointcept Public
Forked from Pointcept/PointceptPointcept: Perceive the world with sparse points, a codebase for point cloud perception research. Latest works: Utonia (ICML'26), Concerto (NeurIPS'25), Sonata (CVPR'25 Highlight), PTv3 (CVPR'24 Oral)
Python MIT License UpdatedJun 12, 2026 -
cuPDLP-C Public
Forked from COPT-Public/cuPDLP-CCode for solving LP on GPU using first-order methods
C MIT License UpdatedJun 12, 2026 -
GooFit Public
Forked from GooFit/GooFitCode repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP
Cuda Other UpdatedJun 12, 2026 -
KittenGpuLBVH Public
Forked from jerry060599/KittenGpuLBVHA high performance and friendly GPU LBVH implementation.
Cuda MIT License UpdatedJun 11, 2026 -
gtsam_points Public
Forked from koide3/gtsam_pointsA collection of GTSAM factors and optimizers for point cloud SLAM
C++ MIT License UpdatedJun 11, 2026 -
CudaSift Public
Forked from Celebrandil/CudaSiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Cuda MIT License UpdatedJun 11, 2026 -
unified-cache-management Public
Forked from ModelEngine-Group/unified-cache-managementPersist and reuse KV Cache to speedup your LLM.
Python MIT License UpdatedJun 11, 2026 -
-
CuRast Public
Forked from m-schuetz/CuRastCuda-Based Software Rasterization for Billions of Triangles
C++ Other UpdatedJun 11, 2026 -
CUDA-ScanMatcher-ICP Public
Forked from botforge/CUDA-ScanMatcher-ICPA high performance CUDA implementation of Scan Matching via the Iterative Closest Point Algorithm
Cuda MIT License UpdatedJun 11, 2026 -
llm-awq Public
Forked from mit-han-lab/llm-awq[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python MIT License UpdatedJun 11, 2026 -
mahout Public
Forked from apache/mahoutApache Mahout - an environment for quickly creating scalable, performant machine learning applications.
Rust Apache License 2.0 UpdatedJun 11, 2026 -
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Cuda Apache License 2.0 UpdatedJun 11, 2026 -
opencv Public
Forked from opencv/opencvOpen Source Computer Vision Library
C++ Apache License 2.0 UpdatedJun 11, 2026 -
opencv_contrib Public
Forked from opencv/opencv_contribRepository for OpenCV's extra modules
C++ Apache License 2.0 UpdatedJun 11, 2026