Stars
PointNet and PointNet++ implemented by pytorch (pure python) and on ModelNet, ShapeNet and S3DIS.
A high-throughput and memory-efficient inference and serving engine for LLMs
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
A ctc decoder for both online and offline asr model
Lightweight coding agent that runs in your terminal
Model Context Protocol(MCP) 编程极速入门
[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors
OCR, layout analysis, reading order, table recognition in 90+ languages
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
The reinforcement learning training code for AgiBot X1.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Implementation of Alphafold 3 from Google Deepmind in Pytorch
Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
[CVPR 2024] Code for SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes
open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.
Algorithm to texture 3D reconstructions from multi-view stereo images
Atlas: End-to-End 3D Scene Reconstruction from Posed Images
Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral
Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' CVPR 2024
Official Pytorch Implementation of SPECTRE: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos