Stars
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Efficient vision foundation models for high-resolution generation and perception.
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
Tiny http server engine written in Swift programming language.
ImageBind One Embedding Space to Bind Them All
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Implementation of cats-vs-dogs based on CNN.
Technique supports and discussions for the Neural Network app.
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
Convert JSON annotations into YOLO format.
Cross-platform, customizable ML solutions for live and streaming media.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Awesome work on hand pose estimation/tracking
Datasets, Transforms and Models specific to Computer Vision