Starred repositories
Robust Speech Recognition via Large-Scale Weak Supervision
The official Python SDK for Model Context Protocol servers and clients
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team
An orchestration platform for the development, production, and observation of data assets.
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
GenAI Agent Framework, the Pydantic way
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
🐍 Geometric Computer Vision Library for Spatial AI
Hackable and optimized Transformers building blocks, supporting a composable construction.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Deep universal probabilistic programming with Python and PyTorch
ImageBind: One Embedding Space to Bind Them All
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
A Python framework for accelerated simulation, data generation and spatial computing.
A PyTorch Library for Accelerating 3D Deep Learning Research
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
A Python parametric CAD scripting framework based on OCCT
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Recommend new arXiv papers of interest daily according to your Zotero library.
SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
A procedural Blender pipeline for photorealistic training image generation