Stars
Unofficial fairseq-free PyTorch implementation of UTMOS (v1, 2022), matching the original system.
Documentation sites for Python packages: start simple, go deep
The world's most sophisticated street level image geolocation software
Tooling for exact and MinHash deduplication of large-scale text datasets
New generation of CLIP with strong fine grained discrimination capability, ICML2026 and ICML2025
Synthetic data annotation for retrieval evaluations by ZeroEntropy
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
Implementation of papers in 100 lines of code.
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Fully Open Framework for Democratized Multimodal Training
A Conversational Speech Generation Model
Minimal and annotated implementations of key ideas from modern deep learning research.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Fixed version of `torchrun` on Jülich Supercomputing Centre
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Muon is an optimizer for hidden layers in neural networks
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'