Robust Speech Recognition via Large-Scale Weak Supervision
Contexts Optical Compression
In-App assistant SDK to build a multimodal conversational UX websites
The no-nonsense RAG chunking library
Video understanding codebase from FAIR for reproducing video models
A fast, powerful, and simple hierarchical vision transformer
Beautiful, fast and modern React UI library.
Kubernetes Native Edge Computing Framework (project under CNCF)
UI Automation Framework for Games and Apps
Foundational Models for State-of-the-Art Speech and Text Translation
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
A library to generate LaTeX expression from Python code
Refer and Ground Anything Anywhere at Any Granularity
Code release for Cut and Learn for Unsupervised Object Detection
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
ExDARK dataset is the largest collection of low-light images
Resources, corpora, and tools for Chinese natural language processing
fast C++ library for linear algebra & scientific computing
The free computer aided translation (CAT) tool for professionals
Download, save and convert multiple subtitles from YouTube videos
Air traffic control tower and radar simulator (solo + multi-player)
A lightweight personal finance app hosted by yourself.
The open-source digital workplace for growing teams and enterprises.