Highlights
- Pro
Lists (15)
Sort Name ascending (A-Z)
Stars
A paper list for spatial reasoning
A course on aligning smol models.
Simultaneous speech-to-text model
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
Top papers related to LLM-based agent evaluation
A full-featured, hackable Next.js AI chatbot built by Vercel
MedRAX: Medical Reasoning Agent for Chest X-ray - ICML 2025
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
SEED-Voken: A Series of Powerful Visual Tokenizers
High-performance Image Tokenizers for VAR and AR
CodonTransformer (1M+ Downloads); The tool for codon optimization, optimizing DNA for protein expression
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
A Gradio app that transcribes YouTube videos using audio extraction and OpenAIโs Whisper model.
Machine Learning Engineering Open Book
๐ค MLE-Agent: Your intelligent companion for seamless AI engineering and research. ๐ Integrate with arxiv and paper with code to provide better code/research plans ๐งฐ OpenAI, Anthropic, Gemini, Ollamโฆ
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery ๐งโ๐ฌ
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use thโฆ
โฉ Ship faster with Continuous AI. Open-source CLI that can be used in Headless mode to run async cloud agents or TUI mode as an in sync coding agent
llama3 implementation one matrix multiplication at a time
๐ค ๐๐ฒ๐ฎ๐ฟ๐ป for ๐ณ๐ฟ๐ฒ๐ฒ how to ๐ฏ๐๐ถ๐น๐ฑ an end-to-end ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐๐ถ๐ผ๐ป-๐ฟ๐ฒ๐ฎ๐ฑ๐ ๐๐๐ & ๐ฅ๐๐ ๐๐๐๐๐ฒ๐บ using ๐๐๐ ๐ข๐ฝ๐ best practices: ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค๐ฐ๐ฅ๐ฆ + 12 ๐ฉ๐ข๐ฏ๐ฅ๐ด-๐ฐ๐ฏ ๐ญ๐ฆ๐ด๐ด๐ฐ๐ฏ๐ด
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Open source real-time translation app for Android that runs locally