Stars
🌋LavaSR: Fast Speech restoration and enhancement
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Trade autonomously on Polymarket using AI Agents
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
[ICLR 2026] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
bloom - evaluate any behavior immediately 🌸🌱
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
A react native android sms retriever api library with Nitro module support for new architecture, old architecture, Nitro module, and Expo compatibility
A drag-and-drop library that finally works on React Native
A free, open source, and extensible speech-to-text application that works completely offline.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.
Unofficial InstantDB Admin API client for Python.
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization
Federated Query Engine for AI - The only MCP Server you'll ever need
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Build Real-Time Knowledge Graphs for AI Agents
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
A TTS model capable of generating ultra-realistic dialogue in one pass.