Stars
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Hunt down social media accounts by username across social networks
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
AI agents running research on single-GPU nanochat training automatically
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.
No fortress, purely open ground. OpenManus is Coming.
aider is AI pair programming in your terminal
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
We write your reusable computer vision tools. 💜
Build Real-Time Knowledge Graphs for AI Agents
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
🕵️♂️ Collect a dossier on a person by username from 3000+ sites
A Conversational Speech Generation Model
Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Open-source framework for conversational voice AI agents
Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic access to NotebookLM's features—including capabilities the web UI doesn't expose—via Python, CLI, and AI agents like…
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.