Stars
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Hunt down social media accounts by username across social networks
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
No fortress, purely open ground. OpenManus is Coming.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
aider is AI pair programming in your terminal
We write your reusable computer vision tools. 💜
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Build Real-Time Knowledge Graphs for AI Agents
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
🕵️‍♂️ Collect a dossier on a person by username from thousands of sites
A Conversational Speech Generation Model
Blind & invisible watermark for images; extracting the watermark does not require the original image!
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Open-source framework for conversational voice AI agents
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.