Lists (2)
Sort Name ascending (A-Z)
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
Python tool for converting files and office documents to Markdown.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A high-throughput and memory-efficient inference and serving engine for LLMs
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
A generative world for general-purpose robotics & embodied AI learning.
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Build Real-Time Knowledge Graphs for AI Agents
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Toolkit for linearizing PDFs for LLM datasets/training
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Wan: Open and Advanced Large-Scale Video Generative Models
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Kronos: A Foundation Model for the Language of Financial Markets
A powerful framework for building realtime voice AI agents 🤖🎙️📹
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Modeling, training, eval, and inference code for OLMo
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
智能闲鱼客服机器人系统:专为闲鱼平台打造的AI值守解决方案,实现闲鱼平台7×24小时自动化值守,支持多专家协同决策、智能议价和上下文感知对话。
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image