Highlights
- Pro
🤖AI
[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
Robust Speech Recognition via Large-Scale Weak Supervision
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Official documentation of CodeRabbit: AI Code Reviews
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
SGLang is a fast serving framework for large language models and vision language models.
Model Context Protocol Servers
No fortress, purely open ground. OpenManus is Coming.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Production-ready platform for agentic workflow development.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
An Application Framework for AI Engineering
The ultimate LLM/AI application development framework in Golang.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors