Stars
Now, Stronger AI Pushes Frontiers, Stronger Our Shared Future.
Democratizing AI scientists with ToolUniverse
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Python tool for converting files and office documents to Markdown.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Open-source framework for building human-intervenable deep research agents with real-time collaboration between humans and AI. Features multi-agent architecture, comprehensive tool suite, and web-b…
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
Unlock 650+ MCP servers tools in your favorite agentic framework.
Easiest and laziest way for building multi-agent LLMs applications.
Model Context Protocol(MCP) 编程极速入门
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
PaperRegister: Boosting Flexible-grained Paper Search via Hierarchical Register Indexing
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".
A curated list of top best AI Related Newsletters and ai agents newsletters
The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.
Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025)
【CVPR2025】From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[TCSVT 2024] Official implementation of the paper: Benchmarking Micro-action Recognition: Dataset, Methods, and Applications