Stars
Spec-driven development (SDD) for AI coding assistants.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"
"Paper2Slides: From Paper to Presentation in One Click"
Ship agents faster. Plano is delivery infrastructure for agentic applications. A models-native proxy server & dataplane that offloads the plumbing work, so you stay focused on product logic.
The official repo of the paper "MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
Official Python implementation for DocAgent, accepted to EMNLP 2025
Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
Agentic Design Patterns: A Hands-On Guide to Building Intelligent Systems by Antonio Gulli
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Salesforce Enterprise Deep Research
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
12 Lessons to Get Started Building AI Agents
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
Chinese-language version translated from nginx-proxy-manager
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
SGLang is a fast serving framework for large language models and vision language models.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Ultra-high-performance, secure, all-in-one acceleration engine for developer resources
Tongyi Deep Research, the Leading Open-source Deep Research Agent
[EMNLP 2025] Official implementation: agent-style visual question answering in autonomous driving