Starred repositories
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
AI agents can now use real Android and iOS apps, just like a human.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
Mobile-Agent: The Powerful GUI Agent Family
The Intelligent GUI Agent for Mobile Phones
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
UltraRAG v2: A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"
A community-supported supercharged document management system: scan, index and archive all your documents
aider is AI pair programming in your terminal
Automate browser based workflows with AI
An open-source AI agent that brings the power of Gemini directly into your terminal.
LLM Frontend for Power Users.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
A distributed task scheduling framework.(分布式任务调度平台XXL-JOB)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
presentations made by ai sharing skills using ai + code to slove any problem
A high-throughput and memory-efficient inference and serving engine for LLMs
Universal icon framework. One syntax for FontAwesome, Material Design Icons, DashIcons, Feather Icons, EmojiOne, Noto Emoji and many other open source icon sets (over 150 icon sets and 200k icons).…
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
GeoAI: Artificial Intelligence for Geospatial Data
We write your reusable computer vision tools. 💜
Universal File Online Preview Project based on Spring-Boot
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.