Starred repositories
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
A pixel office for your OpenClaw: turn invisible work states into a cozy little space with characters, daily notes, and guest agents. Code under MIT; art assets for non-commercial learning only.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps,, has memory, scheduled jobs, and runs dir…
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Agent S: an open agentic framework that uses computers like a human
Text-audio foundation model from Boson AI
Multilingual Document Layout Parsing in a Single Vision-Language Model
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
[ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval
Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.
E5-V: Universal Embeddings with Multimodal Large Language Models
Retrieval and Retrieval-augmented LLMs
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
LlamaIndex is the leading document agent and OCR platform
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷