Starred repositories
An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questions in plain English, QueryWeaver handles the weaving.
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Multi-Agent System: Data Analysis → Optimization → Business Insights
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Scalable RL solution for advanced reasoning of language models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Lemon AI is the first Full-stack, Open-source, Agentic AI framework, offering a fully local alternative to platforms like Manus & Genspark AI. It features an integrated Code Interpreter VM sandbox …
A powerful tool for creating fine-tuning datasets for LLM
Tool for generating high quality Synthetic datasets
The AI coding agent built for the terminal.
How I Scaled from Zero to a Million Store on Dukaan, Without a CS Degree. .. A System Design Handbook by Subhash Choudhary
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Minimal reproduction of DeepSeek R1-Zero
verl: Volcano Engine Reinforcement Learning for LLMs
21 Lessons, Get Started Building with Generative AI
Viaduct is a GraphQL-based system that provides a unified interface for accessing and interacting with any data source.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
Platform for creating Agents in a No-Code Visual Builder or TypeScript Agents SDK with full 2-way sync.
LLM agents built for control. Designed for real-world use. Deployed in minutes.
a flexible and customizable React chat component for integrating Parlant's chatbot seamlessly into your website.
A system for agentic LLM-powered data processing and ETL
Low-Cost LLM-Powered Data Processing with Theoretical Guarantees
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
AgentScope: Agent-Oriented Programming for Building LLM Applications
Battle-testing local LLM models on complex reasoning