Starred repositories
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
[TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on large language model-based β¦
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
MetricFlow allows you to define, build, and maintain metrics in code.
π Cube Core is open-source semantic layer for AI, BI and embedded analytics
Data Apps & Dashboards for Python. No JavaScript Required.
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Squrve is a lightweight yet powerful framework for translating natural language into SQL over complex databases.
Interact with your SQL database, Natural Language to SQL using LLMs
β‘οΈ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered business intelligence in seconds.
open-source agentic AI data assistant for the next generation of AI + Data products.
π€ Chat with your SQL database π. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval π.
π Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
ApeRAG: Production-ready GraphRAG with multi-modal indexing, AI agents, MCP support, and scalable K8s deployment
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
[Required for large models] Office to Markdown service implementation, based on Microsoft Markitdown.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website β¦
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.