Stars
An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
Model Context Protocol Servers
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
LOFT: A 1 Million+ Token Long-Context Benchmark
DSPy: The framework for programming—not prompting—language models
A curated list of Diffusion Model in RL resources (continually updated)
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
High-speed Large Language Model Serving for Local Deployment
Multi agent system for AI-driven software development. Combine LLM with DevOps tools to convert natural language requirements into working software. Supports any development language and extends th…
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Retrieval and Retrieval-augmented LLMs
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Running large language models on a single GPU for throughput-oriented scenarios.
🦜🔗 The platform for reliable agents.
💫 Industrial-strength Natural Language Processing (NLP) in Python
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
A markdown version emoji cheat sheet
🏕️ Reproducible development environment for humans and agents