Stars
The paper list of "Memory in the Age of AI Agents: A Survey"
🤗 smolagents: a barebones library for agents that think in code.
[EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
FastAPI framework, high performance, easy to learn, fast to code, ready for production
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
Speech-to-text server framework with next-gen Kaldi
A high-throughput and memory-efficient inference and serving engine for LLMs
Open-source vector similarity search for Postgres
[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
Python packaging and dependency management made easy
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Universal LLM Deployment Engine with ML Compilation
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Pytorch implementation of the paper Exploring Simple Siamese Representation Learning.
A scalable & efficient active learning/data selection system for everyone.
The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.
An example Spring-boot project that uses a combination of liquibase and embedded postgres for component test
Official implementation of our paper: Towards Robust and Reproducible Active Learning using Neural Networks, accepted at CVPR 2022.
A concise but complete full-attention transformer with a set of promising experimental features from various papers