Highlights
Lists (3)
Sort Name ascending (A-Z)
Stars
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
nanoRLHF: from-scratch journey into how LLMs and RLHF really work.
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
This repository is part of a course on Elasticsearch in Python. It includes notebooks that demonstrate its usage, along with a YouTube series to guide you through the material.
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
AirLLM 70B inference with single 4GB GPU
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
The definitive Web UI for local AI, with powerful features and easy setup.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Accessible large language models via k-bit quantization for PyTorch.
Unified framework for building enterprise RAG pipelines with small, specialized models
A high-throughput and memory-efficient inference and serving engine for LLMs
🚀✨ Help beginners to contribute to open source projects
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.