Stars
Cookiecutter Django is a framework for jumpstarting production-ready Django projects quickly.
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Pyt…
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
Official code for the paper "Scaling Multilingual Visual Speech Recognition"
The absolute trainer to light up AI agents.
An Open Source implementation of Notebook LM with more flexibility and features
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
Retrieval and Retrieval-augmented LLMs
Get your documents ready for gen AI
LlamaIndex is the leading document agent and OCR platform
An open-source RAG-based tool for chatting with your documents.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
Generate audiobooks from e-books, voice cloning & 1158+ languages!
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Official repository for "DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly" accepted at CVPR2024
List of open-source alternatives to everyday SaaS products.
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Code for "Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models"