Stars
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Tesseract Open Source OCR Engine (main repository)
DSPy: The framework for programming—not prompting—language models
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
OCR & Document Extraction using vision models
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
YuE: Open Full-song Music Generation Foundation Model, an open alternative to Suno.ai
Simplifying reinforcement learning for complex game environments
Chrome Extension Boilerplate with React + Vite + TypeScript
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.
Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦