Stars
Simplifying reinforcement learning for complex game environments
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
DSPy: The framework for programming—not prompting—language models
OCR & Document Extraction using vision models
Chrome Extension Boilerplate with React + Vite + Typescript
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦
Tesseract Open Source OCR Engine (main repository)