Starred repositories
Implement a reasoning LLM in PyTorch from scratch, step by step
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
verl: Volcano Engine Reinforcement Learning for LLMs
Fine-tune Gemma 3 on an object detection task
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision Language Models
A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge.
Solve Visual Understanding with Reinforced VLMs
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Official repository for "AM-RADIO: Reduce All Domains Into One"
[DEIMv2] Real Time Object Detection Meets DINOv3
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Python wrapper for TA-Lib (http://ta-lib.org/).
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)