RobertLuo1
🎯 Focusing

392 stars written in Python

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python · 70,575 stars · 9,817 forks · Updated Feb 10, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · 67,139 stars · 8,164 forks · Updated Feb 10, 2026

Inference code for Llama models

Python · 59,136 stars · 9,826 forks · Updated Jan 26, 2025

No fortress, purely open ground. OpenManus is Coming.

Python · 54,414 stars · 9,536 forks · Updated Jan 5, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python · 52,860 stars · 8,952 forks · Updated Nov 12, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python · 39,402 stars · 4,780 forks · Updated Jun 2, 2025

The official Meta Llama 3 GitHub site

Python · 29,239 stars · 3,514 forks · Updated Jan 26, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python · 28,519 stars · 2,887 forks · Updated Apr 30, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python · 28,115 stars · 2,604 forks · Updated Feb 10, 2026

Generative Models by Stability AI

Python · 26,909 stars · 3,037 forks · Updated Dec 16, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python · 26,548 stars · 1,876 forks · Updated Jan 9, 2026

Fully open reproduction of DeepSeek-R1

Python · 25,871 stars · 2,411 forks · Updated Nov 24, 2025

Official inference repo for FLUX.1 models

Python · 25,209 stars · 1,854 forks · Updated Jul 31, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python · 24,441 stars · 2,729 forks · Updated Aug 12, 2024

Contexts Optical Compression

Python · 22,445 stars · 2,062 forks · Updated Jan 27, 2026

Fast and memory-efficient exact attention

Python · 22,190 stars · 2,366 forks · Updated Feb 8, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python · 19,756 stars · 2,035 forks · Updated Jan 13, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python · 19,125 stars · 3,222 forks · Updated Feb 10, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python · 18,211 stars · 1,409 forks · Updated Feb 7, 2026

Bring portraits to life!

Python · 17,795 stars · 1,843 forks · Updated Nov 16, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python · 17,706 stars · 2,238 forks · Updated Feb 1, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python · 15,444 stars · 1,076 forks · Updated Feb 3, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python · 15,308 stars · 2,386 forks · Updated Dec 15, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python · 13,244 stars · 1,249 forks · Updated Feb 3, 2026

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python · 13,135 stars · 1,314 forks · Updated Oct 28, 2025

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python · 12,915 stars · 3,066 forks · Updated Dec 17, 2025

Minimal reproduction of DeepSeek R1-Zero

Python · 12,731 stars · 1,553 forks · Updated Apr 24, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python · 12,614 stars · 1,201 forks · Updated Feb 10, 2026

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python · 12,420 stars · 1,254 forks · Updated Nov 4, 2025