- Tsinghua University
- Beijing
- UTC +08:00
- robertluo1.github.io
Stars
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
No fortress, purely open ground. OpenManus is Coming.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Open-Sora: Democratizing Efficient Video Production for All
A generative world for general-purpose robotics & embodied AI learning.
Generative Models by Stability AI
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Fully open reproduction of DeepSeek-R1
Official inference repo for FLUX.1 models
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Fast and memory-efficient exact attention
A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
verl: Volcano Engine Reinforcement Learning for LLMs
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Janus-Series: Unified Multimodal Understanding and Generation Models
A C++ header-only HTTP/HTTPS server and client library
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.