-
WuhanUniversity
- liu-yii.github.io
Highlights
- Pro
Stars
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
🚀 「大模型」1小时从0训练67M参数的视觉多模态VLM!🌏 Train a 67M-parameter VLM from scratch in just 1 hours!
🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
[CVPR 2026] Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Community trainer for Lightricks' LTX Video model 🎬 ⚡️
[NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
[CVPR '25] MEt3R: Measuring Multi-View Consistency in Generated Images
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
[NeurIPS'25] One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution
Generative Models by Stability AI
PEACE: Empowering Geologic Map Holistic Understanding with MLLMs [Official, CVPR 2025]
Muon is an optimizer for hidden layers in neural networks
StereoINR: Cross-View Geometry Consistent Stereo Super Resolution with Implicit Neural Representation
[ECCV 2024 - Oral] HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
Official PyTorch Code for our ICCV25 paper- Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution
OpenStereo: A Comprehensive Benchmark for Stereo Matching
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
An unofficial implementation for "CoSeR: Bridging Image and Language for Cognitive Super-Resolution (CVPR 2024)"
[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution
[IEEE TPAMI 2025] A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
This is a summary of research on All-In-One Image/Video Restoration. There may be omissions. If anything is missing please get in touch with us. Our emails: liboyun.gm@gmail.com; gouyuanbiao@gmail.…
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer