Stars
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning
[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
[NeurIPS 2025] Improving Video Generation with Human Feedback
A unified inference and post-training framework for accelerated video generation.
An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
Enjoy the magic of Diffusion models!
Wan: Open and Advanced Large-Scale Video Generative Models
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.