Hong Kong University of Science and Technology
Shanghai, China
https://hq-King.github.io
Stars
🚀🚀 [Large Model] Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
A generative world for general-purpose robotics & embodied AI learning.
Fully open reproduction of DeepSeek-R1
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
verl: Volcano Engine Reinforcement Learning for LLMs
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
✨✨Latest Advances on Multimodal Large Language Models
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Let's make video diffusion practical!
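Llama Chinese Community: a real-time collection of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use.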
Wan: Open and Advanced Large-Scale Video Generative Models
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.
🔥Highlighting the top ML papers every week.
[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)
Reference PyTorch implementation and models for DINOv3
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…