Skip to content
View MuggleWang's full-sized avatar

Block or report MuggleWang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,516 3,760 Updated Nov 2, 2025

Minimalistic large language model 3D-parallelism training

Python 2,296 253 Updated Sep 3, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,884 141 Updated Aug 26, 2025

从零实现一个小参数量中文大语言模型。

Python 869 100 Updated Aug 22, 2024

📚 Collection of awesome generation acceleration resources.

361 11 Updated Jul 7, 2025

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 4,675 460 Updated Oct 13, 2025

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,060 375 Updated Jun 11, 2025

Best practices & guides on how to write distributed pytorch training code

Python 529 52 Updated Oct 22, 2025

Train VAE like a boss

Jupyter Notebook 298 13 Updated Oct 21, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 72,002 2,195 Updated Nov 5, 2025

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,617 248 Updated Sep 10, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,920 3,916 Updated Nov 5, 2025

A curated list of awesome Multimodal studies.

286 23 Updated Oct 27, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,608 4,613 Updated Nov 5, 2025

50+ mini web projects using HTML, CSS & JS

CSS 40,008 9,699 Updated Feb 26, 2025

Kolors Team

Python 4,566 352 Updated Nov 13, 2024

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 162 7 Updated Dec 26, 2024

4M: Massively Multimodal Masked Modeling

Python 1,769 110 Updated Jun 2, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,955 131 Updated Oct 30, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,969 8,196 Updated Dec 9, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,318 807 Updated Oct 31, 2025

A curated list of awesome vision and language resources (still under construction... stay tuned!)

551 44 Updated Nov 4, 2024

LLM101n: Let's build a Storyteller

35,454 1,929 Updated Aug 1, 2024

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 818 119 Updated Nov 4, 2025

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Python 636 45 Updated Feb 29, 2024

Foundation Architecture for (M)LLMs

Python 3,119 220 Updated Apr 11, 2024

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,579 359 Updated May 13, 2025
Next