MuggleWang

Muggle Wang MuggleWang

21 followers · 8 following

Achievements

Starred repositories

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,516 3,760 Updated Nov 2, 2025

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 2,296 253 Updated Sep 3, 2025

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,884 141 Updated Aug 26, 2025

wdndev / tiny-llm-zh

从零实现一个小参数量中文大语言模型。

Python 869 100 Updated Aug 22, 2024

multimodal-art-projection / DailyPaper

HTML 55 Updated Nov 14, 2024

xuyang-liu16 / Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

361 11 Updated Jul 7, 2025

luhengshiwo / LLMForEverybody

每个人都能看懂的大模型知识分享，LLMs春/秋招大模型面试前必看，让你和面试官侃侃而谈

Jupyter Notebook 4,675 460 Updated Oct 13, 2025

XinJingHao / DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,060 375 Updated Jun 11, 2025

ZJU-LLMs / Foundations-of-LLMs

12,166 1,104 Updated Jan 14, 2025

LambdaLabsML / distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

Python 529 52 Updated Oct 22, 2025

cloneofsimo / vqgan-training

Train VAE like a boss

Jupyter Notebook 298 13 Updated Oct 21, 2024

Infatoshi / cuda-course

Cuda 1,905 365 Updated Nov 3, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 72,002 2,195 Updated Nov 5, 2025

pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,617 248 Updated Sep 10, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,920 3,916 Updated Nov 5, 2025

friedrichor / Awesome-Multimodal-Papers

A curated list of awesome Multimodal studies.

286 23 Updated Oct 27, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,608 4,613 Updated Nov 5, 2025

bradtraversy / 50projects50days

50+ mini web projects using HTML, CSS & JS

CSS 40,008 9,699 Updated Feb 26, 2025

Kwai-Kolors / Kolors

Kolors Team

Python 4,566 352 Updated Nov 13, 2024

yfzhang114 / SliME

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 162 7 Updated Dec 26, 2024

apple / ml-4m

4M: Massively Multimodal Masked Modeling

Python 1,769 110 Updated Jun 2, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,955 131 Updated Oct 30, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,969 8,196 Updated Dec 9, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,318 807 Updated Oct 31, 2025