Starred repositories

506 results for source starred repositories

🚀🚀 "Large models": train a small 26M-parameter GPT entirely from scratch in just 2 hours!

Python 33,265 3,874 Updated Nov 10, 2025

Minimalistic large language model 3D-parallelism training

Python 2,308 256 Updated Sep 3, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 1,891 145 Updated Aug 26, 2025

Implement a small-parameter Chinese large language model from scratch.

Python 871 100 Updated Aug 22, 2024

📚 Collection of awesome generation acceleration resources.

362 11 Updated Jul 7, 2025

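LLM knowledge sharing that everyone can understand; a must-read before LLM interviews in the spring/autumn recruiting seasons, so you can talk confidently with your interviewer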

Jupyter Notebook 4,705 462 Updated Oct 13, 2025

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,078 376 Updated Jun 11, 2025

Best practices & guides on how to write distributed pytorch training code

Python 533 53 Updated Oct 22, 2025

Train VAE like a boss

Jupyter Notebook 298 13 Updated Oct 21, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 72,511 2,216 Updated Nov 11, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,158 3,949 Updated Nov 10, 2025

A curated list of awesome Multimodal studies.

287 23 Updated Oct 27, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,664 4,620 Updated Nov 11, 2025

50+ mini web projects using HTML, CSS & JS

CSS 40,066 9,708 Updated Feb 26, 2025

Kolors Team

Python 4,571 349 Updated Nov 13, 2024

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 162 7 Updated Dec 26, 2024

4M: Massively Multimodal Masked Modeling

Python 1,770 110 Updated Jun 2, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,963 131 Updated Nov 7, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 49,342 8,264 Updated Dec 9, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,355 810 Updated Nov 9, 2025

A curated list of awesome vision and language resources (still under construction... stay tuned!)

552 44 Updated Nov 4, 2024

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 825 121 Updated Nov 10, 2025

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Python 637 45 Updated Feb 29, 2024

Foundation Architecture for (M)LLMs

Python 3,119 221 Updated Apr 11, 2024

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,578 359 Updated May 13, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,669 1,075 Updated Nov 9, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,077 3,188 Updated Nov 11, 2025