Starred repositories (MuggleWang)

506 results for source starred repositories

🚀🚀 [LLM] Train a 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 32,622 3,778 Updated Nov 6, 2025

Minimalistic large language model 3D-parallelism training

Python 2,297 253 Updated Sep 3, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 1,886 142 Updated Aug 26, 2025

Implement a small-parameter Chinese large language model from scratch.

Python 869 100 Updated Aug 22, 2024

📚 Collection of awesome generation acceleration resources.

361 11 Updated Jul 7, 2025

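LLM knowledge sharing that everyone can understand; a must-read before LLM interviews in the spring/fall recruiting seasons, so you can talk confidently with interviewers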

Jupyter Notebook 4,680 460 Updated Oct 13, 2025

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,067 375 Updated Jun 11, 2025

Best practices & guides on how to write distributed PyTorch training code

Python 530 52 Updated Oct 22, 2025

Train VAE like a boss

Jupyter Notebook 298 13 Updated Oct 21, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 72,111 2,199 Updated Nov 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,985 3,925 Updated Nov 6, 2025

A curated list of awesome Multimodal studies.

286 23 Updated Oct 27, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,624 4,613 Updated Nov 6, 2025

50+ mini web projects using HTML, CSS & JS

CSS 40,020 9,701 Updated Feb 26, 2025

Kolors Team

Python 4,568 352 Updated Nov 13, 2024

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 162 7 Updated Dec 26, 2024

4M: Massively Multimodal Masked Modeling

Python 1,769 110 Updated Jun 2, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,955 131 Updated Oct 30, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 49,045 8,216 Updated Dec 9, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,328 808 Updated Oct 31, 2025

A curated list of awesome vision and language resources (still under construction... stay tuned!)

551 44 Updated Nov 4, 2024

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 819 119 Updated Nov 5, 2025

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Python 636 45 Updated Feb 29, 2024

Foundation Architecture for (M)LLMs

Python 3,119 221 Updated Apr 11, 2024

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,579 359 Updated May 13, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,631 1,072 Updated Nov 6, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,050 3,179 Updated Nov 6, 2025