Skip to content
View agneszhang435's full-sized avatar

Block or report agneszhang435

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 4,181 413 Updated Dec 2, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,748 610 Updated Dec 5, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 35,900 4,235 Updated Dec 14, 2025
Jupyter Notebook 6,099 1,613 Updated Jun 26, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,001 881 Updated Dec 4, 2025

kindle电子书

105 43 Updated Jan 19, 2021

个人构建MoE大模型:从预训练到DPO的完整实践

Python 2,082 156 Updated Dec 16, 2025

Performance analysis of predictive (alpha) stock factors

Jupyter Notebook 4,046 1,284 Updated Feb 12, 2024

We are committed to the open-sourcing quantitative knowledge, aiming to bridge the information gap between the domestic and international quantitative finance industries. 我们致力于量化知识的开源与汉化,打破国内外量化金融行…

2,761 224 Updated Dec 1, 2025

An intuitive library to extract features from time series. To cite this software publication: https://www.sciencedirect.com/science/article/pii/S2352711020300017

Jupyter Notebook 107 27 Updated Feb 17, 2020
Jupyter Notebook 69 19 Updated Mar 26, 2024

Muon is an optimizer for hidden layers in neural networks

Python 2,116 99 Updated Nov 23, 2025

My learning notes for ML SYS.

Python 4,732 299 Updated Dec 19, 2025

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】

Jupyter Notebook 15,430 1,793 Updated Dec 18, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,925 353 Updated Dec 21, 2025

Muon is Scalable for LLM Training

1,386 78 Updated Aug 3, 2025

EasyRL: An easy-to-use and comprehensive reinforcement learning package.

Python 252 45 Updated Feb 25, 2022

[ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)

Jupyter Notebook 181 14 Updated Feb 17, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,866 3,823 Updated Dec 21, 2025
Python 962 101 Updated Dec 21, 2025

A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.

348 24 Updated Dec 15, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,031 1,095 Updated Dec 12, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,417 163 Updated Mar 20, 2025

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 668 47 Updated Aug 5, 2025

Python语言基础50课

12,438 2,987 Updated Dec 18, 2025

FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀

Jupyter Notebook 4,805 882 Updated Dec 20, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,560 245 Updated Dec 18, 2025

Puzzles for learning Triton

Jupyter Notebook 2,188 179 Updated Nov 18, 2024
Python 1,662 99 Updated Sep 30, 2025
Next