Robin jianzhnie

📊 GitHub Stats

👨‍💻 About Me

I'm an AI engineer focused on building production-grade LLM systems and scalable reinforcement learning frameworks. I love turning cutting-edge research into clean, usable code.

🛠️ Tech Stack

🧠 LLM & AI Systems

Project	Description
mini-vLLM	A compact implementation of vLLM, designed to demystify the complexities of modern LLM serving systems.
ScaleTorch	A scalable PyTorch framework for training large models, implementing 4D parallelism (TP, PP, SP, DP).
Open-R1	Open-source training pipeline for DeepSeek-R1-style models and RLHF.
LLMEval	A modular framework to evaluate LLMs across tasks and settings.
LLMReasoning	Techniques and toolkit for reasoning with LLMs.
LLMToolkit	A PyTorch toolkit for NLP and LLM development.
LLamaTuner	Easy and efficient fine-tuning pipelines for LLMs.

🎮 Reinforcement Learning

Project	Description
Deep-RL-Toolkit	Single-agent RL toolkit (DQN, Rainbow, DDPG, PPO, SAC, TD3, …).
Deep-MARL-Toolkit	Multi-agent RL toolkit (VDN, QMIX, MADDPG, MAPPO, …).
RLZero	MCTS for general sequential decision making (AlphaZero, MuZero, …).
ScaleRL	Simple, scalable distributed RL (A3C, Ape-X, IMPALA, …).
CyberAttackSimulator	RL environment for autonomous cyber attack and defense on simulated networks.

🔧 More Projects

Project	Description
Diffusion Toolkit	Image/audio generation with diffusion models in PyTorch.
AutoTimm	AutoML for deep learning tasks.
AutoTabular	AutoML for tabular data.

How to reach me 📫

Email: jianzhnie@gmail.com
Homepage: https://jianzhnie.github.io
Blog: https://jianzhnie.github.io/llmtech/
ZhiHu: https://www.zhihu.com/column/fengnie
Hugging Face Org: https://huggingface.co/GaussianTech
LinkedIn: https://www.linkedin.com/in/jianzheng-nie-2749b7156/
Ask me about: statistics, machine learning, LLMs, and RL.
❤️ Sponsor me on GitHub

Have an awesome day! 🌟
