Lichang-Chen

Organizations

@tianyi-lab

Python 96 11 Updated Nov 22, 2025

AI-Driven Research Systems (ADRS)

Jupyter Notebook 142 23 Updated Dec 17, 2025

Automated tool for running Python programs in a streamlined manner

JavaScript 390 24 Updated Jan 12, 2026

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1,285 191 Updated Feb 9, 2021

This repo mainly contains CS234 (Spring 2024) assignment's coding problems

Python 60 22 Updated Feb 4, 2025

Stanford CS234: Reinforcement Learning assignments and practices

Python 63 11 Updated Jul 31, 2024

This is a Phi Family of SLMs book for getting started with Phi Models. Phi is a family of open-source AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,737 497 Updated May 13, 2026

Using PPO, I am attempting to solve the cartpole environment

Python 1 Updated Jan 20, 2022

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 1,620 2,073 Updated Apr 7, 2026

The LLM Training Puzzles by Sasha Rush

Jupyter Notebook 4 Updated Jul 8, 2023

A version of verl to support diverse tool use

Python 982 80 Updated Mar 2, 2026

Jupyter Notebook 502 44 Updated Oct 18, 2024

LeetCode Training and Evaluation Dataset

Python 49 2 Updated Apr 22, 2025

Reproducing R1 for Code with Reliable Rewards

Python 309 20 Updated May 5, 2025

LM engine is a library for pretraining/finetuning LLMs

Python 171 29 Updated May 17, 2026

Python 6,084 471 Updated Aug 29, 2025

What would you do with 1000 H100s...

Jupyter Notebook 1,172 72 Updated Jan 10, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 12,151 935 Updated Sep 1, 2024

Puzzles for learning Triton

Jupyter Notebook 2,440 229 Updated Apr 1, 2026

Recipes to train the self-rewarding reasoning LLMs.

Python 232 14 Updated Mar 2, 2025

Solve puzzles. Improve your pytorch.

Jupyter Notebook 4,054 367 Updated Jul 15, 2024

https://huyenchip.com/ml-interviews-book/

HTML 4,624 669 Updated Mar 21, 2025

Codebase for Iterative DPO Using Rule-based Rewards

Python 271 34 Updated Apr 11, 2025

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 8,272 1,473 Updated Nov 28, 2025

[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Python 76 3 Updated Jun 25, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,214 51 Updated Nov 16, 2024

Recipes to train reward model for RLHF.

Python 1,531 109 Updated Apr 24, 2025

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 5,224 1,131 Updated May 16, 2026

ODIN: Disentangled Reward Mitigates Hacking in RLHF (ICML 2024)

Python 6 Updated Sep 5, 2024

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

17,199 1,560 Updated Feb 13, 2023