zhichul

Follow

Brian Lu zhichul

Follow

PhD student in NLP.

2 followers · 1 following

Achievements

Achievements

Highlights

Pro

Pinned Loading

llm_causal_reasoning llm_causal_reasoning Public

Code for synthetic data generation, GRPO/DAPO/SFT training, and reasoning trace analysis to study algorithmic generalization of RL post-training.

Jupyter Notebook
annotation annotation Public

A library for collaboratively prompt engineering to annotate social media posts

Python
batched_vocabulary_optimization batched_vocabulary_optimization Public

Training UnigramLM style tokenizers jointly with Transformer task model

Shell
dblm dblm Public

Language models that condition on joint probability distributions, and interleave probabilistic inference with next-token prediction

Python
regression-gradient-estimator regression-gradient-estimator Public

Demo of using regression over perturbations to estimate gradient

Python
cslm cslm Public

Synthesizing code-switching data from a language model that was trained only on parallel or separate monolingual corpuses over two languages

Shell