zhuHQ zhuhanqing

🎯

Focusing

Ph.D. student at UT Austin.

33 followers · 29 following

Austin

zhuhanqing.github.io Public
Forked from ywwwer/ywwwer.github.io

My personal website

JavaScript MIT License Updated Dec 3, 2025
ML-Interview Public
Forked from wenhuchen/ML-Interview

Preparing for ML Interviews.

Python Updated Nov 30, 2025
APOLLO Public

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention

optimizer memory-efficient efficient-training llm llm-training

Python · 265 stars · 14 forks · Other · Updated Nov 29, 2025
ToolOrchestra Public
Forked from NVlabs/ToolOrchestra

ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.

Python Apache License 2.0 Updated Nov 27, 2025
ArcherCodeR Public
Forked from wizard-III/ArcherCodeR

ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement learning.

Python MIT License Updated Jul 22, 2025
reasoning_loading_bar Public
Forked from royeisen/reasoning_loading_bar

Python Other Updated Jul 7, 2025
Entropy-Mechanism-of-RL Public
Forked from PRIME-RL/Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python Updated Jun 9, 2025
HRPO Public
Forked from Yueeeeeeee/HRPO

Hybrid Latent Reasoning via Reinforcement Learning

Python Updated May 27, 2025
Soft-Thinking Public
Forked from eric-ai-lab/Soft-Thinking

Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python Updated May 22, 2025
SQuat Public
Forked from Red-Hat-AI-Innovation-Team/SQuat

Python MIT License Updated Apr 3, 2025
Long-to-Short-via-Model-Merging Public
Forked from hahahawu/Long-to-Short-via-Model-Merging

Model merging is a highly efficient approach for long-to-short reasoning.

Python Updated Mar 27, 2025
COAT Public
Forked from NVlabs/COAT

Python Updated Feb 16, 2025
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python Apache License 2.0 Updated Feb 8, 2025
Lightening-Transformer Public

Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accelerator, HPCA'24

Python · 39 stars · 7 forks · GNU General Public License v3.0 · Updated Feb 5, 2025
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python Apache License 2.0 Updated Jan 13, 2025
lectures Public
Forked from gpu-mode/lectures

Material for cuda-mode lectures

Jupyter Notebook Apache License 2.0 Updated Dec 20, 2024
PACE-Light Public

PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices, NeurIPS 2024

Python · 13 stars · 3 forks · Updated Dec 13, 2024
Adam-mini Public
Forked from zyushun/Adam-mini

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python Updated Dec 5, 2024
MARS Public
Forked from AGI-Arena/MARS

The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models

Python Apache License 2.0 Updated Nov 29, 2024
GaLore Public
Forked from jiaweizzhao/GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python Apache License 2.0 Updated Oct 28, 2024
Fira Public
Forked from xichen-fy/Fira

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?

Python Apache License 2.0 Updated Oct 6, 2024
CATS Public
Forked from ScalingIntelligence/CATS

Python Updated Sep 30, 2024
mini-s Public
Forked from wdlctc/mini-s

Python MIT License Updated Sep 16, 2024
LLM-for-Photonics Public
Forked from renjieli08/LLM-for-Photonics

Leveraging LLMs to design and optimize nanophotonics

Python Updated Aug 10, 2024
AICircuit Public
Forked from AvestimehrResearchGroup/AICircuit

The implementation of AICircuit: A Multi-Level Dataset and Benchmark for AI-Driven Analog Integrated Circuit Design

Python 2 MIT License Updated Aug 6, 2024
SparseTeMPO Public
Forked from ScopeX-ASU/SparseTeMPO

Python Updated Jul 25, 2024
Q-GaLore Public
Forked from VITA-Group/Q-GaLore

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.

Python Apache License 2.0 Updated Jul 17, 2024
SpinQuant Public
Forked from facebookresearch/SpinQuant

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python Other Updated Jul 17, 2024
SCATTER Public
Forked from ScopeX-ASU/SCATTER

Python MIT License Updated Jul 9, 2024
MicroAdam Public
Forked from IST-DASLab/MicroAdam

This repository contains code for the MicroAdam paper.

Python Apache License 2.0 Updated Jun 28, 2024
