Highlights
- Pro
-
zhuhanqing.github.io Public
Forked from ywwwer/ywwwer.github.ioMy personal website
JavaScript MIT License UpdatedDec 3, 2025 -
ML-Interview Public
Forked from wenhuchen/ML-InterviewPreparing for ML Interviews.
Python UpdatedNov 30, 2025 -
APOLLO Public
APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention
-
ToolOrchestra Public
Forked from NVlabs/ToolOrchestraToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
Python Apache License 2.0 UpdatedNov 27, 2025 -
ArcherCodeR Public
Forked from wizard-III/ArcherCodeRArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement learning.
Python MIT License UpdatedJul 22, 2025 -
reasoning_loading_bar Public
Forked from royeisen/reasoning_loading_barPython Other UpdatedJul 7, 2025 -
Entropy-Mechanism-of-RL Public
Forked from PRIME-RL/Entropy-Mechanism-of-RLThe Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
Python UpdatedJun 9, 2025 -
HRPO Public
Forked from Yueeeeeeee/HRPOHybrid Latent Reasoning via Reinforcement Learning
Python UpdatedMay 27, 2025 -
Soft-Thinking Public
Forked from eric-ai-lab/Soft-ThinkingOfficial implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
Python UpdatedMay 22, 2025 -
-
Long-to-Short-via-Model-Merging Public
Forked from hahahawu/Long-to-Short-via-Model-MergingModel merging is a highly efficient approach for long-to-short reasoning.
Python UpdatedMar 27, 2025 -
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedFeb 8, 2025 -
Lightening-Transformer Public
Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accelerator, HPCA'24
-
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python Apache License 2.0 UpdatedJan 13, 2025 -
lectures Public
Forked from gpu-mode/lecturesMaterial for cuda-mode lectures
Jupyter Notebook Apache License 2.0 UpdatedDec 20, 2024 -
PACE-Light Public
PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices, NeurIPs 2024
-
Adam-mini Public
Forked from zyushun/Adam-miniCode for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
Python UpdatedDec 5, 2024 -
MARS Public
Forked from AGI-Arena/MARSThe official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
Python Apache License 2.0 UpdatedNov 29, 2024 -
GaLore Public
Forked from jiaweizzhao/GaLoreGaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Python Apache License 2.0 UpdatedOct 28, 2024 -
Fira Public
Forked from xichen-fy/FiraFira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
Python Apache License 2.0 UpdatedOct 6, 2024 -
-
-
LLM-for-Photonics Public
Forked from renjieli08/LLM-for-PhotonicsLeveraging LLMs to design and optimize nanophotonics
Python UpdatedAug 10, 2024 -
AICircuit Public
Forked from AvestimehrResearchGroup/AICircuitThe implementation of AICircuit: A Multi-Level Dataset and Benchmark for AI-Driven Analog Integrated Circuit Design
-
-
Q-GaLore Public
Forked from VITA-Group/Q-GaLoreQ-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
Python Apache License 2.0 UpdatedJul 17, 2024 -
SpinQuant Public
Forked from facebookresearch/SpinQuantCode repo for the paper "SpinQuant LLM quantization with learned rotations"
Python Other UpdatedJul 17, 2024 -
-
MicroAdam Public
Forked from IST-DASLab/MicroAdamThis repository contains code for the MicroAdam paper.
Python Apache License 2.0 UpdatedJun 28, 2024