The University of Texas at Austin
Austin, TX, USA
zhenyu.gallery
@KyriectionZhang
Lists (8)
🦾 Benchmarking: LLM Hospital
💎 Efficient ML: Prune & Sparse & Quantization & KD & NAS
🤖 General Topics: Architectures & Optimization & BlockChain & SSL & Speech & Recsys
💍 Large Language Models: Next Step of LLMs
🚀 My Stack: Open-source of Our Works
💁 Quantum ML: ML for Quantum & Quantum for ML
🗼 Toolbox: Visualization & Coding Tool
🚩 Trustworthy ML: OoD & Adversarial & Backdoor
Stars
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
Kronos: A Foundation Model for the Language of Financial Markets
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
The official implementation for "Mitigating Overthinking in Large Reasoning Models via Manifold Steering"
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Code for the paper: "Learning to Reason without External Rewards"
Inverse Scaling in Test-Time Compute
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
Representation Engineering: A Top-Down Approach to AI Transparency
Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP
The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint"
Kinetics: Rethinking Test-Time Scaling Laws
SkyRL: A Modular Full-stack RL Library for LLMs
Training teacher models with reinforcement learning so that they can teach LLMs to reason for test-time scaling.
[ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"