-
The Hong Kong University of Science and Technology
- HongKong
-
18:20
(UTC +08:00) - https://zijianzhao.netlify.app/
- https://orcid.org/0000-0002-3326-9650
- https://scholar.google.com/citations?user=XkA3qCcAAAAJ&hl=en
- https://openreview.net/profile?id=~Zijian_Zhao7
- https://huggingface.co/RS2002
Stars
[EMNLP 2024 (main)] Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters
BertViz: Visualize Attention in Transformer Models
[ICML2026] Official Pytorch Implement for "Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models"
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
HKUST Thesis LaTeX3 Template (Available on Overleaf/TeXPage)
Train transformer language models with reinforcement learning.
multi-agent deep reinforcement learning for networked system control.
Official implementation for "Unifying Masked Diffusion Models with Various Generation Orders and Beyond"
A framework for few-shot evaluation of language models.
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
Revisiting Discrete Gradient Estimation in MADDPG
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
Multi-Agent Reinforcement Learning (MARL) papers
Official implementation of "Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies"
dUltra: Ultra-Fast Diffusion Large Language Models via Reinforcement Learning
Repository companioning the paper "Learning Unmasking Policies for Diffusion Language Models"
Official PyTorch implementation for "Large Language Diffusion Models"
Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]
Official inference implementation of the paper "DON'T SETTLE TOO EARLY: SELF-REFLECTIVE REMASKING FOR DIFFUSION LANGUAGE MODELS". [ICLR 2026]
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models"
This is the official implementation of our NeurIPS 2025 paper "Gated Integration of Low-Rank Adaptation for Continual Learning of Large Language Models".
qianlima-lab / awesome-lifelong-learning-methods-for-llm
Forked from zzz47zzz/awesome-lifelong-learning-methods-for-llmThis repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)
Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning
The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"