Skip to content
View RS2002's full-sized avatar
🎸
🎸

Block or report RS2002

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[EMNLP 2024 (main)] Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters

Python 13 2 Updated Nov 5, 2024

BertViz: Visualize Attention in Transformer Models

Python 8,086 882 Updated Jan 8, 2026

[ICML2026] Official Pytorch Implement for "Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models"

Python 8 Updated Apr 3, 2026

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,745 301 Updated Sep 8, 2022

HKUST Thesis LaTeX3 Template (Available on Overleaf/TeXPage)

TeX 149 16 Updated Dec 26, 2025

Train transformer language models with reinforcement learning.

Python 18,629 2,789 Updated Jun 13, 2026

multi-agent deep reinforcement learning for networked system control.

Python 446 93 Updated Sep 29, 2020

Official implementation for "Unifying Masked Diffusion Models with Various Generation Orders and Beyond"

Python 4 Updated May 14, 2026

A framework for few-shot evaluation of language models.

Python 12,936 3,333 Updated Jun 2, 2026

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 447 52 Updated Jan 26, 2026

Revisiting Discrete Gradient Estimation in MADDPG

Python 29 4 Updated Feb 24, 2023

dLLM: Simple Diffusion Language Modeling

Python 2,574 271 Updated Jun 12, 2026

Dream 7B, a large diffusion language model

Python 1,246 79 Updated Nov 21, 2025

Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"

Python 383 28 Updated Dec 22, 2024

Multi-Agent Reinforcement Learning (MARL) papers

299 41 Updated Sep 19, 2022

Official implementation of "Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies"

Python 6 Updated Feb 26, 2026

dUltra: Ultra-Fast Diffusion Large Language Models via Reinforcement Learning

Python 16 2 Updated Apr 17, 2026

Repository companioning the paper "Learning Unmasking Policies for Diffusion Language Models"

Python 13 1 Updated Mar 30, 2026
Python 1 Updated Nov 11, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,821 267 Updated Nov 12, 2025

Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]

Python 81 5 Updated Dec 17, 2025

Official inference implementation of the paper "DON'T SETTLE TOO EARLY: SELF-REFLECTIVE REMASKING FOR DIFFUSION LANGUAGE MODELS". [ICLR 2026]

Python 14 1 Updated Jan 28, 2026

MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models

Python 44 1 Updated Jan 28, 2026

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

1,094 48 Updated May 29, 2026

Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models"

Python 40 8 Updated Jan 13, 2025
Python 43 2 Updated Nov 10, 2025

This is the official implementation of our NeurIPS 2025 paper "Gated Integration of Low-Rank Adaptation for Continual Learning of Large Language Models".

Python 23 3 Updated Nov 27, 2025

This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)

72 2 Updated May 30, 2025

Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning

Python 6 Updated Sep 19, 2025

The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"

Python 63 5 Updated Jun 21, 2025
Next