
RLP: Reinforcement as a Pretraining Objective

216 13 Updated Oct 5, 2025

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 364 34 Updated Dec 21, 2025

GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.

Python 303 25 Updated Nov 11, 2025

CANDI: Continuous and Discrete Diffusion

Python 16 Updated Oct 27, 2025

LeetCode for PyTorch

Jupyter Notebook 1,736 196 Updated Jul 26, 2025

[NeurIPS'19] Deep Equilibrium Models

Python 784 87 Updated Jul 4, 2022

🧀 PyTorch code for the Fromage optimiser.

Jupyter Notebook 129 8 Updated Jul 14, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,631 135 Updated Dec 4, 2025

Development repository for ShogiHome, a full-featured shogi GUI that runs on PC

TypeScript 197 37 Updated Dec 21, 2025

[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Python 181 23 Updated Sep 13, 2025

MTEB: Massive Text Embedding Benchmark

Python 3,036 524 Updated Dec 20, 2025

LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)

Python 145 9 Updated Nov 9, 2024

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

587 27 Updated Dec 16, 2025
Python 150 21 Updated Oct 29, 2025

Collection of Summer 2026 tech internships!

42,748 3,132 Updated Dec 21, 2025

PyTorch implementation of Variational Diffusion Models.

Python 104 10 Updated Apr 23, 2024

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 2,216 153 Updated Oct 14, 2025

Discrete Flow Matching implemented in PyTorch

Python 33 2 Updated Mar 23, 2025
Python 109 7 Updated May 29, 2023

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 778 51 Updated Jul 9, 2025

Remasking Discrete Diffusion Models with Inference-Time Scaling

Python 62 9 Updated Mar 8, 2025

Minimal implementation of D3PM in PyTorch

Jupyter Notebook 279 21 Updated Apr 22, 2024

[ICML 2025] The Diffusion Duality

Python 180 23 Updated Oct 13, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 748 70 Updated Nov 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,277 7,789 Updated Dec 21, 2025

Esoteric Language Models

Python 109 15 Updated Nov 24, 2025

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 463 22 Updated May 17, 2025

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Shell 873 114 Updated Nov 12, 2025

[ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, and Yingyan (Celine) Lin.

Python 31 6 Updated Mar 2, 2024

[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Python 821 105 Updated Mar 1, 2024