- NVIDIA
- ATL, GA
- 03:08 (UTC -05:00)
- in/lxaw
- https://lxaw.github.io/index.html
Highlights
- Pro
Stars
dInfer: An Efficient Inference Framework for Diffusion Language Models
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.
CANDI: Continuous and Discrete Diffusion
🧀 PyTorch code for the Fromage optimiser.
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Development repository for "ShogiHome", a full-featured shogi GUI that runs on PC
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
MTEB: Massive Text Embedding Benchmark
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
Collection of Summer 2026 tech internships!
PyTorch implementation of Variational Diffusion Models.
Awesome Reasoning LLM Tutorial/Survey/Guide
Discrete Flow Matching implemented in PyTorch
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Remasking Discrete Diffusion Models with Inference-Time Scaling
Minimal implementation of a D3PM in PyTorch
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
[ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, and Yingyan (Celine) Lin.
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models