Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]

Python 76 5 Updated Dec 17, 2025

Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".

Python 132 4 Updated Mar 24, 2026

dUltra: Ultra-Fast Diffusion Large Language Models via Reinforcement Learning

Python 10 2 Updated Jan 18, 2026

[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs

Python 62 3 Updated Feb 22, 2026

Efficient factor mining in the Chinese stock market and crypto markets using symbolic regression.

Python 1,923 2,633 Updated Feb 24, 2026

[ICLR 2026] Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization

Python 8 2 Updated Jan 16, 2026

d3LLM: Ultra-Fast Diffusion LLM 🚀

Python 112 6 Updated Mar 19, 2026

Unofficial implementation of the toy example in JiT https://arxiv.org/abs/2511.13720

Jupyter Notebook 60 5 Updated Nov 21, 2025

dLLM: Simple Diffusion Language Modeling

Python 2,297 224 Updated Feb 27, 2026

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,289 324 Updated Jan 5, 2026

PyTorch implementation of MeanFlow

Jupyter Notebook 335 34 Updated Jul 30, 2025

Post-training with Tinker

Python 3,012 362 Updated Apr 1, 2026

A research project exploring fine-tuning BERT-style models for text generation

Python 39 7 Updated Nov 30, 2025

The best ChatGPT that $100 can buy.

Python 50,834 6,682 Updated Mar 27, 2026

[ICLR 2026] Discrete Diffusion Divergence Instruct (DiDi-Instruct)

Python 153 10 Updated Mar 4, 2026

Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".

Python 20 Updated Oct 29, 2025

The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)

Jupyter Notebook 110 4 Updated Dec 19, 2024

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 807 53 Updated Jul 9, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,938 9,532 Updated Nov 12, 2025

The official implementation of "Optimal Stochastic Trace Estimation in Generative Modeling (AISTATS 2025)"

Python 20 Updated Mar 2, 2025

An AI Hedge Fund Team

Python 49,884 8,663 Updated Mar 28, 2026
Python 3 1 Updated Oct 22, 2025

Exploring Applications of GRPO

Python 252 34 Updated Aug 25, 2025

A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.

Jupyter Notebook 220 20 Updated Jun 26, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 28,803 2,923 Updated Apr 30, 2025

Fully open reproduction of DeepSeek-R1

Python 25,965 2,410 Updated Nov 24, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,708 253 Updated Nov 12, 2025

Our library for RL environments + evals

Python 3,959 524 Updated Apr 1, 2026

PyTorch implementation of Deep Hedging, Utility Maximization and Portfolio Optimization

Jupyter Notebook 18 2 Updated Sep 22, 2024