Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]

Python 78 5 Updated Dec 17, 2025

Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".

Python 133 5 Updated Apr 3, 2026

dUltra: Ultra-Fast Diffusion Large Language Models via Reinforcement Learning

Python 11 2 Updated Jan 18, 2026

[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs

Python 61 3 Updated Apr 12, 2026

Efficient factor mining in the Chinese stock market and crypto markets using symbolic regression.

Python 1,959 2,637 Updated Feb 24, 2026

[ICLR 2026] Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization

Python 9 2 Updated Jan 16, 2026

d3LLM: Ultra-Fast Diffusion LLM 🚀

Python 116 6 Updated Mar 19, 2026

Unofficial implementation of the toy example in JiT https://arxiv.org/abs/2511.13720

Jupyter Notebook 60 5 Updated Nov 21, 2025

dLLM: Simple Diffusion Language Modeling

Python 2,376 236 Updated Feb 27, 2026

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,332 331 Updated Jan 5, 2026

PyTorch implementation of MeanFlow

Jupyter Notebook 342 32 Updated Jul 30, 2025

Post-training with Tinker

Python 3,072 381 Updated Apr 14, 2026

A research project exploring fine-tuning BERT-style models for text generation

Python 40 8 Updated Nov 30, 2025

The best ChatGPT that $100 can buy.

Python 51,812 6,883 Updated Apr 13, 2026

[ICLR 2026] Discrete Diffusion Divergence Instruct (DiDi-Instruct)

Python 153 10 Updated Mar 4, 2026

Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".

Python 20 Updated Oct 29, 2025

The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)

Jupyter Notebook 110 4 Updated Dec 19, 2024

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 812 56 Updated Jul 9, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,650 9,689 Updated Nov 12, 2025

The official implementation of "Optimal Stochastic Trace Estimation in Generative Modeling (AISTATS 2025)"

Python 20 Updated Mar 2, 2025

An AI Hedge Fund Team

Python 53,776 9,333 Updated Apr 9, 2026
Python 3 1 Updated Oct 22, 2025

Exploring Applications of GRPO

Python 252 34 Updated Aug 25, 2025

A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.

Jupyter Notebook 221 21 Updated Jun 26, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 28,872 2,928 Updated Apr 9, 2026

Fully open reproduction of DeepSeek-R1

Python 25,988 2,414 Updated Apr 2, 2026

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,721 256 Updated Nov 12, 2025

Our library for RL environments + evals

Python 4,006 529 Updated Apr 14, 2026

PyTorch implementation of Deep Hedging, Utility Maximization and Portfolio Optimization

Jupyter Notebook 18 2 Updated Sep 22, 2024