Skip to content
View dropreg's full-sized avatar

Block or report dropreg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

2,451 108 Updated Mar 25, 2026

Revealing and unlocking the context boundary of reward models

Python 21 1 Updated Jan 11, 2026

context denoising training for long-context modeling

Python 16 1 Updated Oct 10, 2025

Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high eff…

Python 95 4 Updated Dec 27, 2025

Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data

Python 139 13 Updated Mar 15, 2026

The official repository of paper Unlocking Recursive Thinking of LLMs: Alignment via Refinement

Python 2 1 Updated Jul 27, 2025
Python 14 Updated Sep 16, 2025

Fine-grained Language Model Evaluation and Correction via Branching and Bridging

Python 7 Updated May 7, 2024

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,711 254 Updated Nov 12, 2025

[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 386 28 Updated Mar 7, 2025

💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.

Python 43 4 Updated Mar 4, 2023

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 13,921 2,235 Updated Dec 30, 2025

Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar

Python 38 2 Updated Aug 11, 2024

A bibliography and survey of the papers surrounding o1

TeX 1,214 51 Updated Nov 16, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,005 13,773 Updated Apr 4, 2026

Optimizing inference proxy for LLMs

Python 3,407 268 Updated Mar 19, 2026

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,561 188 Updated Mar 28, 2026

🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.

Python 265 7 Updated Feb 20, 2025

Open-ended Long Text Generation via Masked Language Modeling

Python 7 2 Updated Mar 19, 2024

The framework to prune LLMs to any size and any config.

Python 95 3 Updated Mar 1, 2024
Python 96 7 Updated Oct 8, 2023

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,800 251 Updated Dec 12, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,257 4,001 Updated Jul 17, 2024

Inference code for Llama models

Python 59,296 9,826 Updated Jan 26, 2025

LSTM and QRNN Language Model Toolkit for PyTorch 1.2.0!

Python 20 2 Updated Mar 2, 2020