dropreg

lxb dropreg

Achievements

Stars

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

2,583 113 Updated Apr 14, 2026

LCM-Lab / LongRM

Revealing and unlocking the context boundary of reward models

Python 21 1 Updated Jan 11, 2026

LCM-Lab / context-denoising-training

context denoising training for long-context modeling

Python 16 1 Updated Oct 10, 2025

OpenGVLab / SDLM

Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high eff…

Python 97 4 Updated Dec 27, 2025

OpenDataArena / OpenDataArena-Tool

Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data

Python 140 13 Updated Mar 15, 2026

Banner-Z / AvR

The official repository of paper Unlocking Recursive Thinking of LLMs: Alignment via Refinement

Python 2 1 Updated Jul 27, 2025

wwfnb / Laser

Python 14 Updated Sep 16, 2025

dropreg / Fennec

Fine-grained Language Model Evaluation and Correction via Branching and Bridging

Python 7 Updated May 7, 2024

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,728 258 Updated Nov 12, 2025

xlang-ai / aguvis

[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 387 28 Updated Mar 7, 2025

Spico197 / paper-hero

💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.

Python 43 4 Updated Mar 4, 2023

datawhalechina / easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 14,025 2,247 Updated Dec 30, 2025

XanderJC / attention-based-credit

Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar

Python 38 2 Updated Aug 11, 2024

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,960 13,974 Updated Apr 16, 2026

algorithmicsuperintelligence / optillm

Optimizing inference proxy for LLMs

Python 3,434 267 Updated Mar 19, 2026

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,569 189 Updated Apr 5, 2026

appl-team / appl

🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.

Python 264 7 Updated Feb 20, 2025

dropreg / OpenLTG-MLM

Open-ended Long Text Generation via Masked Language Modeling

Python 7 2 Updated Mar 19, 2024

jordddan / Pruning-LLMs

The framework to prune LLMs to any size and any config.

Python 96 3 Updated Mar 1, 2024

OpenNLG / OpenBA

Python 95 7 Updated Oct 8, 2023

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,798 251 Updated Dec 12, 2023

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,262 3,999 Updated Jul 17, 2024

meta-llama / llama

Inference code for Llama models

Python 59,341 9,830 Updated Jan 26, 2025

chenfei-wu / TaskMatrix

Python 34,159 3,238 Updated Jan 6, 2024

mourga / awd-lstm-lm

Forked from salesforce/awd-lstm-lm

LSTM and QRNN Language Model Toolkit for PyTorch 1.2.0!

Python 20 2 Updated Mar 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lxb dropreg

Achievements

Achievements

Block or report dropreg

Stars

knightnemo / Awesome-World-Models

LCM-Lab / LongRM

LCM-Lab / context-denoising-training

OpenGVLab / SDLM

OpenDataArena / OpenDataArena-Tool

Banner-Z / AvR

wwfnb / Laser

dropreg / Fennec

ML-GSAI / LLaDA

xlang-ai / aguvis

Spico197 / paper-hero

datawhalechina / easy-rl

XanderJC / attention-based-credit

srush / awesome-o1

rasbt / LLMs-from-scratch

algorithmicsuperintelligence / optillm

opendilab / LightZero

appl-team / appl

dropreg / OpenLTG-MLM

jordddan / Pruning-LLMs

OpenNLG / OpenBA

PhoebusSi / Alpaca-CoT

tatsu-lab / stanford_alpaca

meta-llama / llama

chenfei-wu / TaskMatrix

mourga / awd-lstm-lm