Stars
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
Revealing and unlocking the context boundary of reward models
context denoising training for long-context modeling
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high eff…
Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data
The official repository of paper Unlocking Recursive Thinking of LLMs: Alignment via Refinement
Fine-grained Language Model Evaluation and Correction via Branching and Bridging
Official PyTorch implementation for "Large Language Diffusion Models"
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar
A bibliography and survey of the papers surrounding o1
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Optimizing inference proxy for LLMs
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.
Open-ended Long Text Generation via Masked Language Modeling
The framework to prune LLMs to any size and any config.
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
Code and documentation to train Stanford's Alpaca models, and generate the data.
mourga / awd-lstm-lm
Forked from salesforce/awd-lstm-lmLSTM and QRNN Language Model Toolkit for PyTorch 1.2.0!