Skip to content
View jiarui-liu's full-sized avatar

Organizations

@UM-temp

Block or report jiarui-liu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on analysis of 192K model traces and 54 human think-aloud traces.

Python 23 6 Updated Nov 26, 2025

This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.

236 4 Updated Dec 17, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,401 154 Updated Aug 12, 2025

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

3 Updated Jun 5, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,081 122 Updated Jun 1, 2023

📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.

313 6 Updated Nov 5, 2025

Code for the paper: "Learning to Reason without External Rewards"

Python 383 41 Updated Jul 10, 2025
Python 402 54 Updated Dec 15, 2025

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Jupyter Notebook 258 29 Updated May 14, 2025

Go ahead and axolotl questions

Python 10,953 1,220 Updated Dec 17, 2025

Code for BLT research paper

Python 2,018 187 Updated Nov 3, 2025

Environments for LLM Reinforcement Learning

Python 3,636 453 Updated Dec 17, 2025

LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, first-order, and non-monotonic logics.

33 3 Updated May 2, 2024

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,277 105 Updated Dec 15, 2025

Official Repo for Open-Reasoner-Zero

Python 2,079 119 Updated Jun 2, 2025

Simple RL training for reasoning

Python 3,808 281 Updated Aug 3, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,007 1,093 Updated Dec 12, 2025

A comprehensive evaluation dataset encompassing multi-step logical reasoning with various inference rules and depths

6 1 Updated Jul 10, 2024

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,660 308 Updated Nov 13, 2025

Repository for the paper "Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers?"

Python 6 Updated Dec 1, 2024

[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"

150 7 Updated Sep 21, 2024

Taming Stable Diffusion for Lip Sync!

Python 5,259 843 Updated Jun 20, 2025

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 411 60 Updated Apr 13, 2025
Python 240 17 Updated Feb 22, 2024

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Python 562 48 Updated Jan 28, 2025

Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.

Jupyter Notebook 23 5 Updated Mar 27, 2025

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

714 37 Updated Apr 7, 2024
Next