A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and po…

71 3 Updated Jun 13, 2025

LHRLAB / Graph-R1

[ICML 2026] Official resources of "Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning".

Python 566 73 Updated Apr 30, 2026

LHRLAB / HyperGraphRAG

[NeurIPS 2025] Official resources of "HyperGraphRAG: Retrieval-Augmented Generation via Hypergraph-Structured Knowledge Representation".

Python 403 63 Updated May 12, 2026

LHRLAB / KBQA-o1

[ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".

Python 37 5 Updated Dec 6, 2025

PetrKorab / Topic-Modelling-in-Business-Intelligence-BERTopic-and-FASTopic-in-Code

Comparison of two cutting-edge dynamic topic models solving consumer complaints classification exercise

Jupyter Notebook 8 1 Updated Apr 17, 2025

bobxwu / AntiLeakBench

Python 5 Updated Jan 25, 2026

Elfsong / GDM

Code for "Can Group Decision-Making Mitigates Social Bias in Large Language Models?"

Jupyter Notebook 4 Updated Dec 4, 2024

YuxiXie / SG-Deep-Question-Generation

This repository contains code and models for the paper: Semantic Graphs for Generating Deep Questions (ACL 2020).

Python 75 32 Updated Jul 16, 2022

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 331 38 Updated Jan 29, 2026

YuxiXie / SelfEval-Guided-Decoding

Python 103 7 Updated Dec 7, 2023

YuxiXie / COrAL

This repository contains the source code for COrAL, an Order-Agnostic Language Modeling framework for Efficient Iterative Refinement.

Python 8 Updated Apr 2, 2025

panFJCharlotte98 / Fallacy_Detection

Official Implementation of EMNLP 2024 Are LLMs Good Zero-Shot Fallacy Classifiers?

Jupyter Notebook 5 1 Updated Jan 25, 2025

zhiyuanhubj / Long_form_VideoQA

[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering

Python 18 Updated Oct 9, 2024

nguyentthong / video-language-understanding

[ACL’24 Findings] Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

48 Updated May 12, 2026

bobxwu / AKEW

EMNLP2024 - AKEW: Assessing Knowledge Editing in the Wild

4 Updated Feb 12, 2025

nelson-liu / lost-in-the-middle

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Python 383 41 Updated Jan 4, 2024

hamedR96 / ANTM

Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to compute clusters of semantically similar documents at diffe…

Jupyter Notebook 37 9 Updated Nov 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiaobao Wu bobxwu

Achievements

Achievements

Highlights

Block or report bobxwu

Stars

walkinglabs / hands-on-modern-rl

ponhvoan / TruthAnchor

PurCL / muke

SkyRiver-2000 / Epistemic-Context-Learning

AheadOFpotato / Awesome-LRM-Mechanisms

ponhvoan / iris

bobxwu / Paper-Neural-Topic-Models

SkyRiver-2000 / RuleArena

nguyentthong / MAMA

shuaizhao95 / w2sdefense

Anna7355 / SCOPE

panFJCharlotte98 / HMC

bobxwu / FASTopic

bobxwu / learning-from-rewards-llm-papers