bobxwu
💭 Fighting monsters and leveling up


🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Python 2,960 188 Updated Jun 18, 2026
Python 1 Updated Mar 23, 2026

[COLM 2025] Official implementation of μKE - edit LLM knowledge while preserving memory dependencies via Matryoshka-style objectives.

Python 14 Updated Aug 20, 2025

Epistemic Context Learning: Building Trust the Right Way in LLM-Based Multi-Agent Systems

Python 8 Updated Jan 30, 2026

Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures

33 4 Updated Jan 29, 2026
Python 3 Updated Mar 19, 2026

Papers of Neural Topic Models (NTMs)

96 9 Updated Jul 25, 2024

[ACL 2025] RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Python 27 Updated Jul 2, 2025

[ECCV’24 Main] MAMA: A Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Python 9 Updated Oct 16, 2024

Unlearning backdoor

Python 8 2 Updated May 18, 2025
Python 2 Updated May 20, 2025

Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning

Jupyter Notebook 4 Updated Jul 4, 2025

A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)

Python 162 13 Updated Jul 29, 2025

A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and po…

71 3 Updated Jun 13, 2025

[ICML 2026] Official resources of "Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning".

Python 566 73 Updated Apr 30, 2026

[NeurIPS 2025] Official resources of "HyperGraphRAG: Retrieval-Augmented Generation via Hypergraph-Structured Knowledge Representation".

Python 403 63 Updated May 12, 2026

[ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".

Python 37 5 Updated Dec 6, 2025

Comparison of two cutting-edge dynamic topic models on a consumer complaints classification exercise

Jupyter Notebook 8 1 Updated Apr 17, 2025
Python 5 Updated Jan 25, 2026

Code for "Can Group Decision-Making Mitigates Social Bias in Large Language Models?"

Jupyter Notebook 4 Updated Dec 4, 2024

This repository contains code and models for the paper: Semantic Graphs for Generating Deep Questions (ACL 2020).

Python 75 32 Updated Jul 16, 2022

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 331 38 Updated Jan 29, 2026

This repository contains the source code for COrAL, an Order-Agnostic Language Modeling framework for Efficient Iterative Refinement.

Python 8 Updated Apr 2, 2025

Official implementation of the EMNLP 2024 paper "Are LLMs Good Zero-Shot Fallacy Classifiers?"

Jupyter Notebook 5 1 Updated Jan 25, 2025

[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering

Python 18 Updated Oct 9, 2024

[ACL’24 Findings] Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

48 Updated May 12, 2026

EMNLP2024 - AKEW: Assessing Knowledge Editing in the Wild

4 Updated Feb 12, 2025

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Python 383 41 Updated Jan 4, 2024

Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to compute clusters of semantically similar documents at diffe…

Jupyter Notebook 37 9 Updated Nov 6, 2023