Skip to content
View john1226966735's full-sized avatar

Highlights

  • Pro

Block or report john1226966735

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A collection of graph foundation models including papers, codes, and datasets.

151 14 Updated Jul 10, 2025

Must-read papers on graph foundation models (GFMs)

355 29 Updated Aug 24, 2025

Metacognitive Prompting Improves Understanding in Large Language Models (NAACL 2024)

40 8 Updated Nov 8, 2023

NAACL2025 - Decomposition Dilemmas: Does Claim Decomposition Boost or Burden Fact-Checking Performance?

Python 7 Updated Sep 9, 2025

This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"

Python 72 3 Updated Apr 22, 2025

Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search" [ICML'25]

Python 6 1 Updated Dec 21, 2025
Python 12 Updated Sep 27, 2024

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 11,287 1,182 Updated Dec 20, 2025

Skill-Targeted Adaptive Training

7 2 Updated Oct 21, 2025

[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet

Python 227 26 Updated Nov 13, 2025

Reinforced Multi-LLM Agents training

Python 61 4 Updated Jun 9, 2025
Python 14 2 Updated May 23, 2025

Deep Research

Python 303 11 Updated Aug 26, 2025

Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning

Python 53 2 Updated Oct 24, 2025
Jupyter Notebook 5 Updated Jun 13, 2025

Fully open reproduction of DeepSeek-R1

Python 25,742 2,405 Updated Nov 24, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,179 120 Updated Nov 9, 2025

[EMNLP 2025] Awesome RAG Reasoning Resources

366 29 Updated Jul 24, 2025

DeepConf: Deep Think with Confidence

Python 334 50 Updated Sep 18, 2025

Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.

1,345 161 Updated Oct 20, 2025

[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents

Python 46 5 Updated Jul 1, 2025
Python 250 19 Updated Aug 12, 2025
Python 402 30 Updated Oct 16, 2025

[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Python 14 Updated Dec 12, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,279 106 Updated Dec 15, 2025
Next