JohnZhou john1226966735

10 followers · 69 following

https://john1226966735.github.io/

Starred repositories

Zehong-Wang / Awesome-Foundation-Models-on-Graphs

A collection of graph foundation models including papers, codes, and datasets.

151 14 Updated Jul 10, 2025

BUPT-GAMMA / GFMPapers

Must-read papers on graph foundation models (GFMs)

355 29 Updated Aug 24, 2025

EternityYW / Metacognitive-Prompting

Metacognitive Prompting Improves Understanding in Large Language Models (NAACL 2024)

40 8 Updated Nov 8, 2023

qishenghu / Decomp_Dilemmas

NAACL2025 - Decomposition Dilemmas: Does Claim Decomposition Boost or Burden Fact-Checking Performance?

Python 7 Updated Sep 9, 2025

NineAbyss / S2R

This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"

Python 72 3 Updated Apr 22, 2025

Ahm-rgb / Alpha-SQL

Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search" [ICML'25]

Python 6 1 Updated Dec 21, 2025

Aurora-slz / BEATS

Python 12 Updated Sep 27, 2024

datawhalechina / hello-agents

📚 "Building Agents from Scratch": a from-the-ground-up tutorial on the principles and practice of intelligent agents

Python 11,287 1,182 Updated Dec 20, 2025

aakaran / reasoning-with-sampling

Python 358 47 Updated Nov 7, 2025

princeton-pli / STAT

Skill-Targeted Adaptive Training

7 2 Updated Oct 21, 2025

gouki510 / Topology_of_Reasoning

Python 34 2 Updated Jun 11, 2025

bingreeky / MaAS

[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet

Python 227 26 Updated Nov 13, 2025

ziyuwan / ReMA-public

Reinforced Multi-LLM Agents training

Python 61 4 Updated Jun 9, 2025

hanqi-qi / LLM_MetaReasoning

9 1 Updated Jul 29, 2025

chicosirius / think-or-not

Python 14 2 Updated May 23, 2025

antgroup / Research-Venus

Deep Research

Python 303 11 Updated Aug 26, 2025

TIGER-AI-Lab / Hierarchical-Reasoner

Forked from HaozheH3/Hierarchical-Reasoner

Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning

Python 53 2 Updated Oct 24, 2025

yuelinan / Awesome-Efficient-R1-style-LRMs

45 Updated Aug 14, 2025

jianshuod / CogTest

Jupyter Notebook 5 Updated Jun 13, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,742 2,405 Updated Nov 24, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,179 120 Updated Nov 9, 2025

DavidZWZ / Awesome-RAG-Reasoning

[EMNLP 2025] Awesome RAG Reasoning Resources

366 29 Updated Jul 24, 2025

facebookresearch / deepconf

DeepConf: Deep Think with Confidence

Python 334 50 Updated Sep 18, 2025

sail-sg / Video-Next-Event-Prediction

Python 19 1 Updated Aug 9, 2025

asinghcsu / AgenticRAG-Survey

Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.

1,345 161 Updated Oct 20, 2025

MozerWang / AMPO

[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents

Python 46 5 Updated Jul 1, 2025

ReTool-RL / ReTool

Python 250 19 Updated Aug 12, 2025

qiancheng0 / ToolRL

Python 402 30 Updated Oct 16, 2025

sparkle-reasoning / sparkle

[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Python 14 Updated Dec 12, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,279 106 Updated Dec 15, 2025
