jxzhangjhu

🎯

Focusing

Jiaxin Zhang jxzhangjhu

🎯

Focusing

AI Researcher

158 followers · 95 following

Mountain View
17:11 (UTC -07:00)

Achievements

Starred repositories

long-horizon-execution / measuring-execution

Python 56 9 Updated Mar 18, 2026

HKUDS / nanobot

"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"

Python 39,804 6,984 Updated Apr 16, 2026

OpenRaiser / NanoResearch

🦞+🔬: NanoResearch: The Autonomous AI Research Assistant

Python 697 138 Updated Apr 13, 2026

tongjingqi / AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…

391 10 Updated Mar 29, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 4,991 527 Updated Apr 16, 2026

sevn-ai / agentic-uncertainty

Can AI agents predict whether they will succeed at a task?

Python 7 1 Updated Feb 9, 2026

NVIDIA-NeMo / Automodel

🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 441 129 Updated Apr 16, 2026

huggingface / trl

Train transformer language models with reinforcement learning.

Python 18,072 2,647 Updated Apr 16, 2026

open-tinker / OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 660 63 Updated Mar 21, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 776 83 Updated Feb 18, 2026

WooooDyy / AgentGym

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 766 109 Updated Sep 11, 2025

blacksnail789521 / Agentic-RL-Training-Recipes

Training Recipes for Agentic Reinforcement Learning in LLMs: A Survey

26 Updated Jan 30, 2026

datawhalechina / hello-agents

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 37,550 4,451 Updated Apr 15, 2026

LAMDA-NeSy / Self-Backtracking

Python 53 7 Updated Feb 12, 2025

MIT-MI / MEM1

Python 303 20 Updated Jan 3, 2026

ByteDance-Seed / Agent-R

Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"

Python 171 20 Updated Oct 20, 2025

TeleAI-UAGI / Awesome-Agent-Memory

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

352 18 Updated Apr 16, 2026

MiroMindAI / MiroThinker

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7 and MiroThinker-H1, achieve 74.0 and 88.2 on the BrowseComp, respectively.

Python 8,123 607 Updated Apr 13, 2026

OpenDCAI / Paper2Any

Turn paper/text/topic into editable research figures, technical route diagrams, and presentation slides.

Python 2,151 149 Updated Apr 15, 2026

UniPat-AI / BabyVision

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python 211 7 Updated Jan 13, 2026

PLUM-Lab / R2I-Bench

Python 13 1 Updated Mar 14, 2026

SalesforceAIResearch / enterprise-deep-research

Salesforce Enterprise Deep Research

Python 1,153 180 Updated Jan 30, 2026

modelscope / AgentEvolver

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python 1,392 160 Updated Apr 1, 2026

Pi3AI / DreamGym

This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python 39 3 Updated Nov 9, 2025