jxzhangjhu

🎯

Focusing

Jiaxin Zhang jxzhangjhu

🎯

Focusing

AI Researcher

156 followers · 95 following

Mountain View
06:56 (UTC -07:00)

Achievements

Starred repositories

HKUDS / nanobot

"🐈 nanobot: The Ultra-Lightweight OpenClaw"

Python 35,923 6,127 Updated Mar 24, 2026

OpenRaiser / NanoResearch

🦞+🔬: NanoResearch: The Autonomous AI Research Assistant

Python 233 19 Updated Mar 24, 2026

tongjingqi / AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…

353 10 Updated Mar 23, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 4,129 405 Updated Mar 23, 2026

sevn-ai / agentic-uncertainty

Can AI agents predict whether they will succeed at a task?

Python 6 1 Updated Feb 9, 2026

NVIDIA-NeMo / Automodel

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 386 101 Updated Mar 24, 2026

huggingface / trl

Train transformer language models with reinforcement learning.

Python 17,772 2,586 Updated Mar 24, 2026

open-tinker / OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 650 61 Updated Mar 21, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 681 64 Updated Feb 18, 2026

WooooDyy / AgentGym

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 749 107 Updated Sep 11, 2025

blacksnail789521 / Agentic-RL-Training-Recipes

Training Recipes for Agentic Reinforcement Learning in LLMs: A Survey

21 Updated Jan 30, 2026

datawhalechina / hello-agents

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 30,090 3,423 Updated Mar 24, 2026

LAMDASZ-ML / Self-Backtracking

Python 52 7 Updated Feb 12, 2025

MIT-MI / MEM1

Python 289 19 Updated Jan 3, 2026

ByteDance-Seed / Agent-R

Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"

Python 170 20 Updated Oct 20, 2025

TeleAI-UAGI / Awesome-Agent-Memory

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

301 13 Updated Mar 23, 2026

MiroMindAI / MiroThinker

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7 and MiroThinker-H1, achieve 74.0 and 88.2 on the BrowseComp, respectively.

Python 8,061 585 Updated Mar 24, 2026

OpenDCAI / Paper2Any

Turn paper/text/topic into editable research figures, technical route diagrams, and presentation slides.

Python 1,994 140 Updated Mar 22, 2026

UniPat-AI / BabyVision

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python 201 7 Updated Jan 13, 2026

PLUM-Lab / R2I-Bench

Python 12 1 Updated Mar 14, 2026

SalesforceAIResearch / enterprise-deep-research

Salesforce Enterprise Deep Research

Python 1,146 178 Updated Jan 30, 2026

modelscope / AgentEvolver

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python 1,302 148 Updated Jan 30, 2026

Pi3AI / DreamGym

This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python 39 3 Updated Nov 9, 2025