jxzhangjhu

🎯

Focusing

Jiaxin Zhang jxzhangjhu

🎯

Focusing

AI Researcher

153 followers · 93 following

Mountain View
10:44 (UTC -08:00)

Achievements

jxzhangjhu.github.io Public

HTML 1 MIT License Updated Dec 11, 2025
AgentEvolver Public
Forked from modelscope/AgentEvolver

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python Apache License 2.0 Updated Nov 21, 2025
enterprise-deep-research Public
Forked from SalesforceAIResearch/enterprise-deep-research

Salesforce Enterprise Deep Research

Python 1 Apache License 2.0 Updated Nov 19, 2025
lm-polygraph Public
Forked from IINemo/lm-polygraph

Python MIT License Updated Nov 16, 2025
DreamGym Public
Forked from Pi3AI/DreamGym

This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python Updated Nov 9, 2025
DeepAgent Public
Forked from RUC-NLPIR/DeepAgent

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python MIT License Updated Nov 2, 2025
torchforge Public
Forked from meta-pytorch/torchforge

PyTorch-native post-training at scale

Python BSD 3-Clause "New" or "Revised" License Updated Oct 28, 2025
magic-wormhole Public
Forked from magic-wormhole/magic-wormhole

get things from one computer to another, safely

Python MIT License Updated Oct 23, 2025
AgentRL Public
Forked from THUDM/AgentRL

Python MIT License Updated Oct 23, 2025
verl-agent Public
Forked from langfengQ/verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python Apache License 2.0 Updated Oct 20, 2025
MUSE Public
Forked from KnowledgeXLab/MUSE

Learning on the Job: An Experience-Driven, Self-Evolving Agent for Long-Horizon Tasks

Python MIT License Updated Oct 16, 2025
AgentBench Public
Forked from THUDM/AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python Apache License 2.0 Updated Oct 14, 2025
SEED-GRPO Public
Forked from Dreamer312/SEED-GRPO

The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Python Apache License 2.0 Updated Oct 14, 2025
Open-AgentRL Public
Forked from Gen-Verse/Open-AgentRL

Demystifying Reinforcement Learning in Agentic Reasoning

Python Apache License 2.0 Updated Oct 14, 2025
KnowRL Public
Forked from zjunlp/KnowRL

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Python MIT License Updated Oct 10, 2025
OpenManus-RL Public
Forked from OpenManus/OpenManus-RL

A live stream development of RL tunning for LLM agents

Python Apache License 2.0 Updated Oct 8, 2025
tinker-cookbook Public
Forked from thinking-machines-lab/tinker-cookbook

Post-training with Tinker

Python Apache License 2.0 Updated Oct 5, 2025
SpecBench Public
Forked from zzzhr97/SpecBench

Python Apache License 2.0 Updated Oct 5, 2025
AgentDebug Public
Forked from ulab-uiuc/AgentDebug

Python Updated Oct 1, 2025
EPO Public
Forked from WujiangXu/EPO

The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"

Python Apache License 2.0 Updated Oct 1, 2025
CapBound Public
Forked from qingjiesjtu/CapBound

Python Updated Sep 29, 2025
AgentGym-RL Public
Forked from WooooDyy/AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python MIT License Updated Sep 11, 2025
TrustEval-toolkit Public
Forked from TrustGen/TrustEval-toolkit

Toolkit for evaluating the trustworthiness of generative foundation models.

Python Other Updated Aug 22, 2025
RLCR Public
Forked from damanimehul/RLCR

Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty

Python MIT License Updated Aug 20, 2025
Awesome-Efficient-Reasoning-LLMs Public
Forked from Eclipsess/Awesome-Efficient-Reasoning-LLMs

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Updated Aug 11, 2025
MiroFlow Public
Forked from MiroMindAI/MiroFlow

Miroflow is an agent framework that simplifies the development of complex, multi-agent systems. Build, manage, and scale your AI agents with ease.

Python Apache License 2.0 Updated Aug 8, 2025
AWorld Public
Forked from inclusionAI/AWorld

Build, evaluate and train General Multi-Agent Assistance with ease

Python MIT License Updated Aug 6, 2025
deep_research_bench Public
Forked from Ayanami0730/deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python Apache License 2.0 Updated Aug 3, 2025
Influences-on-LLM-Calibration Public
Forked from Yuuxii/Influences-on-LLM-Calibration

Python MIT License Updated Jul 29, 2025
MUR Public
Forked from yayayacc/MUR

Python Updated Jul 25, 2025

Jiaxin Zhang jxzhangjhu

Achievements

Achievements

jxzhangjhu.github.io Public

Uh oh!

AgentEvolver Public

Uh oh!

enterprise-deep-research Public

Uh oh!

lm-polygraph Public

Uh oh!

DreamGym Public

Uh oh!

DeepAgent Public

Uh oh!

torchforge Public

Uh oh!

magic-wormhole Public

Uh oh!

AgentRL Public

Uh oh!

verl-agent Public

Uh oh!

MUSE Public

Uh oh!

AgentBench Public

Uh oh!

SEED-GRPO Public

Uh oh!

Open-AgentRL Public

Uh oh!

KnowRL Public

Uh oh!

OpenManus-RL Public

Uh oh!

tinker-cookbook Public

Uh oh!

SpecBench Public

Uh oh!

AgentDebug Public

Uh oh!

EPO Public

Uh oh!

CapBound Public

Uh oh!

AgentGym-RL Public

Uh oh!

TrustEval-toolkit Public

Uh oh!

RLCR Public

Uh oh!

Awesome-Efficient-Reasoning-LLMs Public

Uh oh!

MiroFlow Public

Uh oh!

AWorld Public

Uh oh!

deep_research_bench Public

Uh oh!

Influences-on-LLM-Calibration Public

Uh oh!

MUR Public

Uh oh!