jxzhangjhu
🎯
Focusing
  • Mountain View
  • 16:42 (UTC -08:00)

Starred repositories

Python 10 1 Updated Jul 7, 2025

Salesforce Enterprise Deep Research

Python 1,007 161 Updated Nov 19, 2025

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python 883 105 Updated Dec 18, 2025

This is an unofficial AI implementation of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python 30 2 Updated Nov 9, 2025

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python 869 111 Updated Nov 2, 2025
Python 21 1 Updated Dec 14, 2024
Python 403 54 Updated Dec 18, 2025

Learning on the Job: An Experience-Driven, Self-Evolving Agent for Long-Horizon Tasks

Python 29 9 Updated Oct 16, 2025

A live stream development of RL tuning for LLM agents

Python 3,681 515 Updated Oct 8, 2025
Python 61 10 Updated Oct 1, 2025

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Python 92 10 Updated Nov 25, 2024

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Scala 321 33 Updated Dec 3, 2025

PyTorch-native post-training at scale

Python 569 71 Updated Dec 18, 2025

get things from one computer to another, safely

Python 22,094 716 Updated Dec 16, 2025

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 529 57 Updated Sep 11, 2025

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 155 8 Updated Dec 16, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,293 113 Updated Dec 11, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,007 220 Updated Nov 17, 2025

The code for the paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"

Python 36 1 Updated Oct 1, 2025

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Python 38 5 Updated Dec 1, 2025

Toolkit for evaluating the trustworthiness of generative foundation models.

Python 123 10 Updated Aug 22, 2025
Python 5 1 Updated Sep 29, 2025

Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty

Python 46 6 Updated Aug 20, 2025

The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Python 148 11 Updated Oct 14, 2025

Demystifying Reinforcement Learning in Agentic Reasoning

Python 131 22 Updated Oct 14, 2025
Python 22 2 Updated Oct 30, 2025

Post-training with Tinker

Python 2,572 243 Updated Dec 19, 2025

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

710 34 Updated Oct 20, 2025
Python 46 6 Updated Oct 2, 2025