  • Mountain View
jxzhangjhu/README.md

Hi, I'm Jiaxin Zhang 👋

I am an AI researcher working on reliable long-horizon AI agents, agentic reinforcement learning, and calibrated post-training.

My research asks a simple question:

How can AI agents know what they don’t know, act under uncertainty, and improve from their own prediction–reality gaps?

I build methods, environments, and evaluation frameworks that turn uncertainty, confidence, and consistency into first-class training signals for reliable and self-improving AI systems.

Homepage · Google Scholar · LinkedIn · X/Twitter · Email

Research Focus

  • Agentic RL & Post-training
    Calibration-aware on-policy distillation, GRPO/RL training, self-evolving environments, synthetic feedback, and reward/evaluator design for long-horizon agents.
  • Alignment, Calibration & Honesty
    Uncertainty-aware supervision, confidence calibration, hallucination detection, factuality, scalable oversight, and reliable model behavior.
  • Long-horizon Agents & Evaluation
    Tool use, planning, trajectory-level evaluation, deep research agents, evidence grounding, failure attribution, and enterprise-scale agent benchmarks.

Selected Work

For the full list of publications, please see my Google Scholar or homepage.

Contact

I am interested in reliable AI agents, agentic RL, post-training, calibration, uncertainty, scalable evaluation, and self-improving AI systems. Feel free to reach out via email or visit my homepage.

Pinned

  1. SURGroup/UQpy (Public)

    UQpy (Uncertainty Quantification with Python) is a general-purpose Python toolbox for modeling uncertainty in physical and mathematical systems.

    Python · 359 stars · 98 forks

  2. Awesome-LLM-Uncertainty-Reliability-Robustness (Public)

    Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

    823 stars · 59 forks

  3. intuit/sac3 (Public)

    Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency

    Jupyter Notebook · 39 stars · 8 forks

  4. Awesome-LLM-RAG (Public)

    Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

    1.3k stars · 83 forks

  5. Awesome-LLM-Prompt-Optimization (Public)

    Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models

    411 stars · 22 forks

  6. SalesforceAIResearch/CaOPD (Public)

    CaOPD: Calibration-Aware On-Policy Distillation

    Python · 13 stars · 2 forks