Jiaxin Zhang jxzhangjhu

🎯

Focusing

AI Researcher

158 followers · 95 following

Mountain View
22:11 (UTC -07:00)

Awesome-LLM-RAG Public

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

embeddings rag retrieval-information large-language-models llm retrieval-augmented-generation rag-embeddings

1,320 78 Updated Apr 6, 2026
Awesome-LLM-Uncertainty-Reliability-Robustness Public

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

reliability calibration safety awesome-list uncertainty-quantification uncertainty-estimation robustness

816 54 MIT License Updated Apr 5, 2026
claude-code-source-code Public
Forked from sanbuphy/learn-coding-agent

Claude Code v2.1.88 Source Code

TypeScript Updated Mar 31, 2026
nanobot Public
Forked from HKUDS/nanobot

"🐈 nanobot: The Ultra-Lightweight OpenClaw"

Python MIT License Updated Mar 22, 2026
AI-Can-Learn-Scientific-Taste Public
Forked from tongjingqi/AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…

Apache License 2.0 Updated Mar 22, 2026
NanoResearch Public
Forked from OpenRaiser/NanoResearch

🦞+🔬: NanoResearch: The Autonomous AI Research Assistant

Python MIT License Updated Mar 21, 2026
measuring-execution Public
Forked from long-horizon-execution/measuring-execution

Python Updated Mar 18, 2026
OpenClaw-RL Public
Forked from Gen-Verse/OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

TypeScript MIT License Updated Mar 15, 2026
jxzhangjhu.github.io Public

HTML 1 MIT License Updated Feb 22, 2026
Automodel Public
Forked from NVIDIA-NeMo/Automodel

PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python Apache License 2.0 Updated Feb 12, 2026
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Feb 11, 2026
agentic-uncertainty Public
Forked from sevn-ai/agentic-uncertainty

Python Updated Feb 9, 2026
OpenTinker Public
Forked from open-tinker/OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python Apache License 2.0 Updated Feb 8, 2026
SDPO Public
Forked from lasgroup/SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python Apache License 2.0 Updated Feb 5, 2026
hello-agents Public
Forked from datawhalechina/hello-agents

📚 "Building Agents from Scratch": a from-the-ground-up tutorial on agent principles and practice

Python Other Updated Feb 2, 2026
Paper2Any Public
Forked from OpenDCAI/Paper2Any

Turn paper/text/topic into editable research figures, technical route diagrams, and presentation slides.

Python Apache License 2.0 Updated Jan 20, 2026
Awesome-Agent-Memory Public
Forked from TeleAI-UAGI/Awesome-Agent-Memory

Curated systems, benchmarks, papers, and more on memory for LLMs/MLLMs: long-term context, retrieval, and reasoning.

Apache License 2.0 Updated Jan 18, 2026
MiroThinker Public
Forked from MiroMindAI/MiroThinker

MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 60.2% Avg@8 score on the challenging GAIA benchmark.

Python MIT License Updated Jan 16, 2026
BabyVision Public
Forked from UniPat-AI/BabyVision

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python Updated Jan 13, 2026
MEM1 Public
Forked from MIT-MI/MEM1

Python MIT License Updated Jan 3, 2026
AgentEvolver Public
Forked from modelscope/AgentEvolver

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python Apache License 2.0 Updated Nov 21, 2025
enterprise-deep-research Public
Forked from SalesforceAIResearch/enterprise-deep-research

Salesforce Enterprise Deep Research

Python 1 Apache License 2.0 Updated Nov 19, 2025
lm-polygraph Public
Forked from IINemo/lm-polygraph

Python MIT License Updated Nov 16, 2025
DreamGym Public
Forked from Pi3AI/DreamGym

This is an AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python Updated Nov 9, 2025
DeepAgent Public
Forked from RUC-NLPIR/DeepAgent

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python MIT License Updated Nov 2, 2025
torchforge Public
Forked from meta-pytorch/torchforge

PyTorch-native post-training at scale

Python BSD 3-Clause "New" or "Revised" License Updated Oct 28, 2025
magic-wormhole Public
Forked from magic-wormhole/magic-wormhole

get things from one computer to another, safely

Python MIT License Updated Oct 23, 2025
AgentRL Public
Forked from THUDM/AgentRL

Python MIT License Updated Oct 23, 2025
Agent-R Public
Forked from ByteDance-Seed/Agent-R

Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"

Python Apache License 2.0 Updated Oct 20, 2025
verl-agent Public
Forked from langfengQ/verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python Apache License 2.0 Updated Oct 20, 2025
