jxhe

Junxian He jxhe

Assistant Professor@HKUST. PhD@CMU LTI. Working on NLP/ML.

536 followers · 27 following

The Hong Kong University of Science and Technology
jxhe.github.io
@junxian_he

Achievements

x3 x2

Achievements

x3 x2

Organizations

Stars

GAIR-NLP / daVinci-MagiHuman

Python 1,395 113 Updated Mar 30, 2026

shiqichen17 / SkillCraft

Python 52 2 Updated Mar 13, 2026

hkust-nlp / AgentVista

Benchmarking multimodal agents on realistic, ultra-challenging visual scenarios requiring long-horizon hybrid tool use.

Python 43 5 Updated Mar 10, 2026

hkust-nlp / LOCA-bench

Benchmarking Language Agents Under Controllable and Extreme Context Growth

Python 34 3 Updated Mar 30, 2026

hkust-nlp / KernelGYM

[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations

Python 153 12 Updated Mar 29, 2026

affaan-m / everything-claude-code

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 124,500 16,578 Updated Mar 31, 2026

daytonaio / daytona

Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code

TypeScript 70,928 5,534 Updated Mar 31, 2026

harbor-framework / harbor

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 1,186 834 Updated Mar 31, 2026

supermemoryai / apple-mcp

Collection of apple-native tools for the model context protocol.

TypeScript 3,043 268 Updated Aug 11, 2025

MiniMax-AI / MiniMax-M2.1

MiniMax M2.1, a SOTA model for real-world dev & agents.

542 44 Updated Jan 28, 2026

anthropics / claude-cookbooks

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 36,810 3,996 Updated Mar 31, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 3,004 363 Updated Mar 31, 2026

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 87,922 9,410 Updated Mar 31, 2026

deepseek-ai / DeepSeek-Math-V2

Python 1,569 141 Updated Dec 1, 2025

ComposioHQ / awesome-claude-skills

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 49,764 5,157 Updated Feb 19, 2026

hkust-nlp / Toolathlon

[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 298 30 Updated Mar 31, 2026

MiniMax-AI / MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows.

2,529 203 Updated Nov 13, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 22,773 2,095 Updated Jan 27, 2026

deep-symbolic-mathematics / llm-srbench

[ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

Python 98 13 Updated Jul 31, 2025

hkust-nlp / deepsearch-tts

Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification

Python 22 1 Updated Oct 8, 2025

MineDojo / MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Java 2,176 195 Updated Mar 18, 2024

facebookresearch / meta-agents-research-environments

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 464 63 Updated Mar 26, 2026

axon-rl / gem

A Gym for Agentic LLMs

Python 472 31 Updated Jan 21, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,051 677 Updated Mar 29, 2026

deepseek-ai / DeepSeek-V3.2-Exp

Python 1,539 150 Updated Nov 18, 2025

anthropics / claude-agent-sdk-python

Python 5,993 805 Updated Mar 31, 2026

OpenHands / software-agent-sdk

A clean, modular SDK for building AI agents with OpenHands V1.

Python 615 194 Updated Mar 31, 2026

hkust-nlp / WebExplorer

The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"

Python 113 2 Updated Sep 29, 2025

meituan-longcat / LongCat-Flash-Chat

1,321 67 Updated Mar 22, 2026

hkust-nlp / model-task-align-rl

[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".

Python 17 Updated Feb 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Junxian He jxhe

Achievements

Achievements

Organizations

Block or report jxhe

Stars

GAIR-NLP / daVinci-MagiHuman

shiqichen17 / SkillCraft

hkust-nlp / AgentVista

hkust-nlp / LOCA-bench

hkust-nlp / KernelGYM

affaan-m / everything-claude-code

daytonaio / daytona

harbor-framework / harbor

supermemoryai / apple-mcp

MiniMax-AI / MiniMax-M2.1

anthropics / claude-cookbooks

thinking-machines-lab / tinker-cookbook

anthropics / claude-code

deepseek-ai / DeepSeek-Math-V2

ComposioHQ / awesome-claude-skills

hkust-nlp / Toolathlon

MiniMax-AI / MiniMax-M2

deepseek-ai / DeepSeek-OCR

deep-symbolic-mathematics / llm-srbench

hkust-nlp / deepsearch-tts

MineDojo / MineDojo

facebookresearch / meta-agents-research-environments

axon-rl / gem

THUDM / slime

deepseek-ai / DeepSeek-V3.2-Exp

anthropics / claude-agent-sdk-python

OpenHands / software-agent-sdk

hkust-nlp / WebExplorer

meituan-longcat / LongCat-Flash-Chat

hkust-nlp / model-task-align-rl