xiyanghu

Focusing

Xiyang (Sean) Hu xiyanghu

Focusing

PhD Student @ Carnegie Mellon University

81 followers · 49 following

Carnegie Mellon University
San Jose, CA
https://xiyanghu.github.io/

Achievements

Highlights

Organizations

Lists (12)

Sort

Stars

lszshu / SSDataBench

Python 4 1 Updated Dec 23, 2025

llm-eval-mental-health / CounselBench

Python 20 4 Updated May 2, 2026

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 16,358 3,962 Updated May 17, 2026

hqhq1025 / ai-course-notes

303 份 AI/LLM 中文讲义，支持在线阅读、PDF 下载和 LaTeX 源码查看 | Stanford CS336/CS224R/CS25 | Berkeley LLM Agents | Agent 工程实践

TeX 116 4 Updated May 17, 2026

yzhao062 / anywhere-agents

One config to rule all your AI agents: portable (every project, every session), effective (curated writing, routing, skills), and safer (destructive-command guard).

Python 173 19 Updated May 17, 2026

Rose-STL-Lab / ml-hilbert

Forked from apple/ml-hilbert

Python 39 7 Updated Oct 31, 2025

DaRL-GenAI / instructional_agents

Forked from Hyan-Yao/instructional_agents

(EACL'26 Main) Instructional Agents: Reducing Teaching Faculty Workload through Multi-Agent Instructional Design

Python 622 125 Updated Apr 18, 2026

stanford-iris-lab / meta-harness-tbench2-artifact

Meta-Harness: 76.4% on Terminal-Bench 2.0 (Claude Opus 4.6)

Python 1,036 149 Updated Mar 26, 2026

zlab-princeton / vero

Vero: An Open RL Recipe for General Visual Reasoning

Python 121 10 Updated Apr 19, 2026

Orchestra-Research / AI-Research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 8,542 652 Updated Apr 28, 2026

t54-labs / AgenticRiskStandard

Agentic Risk Standard is a settlement-layer standard for trustworthy transactions with AI Agent

Python 31 2 Updated Mar 29, 2026

xlang-ai / AgentNetTool

This is the official code base of AgentNetTool in OpenCUA. Website: https://opencua.xlang.ai/

TypeScript 47 10 Updated Sep 3, 2025

zou-group / sleepfm-clinical

Python 693 139 Updated Mar 7, 2026

GeniusHTX / SWE-Skills-Bench

The official repo of our paper, "SWE-Skills-Bench:Do Agent Skills Actually Help in Real-World Software Engineering?"

Python 41 7 Updated Apr 14, 2026

ZJU-REAL / SkillZero

Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"

Python 269 9 Updated May 16, 2026

KaihuaTang / Self-Critical-Inference-Framework

This is the official implementation of the CVPR 2026 paper: "Scaling Test-Time Robustness of Vision-Language Models via Self-Critical Inference Framework"

Python 1 Updated Mar 10, 2026

ZJU-REAL / CoVerRL

[ACL 2026 main] CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

Python 23 Updated Apr 18, 2026

hhh675597 / revisiting_opd

Python 54 4 Updated Apr 12, 2026

orcetra / orcetra

HTML 4 Updated Apr 4, 2026

ultraworkers / claw-code

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 191,769 109,915 Updated May 16, 2026