Jun-jie-Huang

Jun-jie-Huang

Let's go

57 followers · 56 following

The Chinese University of Hong Kong
Hong Kong
https://jun-jie-huang.github.io/

Achievements

Stars

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 53,208 7,408 Updated Mar 21, 2026

Orchestra-Research / AI-Research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 5,528 435 Updated Mar 24, 2026

TianHongZXY / RLVR-Decomposed

[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"

Python 163 9 Updated Mar 2, 2026

Wusiwei0410 / TerminalTraj

This is the repo for the paper TerminalTraj: Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments

7 Updated Feb 10, 2026

hkust-nlp / KernelGYM

[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations

Python 150 7 Updated Mar 24, 2026

ByteDance-Seed / Seed-1.8

Jupyter Notebook 214 3 Updated Dec 19, 2025

MoonshotAI / Kimi-K2.5

Moonshot's most powerful model

1,524 165 Updated Jan 31, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 333,533 65,016 Updated Mar 24, 2026

LeapLabTHU / JustGRPO

Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".

Python 130 4 Updated Mar 24, 2026

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 10,563 1,228 Updated Feb 21, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 5,761 373 Updated Mar 19, 2026

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 57,056 4,716 Updated Mar 24, 2026

microsoft / LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,310 370 Updated Mar 23, 2026

sjtu-sai-agents / ML-Master

The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"

Python 379 47 Updated Jan 16, 2026

jd-opensource / joycode-agent

Repository-level Repair Agent Based on SWE-Bench—JoyCode Agent

Python 325 20 Updated Oct 11, 2025

OpsPAI / PreServe

PreServe: Intelligent Management for LMaaS Systems via Hierarchical Prediction [ICSE'26]

Jupyter Notebook 6 Updated Oct 20, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,276 523 Updated Mar 24, 2026

SWE-Perf / SWE-Perf

Python 48 9 Updated Oct 28, 2025

tongye98 / Awesome-Code-Benchmark

A comprehensive code domain benchmark review of LLM researches.

210 16 Updated Sep 22, 2025

facebookresearch / cwm

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 859 69 Updated Dec 26, 2025

aorwall / moatless-tree-search

Python 131 28 Updated Jun 6, 2025

zhenyuhe00 / SWE-Swiss

SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution

Python 103 6 Updated Sep 24, 2025

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 3,470 475 Updated Mar 24, 2026

OpenHands / ToM-SWE

The theory of mind module for the SWE agent

Python 92 12 Updated Jan 13, 2026

OpenAutoCoder / Agentless

Agentless🐱: an agentless approach to automatically solve software development problems

Python 2,022 228 Updated Dec 22, 2024

Elfsong / Afterburner

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Python 12 Updated Aug 20, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,418 483 Updated Mar 24, 2026

ZitongYang / Synthetic_Continued_Pretraining

Code implementation of synthetic continued pretraining

Jupyter Notebook 157 16 Updated Jan 6, 2025

ganler / code-r1

Reproducing R1 for Code with Reliable Rewards

Python 299 18 Updated May 5, 2025

anomalyco / opencode

The open source coding agent.

TypeScript 129,258 13,680 Updated Mar 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jun-jie-Huang

Achievements

Achievements

Block or report Jun-jie-Huang

Stars

karpathy / autoresearch

Orchestra-Research / AI-Research-SKILLs

TianHongZXY / RLVR-Decomposed

Wusiwei0410 / TerminalTraj

hkust-nlp / KernelGYM

ByteDance-Seed / Seed-1.8

MoonshotAI / Kimi-K2.5

openclaw / openclaw

LeapLabTHU / JustGRPO

simular-ai / Agent-S

zhaochenyang20 / Awesome-ML-SYS-Tutorial

opendatalab / MinerU

microsoft / LMOps

sjtu-sai-agents / ML-Master

jd-opensource / joycode-agent

OpsPAI / PreServe

rllm-org / rllm

SWE-Perf / SWE-Perf

tongye98 / Awesome-Code-Benchmark

facebookresearch / cwm

aorwall / moatless-tree-search

zhenyuhe00 / SWE-Swiss

SWE-agent / mini-swe-agent

OpenHands / ToM-SWE

OpenAutoCoder / Agentless

Elfsong / Afterburner

tile-ai / tilelang

ZitongYang / Synthetic_Continued_Pretraining

ganler / code-r1

anomalyco / opencode