jmiao24

Jiacheng Miao jmiao24

Building AI to do research @Stanford

145 followers · 14 following

Stanford University
Palo Alto, CA
jiachengmiao.com
@Jiacheng_Miao

Achievements

Highlights

Stars

openai / parameter-golf

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 3,887 2,258 Updated Mar 24, 2026

Al-Murphy / alphagenome_FT_MPRA

Benchmarking approaches to fine-tune AlphaGenome on lentiMPRA data

Python 5 Updated Mar 19, 2026

mutable-state-inc / autoresearch-at-home

Forked from karpathy/autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 443 23 Updated Mar 13, 2026

NousResearch / hermes-agent-self-evolution

⚒ Evolutionary self-improvement for Hermes Agent — optimize skills, prompts, and code using DSPy + GEPA

Python 261 22 Updated Mar 9, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 54,017 7,518 Updated Mar 21, 2026

pablodelucca / pixel-agents

Pixel office.

TypeScript 5,275 757 Updated Mar 24, 2026

bytedance / deer-flow

An open-source SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that c…

Python 42,898 5,028 Updated Mar 24, 2026

snarktank / ralph

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript 13,681 1,409 Updated Feb 2, 2026

huggingface / trl

Train transformer language models with reinforcement learning.

Python 17,775 2,586 Updated Mar 24, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,235 903 Updated Mar 24, 2026

allenai / open-instruct

AllenAI's post-training codebase

Python 3,651 515 Updated Mar 24, 2026

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,288 369 Updated Nov 13, 2025

janetmalzahn / llm-phacking

Replication archive for "Do Claude Code and Codex P-Hack? Sycophancy and Statistical Analysis in Large Language Models"

R 17 Updated Mar 3, 2026

alibaba / OpenSandbox

Secure, Fast, and Extensible Sandbox runtime for AI agents.

Python 9,236 701 Updated Mar 24, 2026

g-luo / generative_latent_prior

Official PyTorch Implementation for Learning a Generative Meta-Model of LLM Activations

Jupyter Notebook 73 12 Updated Mar 18, 2026

badlogic / pi-mono

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

TypeScript 27,622 2,916 Updated Mar 24, 2026

SakanaAI / doc-to-lora

Hypernetworks that update LLMs to remember factual information

Python 609 65 Updated Mar 2, 2026

SkyworkAI / Skywork-Reward-V2

Scaling Preference Data Curation via Human-AI Synergy

145 5 Updated Jul 3, 2025

openai / emergent-misalignment-persona-features

Python 53 16 Updated Jun 26, 2025

McGill-NLP / nano-aha-moment

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 604 54 Updated Oct 7, 2025

zou-group / humanlm

HumanLM: Simulating Users with State Alignment Beats Response Imitation

Python 69 8 Updated Feb 27, 2026

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

1,040 40 Updated Mar 15, 2026

sierra-research / tau-bench

Code and Data for Tau-Bench

Python 1,140 187 Updated Mar 18, 2026

symbolica-ai / arcgentica

An ARC-AGI solution using Agentica from Symbolica

Python 167 16 Updated Feb 12, 2026

HKUDS / nanobot

"🐈 nanobot: The Ultra-Lightweight OpenClaw"

Python 35,978 6,141 Updated Mar 24, 2026

sdan / continualcode

pip install continualcode

Python 37 3 Updated Feb 10, 2026

opentargets / open-targets-platform-mcp

Official MCP server implementation for accessing Open Targets Data

Python 25 2 Updated Mar 20, 2026

thinking-machines-lab / tinker-project-ideas

Ideas for projects related to Tinker

173 9 Updated Nov 6, 2025

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 681 64 Updated Feb 18, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 334,248 65,185 Updated Mar 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiacheng Miao jmiao24

Achievements

Achievements

Highlights

Block or report jmiao24

Stars

openai / parameter-golf

Al-Murphy / alphagenome_FT_MPRA

mutable-state-inc / autoresearch-at-home

NousResearch / hermes-agent-self-evolution

karpathy / autoresearch

pablodelucca / pixel-agents

bytedance / deer-flow

snarktank / ralph

huggingface / trl

OpenRLHF / OpenRLHF

allenai / open-instruct

PeterGriffinJin / Search-R1

janetmalzahn / llm-phacking

alibaba / OpenSandbox

g-luo / generative_latent_prior

badlogic / pi-mono

SakanaAI / doc-to-lora

SkyworkAI / Skywork-Reward-V2

openai / emergent-misalignment-persona-features

McGill-NLP / nano-aha-moment

zou-group / humanlm

yfzhang114 / Awesome-Multimodal-Large-Language-Models

sierra-research / tau-bench

symbolica-ai / arcgentica

HKUDS / nanobot

sdan / continualcode

opentargets / open-targets-platform-mcp

thinking-machines-lab / tinker-project-ideas

lasgroup / SDPO

openclaw / openclaw