Jiacheng Miao jmiao24

Building AI to do research @Stanford

146 followers · 14 following

Stanford University
Palo Alto, CA
jiachengmiao.com
@Jiacheng_Miao

Highlights

Pro

Stars

yibie / awesome-autoresearch

awesome autoresearch list

Python 252 17 Updated Apr 8, 2026

googleworkspace / cli

Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.

Rust 24,144 1,217 Updated Apr 8, 2026

garrytan / gstack

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 67,252 9,287 Updated Apr 8, 2026

openai / parameter-golf

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 4,697 3,073 Updated Mar 30, 2026

Al-Murphy / alphagenome_FT_MPRA

Benchmarking approaches to fine-tune AlphaGenome on lentiMPRA data

Python 5 Updated Apr 6, 2026

mutable-state-inc / autoresearch-at-home

Forked from karpathy/autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 464 25 Updated Mar 13, 2026

NousResearch / hermes-agent-self-evolution

⚒ Evolutionary self-improvement for Hermes Agent — optimize skills, prompts, and code using DSPy + GEPA

Python 782 70 Updated Mar 29, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 68,743 9,957 Updated Mar 26, 2026

pablodelucca / pixel-agents

Pixel office.

TypeScript 6,268 921 Updated Apr 6, 2026

bytedance / deer-flow

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 59,531 7,507 Updated Apr 8, 2026

snarktank / ralph

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript 14,658 1,494 Updated Feb 2, 2026

huggingface / trl

Train transformer language models with reinforcement learning.

Python 17,973 2,625 Updated Apr 8, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,323 913 Updated Apr 8, 2026

allenai / open-instruct

AllenAI's post-training codebase

Python 3,679 530 Updated Apr 8, 2026

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,411 380 Updated Nov 13, 2025

janetmalzahn / llm-phacking

Replication archive for "Do Claude Code and Codex P-Hack? Sycophancy and Statistical Analysis in Large Language Models"

R 17 Updated Mar 3, 2026

alibaba / OpenSandbox

Secure, Fast, and Extensible Sandbox runtime for AI agents.

Python 9,841 763 Updated Apr 8, 2026

g-luo / generative_latent_prior

Official PyTorch Implementation for Learning a Generative Meta-Model of LLM Activations

Jupyter Notebook 79 13 Updated Mar 18, 2026

badlogic / pi-mono

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

TypeScript 33,258 3,701 Updated Apr 8, 2026

SakanaAI / doc-to-lora

Hypernetworks that update LLMs to remember factual information

Python 664 71 Updated Mar 2, 2026

SkyworkAI / Skywork-Reward-V2

Scaling Preference Data Curation via Human-AI Synergy

146 5 Updated Jul 3, 2025

openai / emergent-misalignment-persona-features

Python 54 16 Updated Jun 26, 2025

McGill-NLP / nano-aha-moment

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 606 55 Updated Oct 7, 2025

zou-group / humanlm

HumanLM: Simulating Users with State Alignment Beats Response Imitation

Python 70 9 Updated Feb 27, 2026

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

1,064 41 Updated Mar 15, 2026

sierra-research / tau-bench

Code and Data for Tau-Bench

Python 1,169 189 Updated Mar 18, 2026

symbolica-ai / arcgentica

An ARC-AGI solution using Agentica from Symbolica

Python 176 16 Updated Feb 12, 2026

HKUDS / nanobot

"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"

Python 38,570 6,733 Updated Apr 8, 2026

sdan / continualcode

pip install continualcode

Python 39 4 Updated Feb 10, 2026

opentargets / open-targets-platform-mcp

Official MCP server implementation for accessing Open Targets Data

Python 26 2 Updated Apr 7, 2026