awesome autoresearch list

Python 252 17 Updated Apr 8, 2026

Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.

Rust 24,144 1,217 Updated Apr 8, 2026

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 67,252 9,287 Updated Apr 8, 2026

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 4,697 3,073 Updated Mar 30, 2026

Benchmarking approaches to fine-tune AlphaGenome on lentiMPRA data

Python 5 Updated Apr 6, 2026

AI agents running research on single-GPU nanochat training automatically

Python 464 25 Updated Mar 13, 2026

⚒ Evolutionary self-improvement for Hermes Agent — optimize skills, prompts, and code using DSPy + GEPA

Python 782 70 Updated Mar 29, 2026

AI agents running research on single-GPU nanochat training automatically

Python 68,743 9,957 Updated Mar 26, 2026

Pixel office.

TypeScript 6,268 921 Updated Apr 6, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 59,531 7,507 Updated Apr 8, 2026

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript 14,658 1,494 Updated Feb 2, 2026

Train transformer language models with reinforcement learning.

Python 17,973 2,625 Updated Apr 8, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,323 913 Updated Apr 8, 2026

AllenAI's post-training codebase

Python 3,679 530 Updated Apr 8, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,411 380 Updated Nov 13, 2025

Replication archive for "Do Claude Code and Codex P-Hack? Sycophancy and Statistical Analysis in Large Language Models"

R 17 Updated Mar 3, 2026

Secure, Fast, and Extensible Sandbox runtime for AI agents.

Python 9,841 763 Updated Apr 8, 2026

Official PyTorch Implementation for Learning a Generative Meta-Model of LLM Activations

Jupyter Notebook 79 13 Updated Mar 18, 2026

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

TypeScript 33,258 3,701 Updated Apr 8, 2026

Hypernetworks that update LLMs to remember factual information

Python 664 71 Updated Mar 2, 2026

Scaling Preference Data Curation via Human-AI Synergy

146 5 Updated Jul 3, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 606 55 Updated Oct 7, 2025

HumanLM: Simulating Users with State Alignment Beats Response Imitation

Python 70 9 Updated Feb 27, 2026

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

1,064 41 Updated Mar 15, 2026

Code and Data for Tau-Bench

Python 1,169 189 Updated Mar 18, 2026

An ARC-AGI solution using Agentica from Symbolica

Python 176 16 Updated Feb 12, 2026

"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"

Python 38,570 6,733 Updated Apr 8, 2026

pip install continualcode

Python 39 4 Updated Feb 10, 2026

Official MCP server implementation for accessing Open Targets Data

Python 26 2 Updated Apr 7, 2026