Skip to content
View sh0416's full-sized avatar
🏃
🏃

Highlights

  • Pro

Organizations

@PoApper @lmgsg-sh0416

Block or report sh0416

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 477 68 Updated May 18, 2026

PyTorch implementation of soft actor critic

Python 944 190 Updated Jul 17, 2025

An alignment auditing agent capable of quickly exploring alignment hypothesis

Python 1,234 198 Updated Jun 17, 2026

Realistic examples of building evals and optimizing agents with Harbor

Python 104 10 Updated Apr 23, 2026

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]

Jupyter Notebook 692 42 Updated Jul 29, 2025

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 290 62 Updated Jul 13, 2025

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 5,256 715 Updated Jun 15, 2026

"OpenHarness: Open Agent Harness with a Built-in Personal Agent--Ohmo!"

Python 13,965 2,292 Updated Jun 4, 2026

An educational resource to help anyone learn deep reinforcement learning.

Python 11,813 2,454 Updated Aug 5, 2024

Simple RL training for reasoning

Python 3,867 287 Updated Dec 23, 2025

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Python 961 93 Updated Jun 16, 2026

Repository for "The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models", EMNLP 2023

Jupyter Notebook 12 4 Updated May 16, 2024
Python 13 1 Updated May 8, 2023

multilspy is a lsp client library in Python intended to be used to build applications around language servers.

Python 588 107 Updated Apr 16, 2026

EMNLP 2025 Findings

Python 1 1 Updated Sep 8, 2025

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 2,518 1,173 Updated Jun 17, 2026

Nano vLLM

Python 14,080 2,231 Updated Apr 26, 2026

🌟 Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion parameters.

HTML 450 37 Updated Sep 18, 2024

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 233 17 Updated Apr 13, 2026

An Open-Source Asynchronous Coding Agent

Python 9,996 1,135 Updated Jun 17, 2026

Google Gen AI Python SDK provides an interface for developers to integrate Google's generative models into their Python applications.

Python 3,809 918 Updated Jun 18, 2026

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,946 481 Updated Jun 10, 2026

An awesome code differencing tool

Java 1,313 187 Updated Jun 8, 2026

mcp for handling hwp

Python 257 62 Updated Jan 29, 2026

CycleResearcher: Improving Automated Research via Automated Review

Jupyter Notebook 393 36 Updated Mar 5, 2026

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)

Python 587 57 Updated Sep 10, 2024

TinyAGI is the agent teams orchestrator for One Person Company. (fka TinyClaw)

TypeScript 3,585 505 Updated Mar 30, 2026
Next