Skip to content
View boyiwei's full-sized avatar
🤡
🤡

Highlights

  • Pro

Organizations

@princeton-polaris-lab

Block or report boyiwei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
104 results for source starred repositories
Clear filter

Official Inspect Implementation for "ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases"

Python 30 3 Updated Dec 1, 2025

A Diagnostic Guardrail Framework for AI Agent Safety and Security

Python 319 9 Updated Feb 5, 2026

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

571 21 Updated Dec 15, 2025

本人的科研经验

10,121 528 Updated Jan 29, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,317 413 Updated Jan 19, 2026

ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution

Python 825 160 Updated Jan 25, 2026
Python 82 11 Updated Nov 22, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 791 132 Updated Jan 20, 2026

MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 80.8% Avg@8 score on the challenging GAIA benchmark.

Python 6,101 450 Updated Feb 4, 2026

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 551 49 Updated Feb 2, 2026

A version of verl to support diverse tool use

Python 859 72 Updated Jan 6, 2026

My learning notes for ML SYS.

Python 5,276 342 Updated Jan 30, 2026

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,292 435 Updated Feb 5, 2026

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 624 59 Updated Jan 29, 2026

open source codebase for BioRiskEval

Jupyter Notebook 6 2 Updated Feb 3, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 360 35 Updated Feb 4, 2026

[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system

Python 99 14 Updated Feb 3, 2026

slime is an LLM post-training framework for RL Scaling.

Python 3,680 495 Updated Feb 5, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,010 3,193 Updated Feb 5, 2026

An extensible RL framework for training LLM agents with advanced search capabilities, built on VERL and supporting state-of-the-art search strategies.

Python 30 2 Updated Dec 1, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,943 338 Updated Nov 13, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 52,655 8,900 Updated Nov 12, 2025

Friends of OLMo and their links.

356 30 Updated Sep 15, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 84,639 12,803 Updated Jan 29, 2026

A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!

Python 89 8 Updated Feb 2, 2026
Python 466 38 Updated Aug 28, 2025

Foundation Models for Genomics & Transcriptomics

Jupyter Notebook 814 87 Updated Jan 15, 2026

Curated coding interview preparation materials for busy software engineers

TypeScript 137,444 16,433 Updated Jan 26, 2026

Pretraining infrastructure for multi-hybrid AI model architectures

Python 199 22 Updated Jul 16, 2025
Next