boyiwei

🤡

Boyi Wei boyiwei

🤡

PhD student @princeton-polaris-lab

27 followers · 29 following

Princeton University
Princeton, NJ
18:56 (UTC -05:00)
www.boyiwei.com
@wei_boyi

Achievements

Highlights

Organizations

Lists (3)

Sort

Stars

104 results for source starred repositories

Clear filter

safety-research / impossiblebench

Official Inspect Implementation for "ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases"

Python 30 3 Updated Dec 1, 2025

AI45Lab / AgentDoG

A Diagnostic Guardrail Framework for AI Agent Safety and Security

Python 319 9 Updated Feb 5, 2026

opendilab / awesome-multi-modal-reinforcement-learning

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

571 21 Updated Dec 15, 2025

pengsida / learning_research

本人的科研经验

10,121 528 Updated Jan 29, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,317 413 Updated Jan 19, 2026

SakanaAI / ShinkaEvolve

ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution

Python 825 160 Updated Jan 25, 2026

SakanaAI / robust-kbench

Python 82 11 Updated Nov 22, 2025

ScalingIntelligence / KernelBench

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 791 132 Updated Jan 20, 2026

MiroMindAI / MiroThinker

MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 80.8% Avg@8 score on the challenging GAIA benchmark.

Python 6,101 450 Updated Feb 4, 2026

rlresearch / dr-tulu

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 551 49 Updated Feb 2, 2026

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python 859 72 Updated Jan 6, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 5,276 342 Updated Jan 30, 2026

pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,292 435 Updated Feb 5, 2026

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 624 59 Updated Jan 29, 2026

scaleapi / BioRiskEval

open source codebase for BioRiskEval

Jupyter Notebook 6 2 Updated Feb 3, 2026

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 360 35 Updated Feb 4, 2026

pettingllms-ai / PettingLLMs

[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system

Python 99 14 Updated Feb 3, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 3,680 495 Updated Feb 5, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,010 3,193 Updated Feb 5, 2026

Tree-Shu-Zhao / ferret

An extensible RL framework for training LLM agents with advanced search capabilities, built on VERL and supporting state-of-the-art search strategies.

Python 30 2 Updated Dec 1, 2025