Highlights

  • Pro

Organizations

@princeton-polaris-lab


A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,233 190 Updated Dec 23, 2025

ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution

Python 752 130 Updated Dec 14, 2025
Python 72 9 Updated Nov 22, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 721 103 Updated Dec 23, 2025

MiroThinker is a series of open-source agentic models trained for deep research and complex tool use scenarios.

Python 1,359 94 Updated Dec 23, 2025

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 476 35 Updated Dec 19, 2025

A version of verl to support diverse tool use

Python 774 63 Updated Dec 23, 2025

My learning notes for ML SYS.

Python 4,771 303 Updated Dec 22, 2025

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,225 427 Updated Dec 18, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 584 50 Updated Dec 23, 2025

Open-source codebase for BioRiskEval

Jupyter Notebook 6 2 Updated Nov 21, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 337 34 Updated Dec 23, 2025

An RL framework for multi-LLM agent systems

Python 84 9 Updated Dec 22, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,961 358 Updated Dec 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,733 2,877 Updated Dec 23, 2025

An extensible RL framework for training LLM agents with advanced search capabilities, built on VERL and supporting state-of-the-art search strategies.

Python 21 Updated Dec 1, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,699 310 Updated Nov 13, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,339 8,599 Updated Nov 12, 2025

Friends of OLMo and their links.

356 31 Updated Sep 15, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,546 12,193 Updated Dec 21, 2025

A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!

Python 88 8 Updated Dec 8, 2025
Python 465 37 Updated Aug 28, 2025

Foundation Models for Genomics & Transcriptomics

Jupyter Notebook 759 81 Updated Dec 23, 2025

Curated coding interview preparation materials for busy software engineers

TypeScript 136,377 16,330 Updated Nov 18, 2025

Pretraining infrastructure for multi-hybrid AI model architectures

Python 197 21 Updated Jul 16, 2025

Jailbreak Evo

Python 20 1 Updated Jun 2, 2025

Official repository for the ProteinGym benchmarks

HTML 375 49 Updated Jul 21, 2025

Genome modeling and design across all domains of life

Jupyter Notebook 3,258 378 Updated Sep 17, 2025