  • The University of Tokyo
  • Boston

A secure, configurable file-sharing and URL shortening web app written in Rust.

Rust 4,395 303 Updated Jun 13, 2026

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,365 171 Updated Aug 3, 2023

[Findings of EMNLP 2025] Benchmark for evaluating sycophantic behavior in multi-turn, free-form conversational settings.

Python 29 7 Updated Dec 19, 2025

Jupyter Notebook 45 7 Updated Sep 28, 2025

Author's PyTorch implementation of BCQ for continuous and discrete actions

Python 664 146 Updated Apr 6, 2021

An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning

Python 822 155 Updated May 20, 2026

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 1,128 156 Updated Mar 17, 2025

🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.

Python 529 88 Updated Dec 25, 2025

A Python tool that automatically cleans, completes, and standardizes BibTeX entries using LLMs and web search.

Python 184 7 Updated Jun 10, 2026

Writing AI Conference Papers: A Handbook for Beginners

3,825 134 Updated Jul 16, 2025

All notes and materials for the CS229: Machine Learning course by Stanford University

Jupyter Notebook 3,324 1,187 Updated Feb 14, 2025

[ICLR 2025] Automated Design of Agentic Systems

Python 1,591 239 Updated Jan 28, 2025