- The University of Tokyo
- Boston
Stars
A secure, configurable file-sharing and URL shortening web app written in Rust.
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
[Findings of EMNLP 2025] Benchmark for evaluating sycophantic behavior in multi-turn, free-form conversational settings.
Author's PyTorch implementation of BCQ for continuous and discrete actions
An Extendible (General) Continual Learning Framework based on PyTorch - official codebase of Dark Experience for General Continual Learning
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.
A Python tool that automatically cleans, completes, and standardizes BibTeX entries using LLMs and web search.
Writing AI Conference Papers: A Handbook for Beginners
All notes and materials for the CS229: Machine Learning course by Stanford University
[ICLR 2025] Automated Design of Agentic Systems