-
Johns Hopkins University
- Baltimore, US
-
16:35
(UTC -04:00) - chuanyangjin.com
- @chuanyang_jin
Highlights
- Pro
Stars
verl: Volcano Engine Reinforcement Learning for LLMs
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
Democratizing Reinforcement Learning for LLMs
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization
Collection of advice for prospective and current PhD students
[ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
Social-AI papers across computing communities, courses, and dissertations.
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
[ACL 2025] A Neural-Symbolic Self-Training Framework
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/