View chuanyangjin's full-sized avatar

Jupyter Notebook 1 1 Updated Oct 6, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,119 2,517 Updated Oct 9, 2025

Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Python 246 11 Updated May 5, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,432 416 Updated Oct 9, 2025

GitHub profile

19 Updated Aug 26, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 290 30 Updated Oct 9, 2025
Python 2 Updated Nov 15, 2024

AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling

Python 27 4 Updated Jul 26, 2025

[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 163 12 Updated Oct 8, 2025

Beyond the Binary: Capturing Diverse Preferences With Reward Regularization

Python 5 Updated Apr 16, 2025

Collection of advice for prospective and current PhD students

1,879 140 Updated Jul 10, 2024

[ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"

Python 278 45 Updated Mar 30, 2025

AWM: Agent Workflow Memory

Python 327 28 Updated Jan 31, 2025

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

Jupyter Notebook 877 117 Updated Apr 3, 2025

List of language agents based on paper "Cognitive Architectures for Language Agents"

TeX 1,039 69 Updated Jan 16, 2025

Social-AI papers across computing communities, courses, and dissertations.

22 1 Updated Jun 10, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,606 564 Updated Jan 16, 2025

Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

Python 474 27 Updated Jan 15, 2025

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

Python 31 2 Updated Jan 23, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,491 2,488 Updated Sep 30, 2025

AllenAI's post-training codebase

Python 3,231 444 Updated Oct 9, 2025

My research experience

7,550 436 Updated Aug 12, 2025

[ACL 2025] A Neural-Symbolic Self-Training Framework

C 115 4 Updated Jun 1, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 3,861 777 Updated Sep 4, 2025
JavaScript 3,596 1,526 Updated Jun 21, 2024