🎯
hiring @ alibaba https://liamding.cc/hiring.html

Python 9 Updated Feb 28, 2025
Jupyter Notebook 136 2 Updated Dec 19, 2025

💻 Terminal-Agent with Human-in-the-Loop Learning

Python 24 1 Updated Dec 18, 2025

repo for paper https://arxiv.org/abs/2504.13837

Python 300 17 Updated Dec 17, 2025

Lemon Agent

19 1 Updated Dec 4, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 18,274 2,870 Updated Dec 19, 2025

SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts

Python 55 2 Updated Dec 1, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,921 353 Updated Dec 21, 2025

Curated systems, benchmarks, papers, and more on memory for LLMs/MLLMs: long-term context, retrieval, and reasoning.

127 2 Updated Dec 19, 2025

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 2,251 308 Updated Dec 20, 2025

Expanding natural instructions

Python 1,028 197 Updated Dec 11, 2023

The official implementation of Energy Loss Phenomenon in RLHF [ICML 2025].

Python 7 Updated Oct 25, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,504 180 Updated Dec 21, 2025

Towards a Unified View of Large Language Model Post-Training

Python 196 11 Updated Sep 8, 2025

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,075 107 Updated Dec 19, 2025

A Systematic Survey of Deep Research

213 10 Updated Nov 27, 2025

Awesome List for Agentic RL

HTML 639 26 Updated Dec 9, 2025

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 472 35 Updated Dec 19, 2025

[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Python 161 16 Updated Nov 14, 2025

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 9,516 739 Updated May 8, 2025

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python 871 111 Updated Nov 2, 2025
Python 48 2 Updated Dec 10, 2025

[EMNLP 2025] The code and resources of "Chinese Toxic Language Mitigation via Sentiment Polarity Consistent Rewrites"

Python 3 Updated Nov 8, 2025

Marco Search Agent for Realistic and Challenging Agentic Search

Python 240 21 Updated Oct 24, 2025
Python 12 2 Updated Sep 24, 2025

Implementation of FP8/INT8 rollout for RL training without performance drop.

Python 281 18 Updated Nov 7, 2025

Train your Agent model via our easy and efficient framework

Python 1,664 156 Updated Dec 5, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 675 46 Updated Oct 15, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,671 1,356 Updated Dec 17, 2025

The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"

Python 244 15 Updated Nov 16, 2025