- Shanghai(CN) & Sydney(AU)
- liamding.cc
- @liangdingNLP
- https://scholar.google.com/citations?user=lFCLvOAAAAAJ
- https://huggingface.co/alphadl
Highlights
- Pro
Stars
💻 Terminal-Agent with Human-in-the-Loop Learning
repo for paper https://arxiv.org/abs/2504.13837
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
slime is an LLM post-training framework for RL Scaling.
Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.
Implement a reasoning LLM in PyTorch from scratch, step by step
The official implementation of Energy Loss Phenomenon in RLHF [ICML 2025].
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Towards a Unified View of Large Language Model Post-Training
Build, evaluate and train General Multi-Agent Assistance with ease
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets
[EMNLP 2025] The code and resource of"Chinese Toxic Language Mitigation via Sentiment Polarity Consistent Rewrites"
Marco Search Agent for Realistic and Challenging Agentic Search
Implementation for FP8/INT8 Rollout for RL training without performence drop.
Train your Agent model via our easy and efficient framework
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"