alphadl

🎯

hiring @ alibaba https://liamding.cc/hiring.html

Liam Liang Ding alphadl

🎯

hiring @ alibaba https://liamding.cc/hiring.html

AI researcher & builder

236 followers · 221 following

Shanghai(CN) & Sydney(AU)
07:55 (UTC +10:00)
liamding.cc
@liangdingNLP
https://scholar.google.com/citations?user=lFCLvOAAAAAJ
https://huggingface.co/alphadl

Achievements

x2 x2

Achievements

x2 x2

Highlights

3d-gen-for-llm-builders Public

A hands-on guide to 3D latent diffusion for LLM/VLM builders

Shell 27 Other Updated Apr 7, 2026
alphadl Public

statistics

Updated Apr 5, 2026
cc-agent Public

Claude Code–style agentic CLI in Python.

Python 1 Updated Mar 31, 2026
cc-agent-fork-archive Public
Forked from ultraworkers/claw-code

Better Harness Tools, not merely storing the archive of leaked Claude Code but also make shit things done. Now rewriting in Rust.

Python 2 Updated Mar 31, 2026
AgentSynth Public

AgentSynth: Industrial-Grade Agent Data Synthesis Pipeline

agent data-synthesis

Python 2 Other Updated Mar 25, 2026
AdaRubrics Public

AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories

rubric rlhf reward-model llm-evaluation agent-evaluation

Python 8 1 Apache License 2.0 Updated Mar 25, 2026
AgentHER Public

AgentHER: Hindsight Experience Replay for LLM Agents

data-augmentation training-data hindsight-experience-replay llm-agent too-use

Python 8 Apache License 2.0 Updated Mar 25, 2026
NemoClaw Public
Forked from NVIDIA/NemoClaw

NVIDIA plugin for secure installation of OpenClaw

JavaScript Apache License 2.0 Updated Mar 19, 2026
page-agent Public
Forked from alibaba/page-agent

JavaScript in-page GUI agent. Control web interfaces with natural language.

TypeScript MIT License Updated Mar 19, 2026
unsloth Public
Forked from unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python Apache License 2.0 Updated Mar 16, 2026
FibrationPO Public

unofficial implementation of Fibration Policy Optimization (https://arxiv.org/pdf/2603.08239)

Python 1 Updated Mar 15, 2026
ms-swift Public
Forked from modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python Apache License 2.0 Updated Mar 14, 2026
openclaw Public
Forked from openclaw/openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript MIT License Updated Mar 14, 2026
DDCA Public

dynamic decoupled conditional advantage for efficient reasoning

Python 3 Updated Mar 14, 2026
trajectory_tokenization Public

Trajectory Tokenization for ReAct: compress older steps into tokens, keep recent steps full—no training, drop-in

Python 2 MIT License Updated Mar 14, 2026
officeqa-agentbeats-leaderboard Public
Forked from RDI-Foundation/officeqa-agentbeats-leaderboard

Python Updated Mar 7, 2026
agentx-agentbeats-officeqa Public

OfficeQA Purple Agent for Berkeley RDI AgentX-AgentBeats (Finance track)

Python Updated Mar 7, 2026
BiT Public

BiT: Improving neural machine translation with bidirectional training - EMNLP 2021

machine-translation

Shell 1 Updated Feb 28, 2026
CCAN Public

CCAN: Context-Aware Cross-Attention for Non-Autoregressive Translation - COLING 2020

non-autoregressive-translation

Python 1 Apache License 2.0 Updated Feb 28, 2026
Bottleneck_LC Public

Bottleneck_LC: Widening the bottleneck of lexical choice in NAT - Computer Speech & Language 2025

non-autoregressive-translation

Shell 1 Other Updated Feb 28, 2026
Recurrent-Graph-Syntax-Encoder4MT Public

Recurrent Graph Syntax Encoder (RGSE) for NMT - arxiv 2019

machine-translation

Python 1 Updated Feb 28, 2026
LCNAT Public

LCNAT: Lexical choice in NAT - ICLR 2021

non-autoregressive-translation

Python 7 Updated Feb 28, 2026
RLFW-NAT.mono Public

RLFW-NAT.mono: Redistributing low-frequency words (monolingual data) - ACL 2022

non-autoregressive-translation

Python 3 MIT License Updated Feb 28, 2026
RLFW-NAT Public

RLFW-NAT: Rejuvenating low-frequency words (parallel data) - ACL 2021

non-autoregressive-translation

Python 5 Updated Feb 28, 2026
XLPE Public

XLPE: Cross-lingual position encoding - ACL 2020

machine-translation

TeX 1 Updated Feb 28, 2026
LLM-Lite Public

clean LLM train/inference code

Python Updated Feb 19, 2026
darts.pytorch1.1 Public

Implementation with latest PyTorch (v1.1) for multi-gpu differentiable architecture search https://arxiv.org/abs/1806.09055

darts neural-architecture-search

Python 84 29 Updated Jan 31, 2026
R1 Public

🚀enhanced GRPO with more verifiable rewards and real-time evaluators

deepseek-r1 grpo

Python 37 Apache License 2.0 Updated Jan 27, 2026
dr-tulu Public
Forked from rlresearch/dr-tulu

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python Apache License 2.0 Updated Nov 22, 2025
DeepAgent Public
Forked from RUC-NLPIR/DeepAgent

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python MIT License Updated Nov 2, 2025

Liam Liang Ding alphadl

Achievements

Achievements

Highlights

3d-gen-for-llm-builders Public

Uh oh!

alphadl Public

Uh oh!

cc-agent Public

Uh oh!

cc-agent-fork-archive Public

Uh oh!

AgentSynth Public

Uh oh!

AdaRubrics Public

Uh oh!

AgentHER Public

Uh oh!

NemoClaw Public

Uh oh!

page-agent Public

Uh oh!

unsloth Public

Uh oh!

FibrationPO Public

Uh oh!

ms-swift Public

Uh oh!

openclaw Public

Uh oh!

DDCA Public

Uh oh!

trajectory_tokenization Public

Uh oh!

officeqa-agentbeats-leaderboard Public

Uh oh!

agentx-agentbeats-officeqa Public

Uh oh!

BiT Public

Uh oh!

CCAN Public

Uh oh!

Bottleneck_LC Public

Uh oh!

Recurrent-Graph-Syntax-Encoder4MT Public

Uh oh!

LCNAT Public

Uh oh!

RLFW-NAT.mono Public

Uh oh!

RLFW-NAT Public

Uh oh!

XLPE Public

Uh oh!

LLM-Lite Public

Uh oh!

darts.pytorch1.1 Public

Uh oh!

R1 Public

Uh oh!

dr-tulu Public

Uh oh!

DeepAgent Public

Uh oh!