alphadl

Follow

🎯

hiring @ alibaba https://liamding.cc/hiring.html

Liam Liang Ding alphadl

🎯

hiring @ alibaba https://liamding.cc/hiring.html

Follow

AI researcher & builder

255 followers · 222 following

Shanghai(CN) & Sydney(AU)
04:17 (UTC +10:00)
liamding.cc
@liangdingNLP
https://scholar.google.com/citations?user=lFCLvOAAAAAJ
https://huggingface.co/alphadl

Achievements

Achievements

Highlights

Pro

alphadl/README.md

Hi there

🙋‍♂️ I am building a deterministic agentic AI ecosystem at Alibaba. I was the chief scientist at a startup (raised more than 50M$), previously worked at JD Explore Academy and Tencent AI Lab, and held an adjunct researcher position at ZJU.

🔭 Working on the whole pipeline of LLM R&D and their human-centric applications, including efficient and sufficient training, alignment, evaluations, compression, multilinguality, multimodality, agentic application, and much more.

💪 I'm keen on bodybuilding (5 years+), marathon (completed first half marathon (126min) in Beijing-2016 and most recent half marathon (86min) in Sydney-2019😅. will resume training in 2024💪🏻).

🥗 I (once😅) enjoy cooking.

🐈 I like to spend Sundays with my cats (two from 2020-2023, one from 2023).

🔥 Recent open-source projects — agentic AI (data, evaluation, context) and LLM alignment / policy optimization:

🔄 AgentHER Hindsight relabeling of failed trajectories for training.
🧬 AgentSynth Synthetic agent data from scratch with execution validation.
📏 AdaRubric Dynamic rubric evaluation for trajectory quality.
🗜️ trajectory_tokenization ReAct with compressed history for long-horizon context.
📡 SigFibPO SNR-calibrated trust regions and causal fiber residuals for multi-domain RLVR (research code + verl hook).

Pinned Loading

THUNLP-MT/MT-Reading-List THUNLP-MT/MT-Reading-List Public

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

TeX 2.4k 442
lookahead.pytorch lookahead.pytorch Public

lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch

Python 336 63
AgentHER AgentHER Public

AgentHER: Hindsight Experience Replay for LLM Agents

Python 92 10
AdaRubrics AdaRubrics Public

AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories

Python 213 22
darts.pytorch1.1 darts.pytorch1.1 Public

Implementation with latest PyTorch (v1.1) for multi-gpu differentiable architecture search https://arxiv.org/abs/1806.09055

Python 83 28
3d-gen-for-llm-builders 3d-gen-for-llm-builders Public

A hands-on guide to 3D latent diffusion for LLM/VLM builders

Shell 27

⚡