Cao Ying
lcy-seso
MSR Asia, system research group. Previously worked at Baidu IDL (Institute of Deep Learning) and contributed as a member of the Paddle team.
MSRA, system research group China
Shangming Cai
ShangmingCai
Currently working at Alibaba Cloud Apsara Lab.
Research Interests: Efficient LLM serving systems.
Alibaba Cloud
Chris Fregly
cfregly
AI Systems Performance Engineer
[3x O'Reilly Author]
[Former AWS, Databricks, Netflix]
AI Systems Performance Engineer San Francisco, CA
Varun Sundar Rabindranath
varun-sundar-rabindranath
Software Engineer interested in the research areas Machine Learning, Natural Language Processing, Information Retrieval, Program Synthesis and Computer Vision
Red Hat Boston, MA
youkaichao
youkaichao
Ph.D. from Tsinghua University. Core maintainer of @vllm-project.
Co-Founder & Chief Scientist @Inferact.
@vllm-project Beijing, China
Alp Dener
denera
Senior Performance Engineer @NVIDIA
DL Frameworks | Transformer Engine
@nvidia Chicago, IL
Chenggang Zhao
LyricZhao
@deepseek-ai infra; previously at NVIDIA | SenseTime | Tsinghua University.
DeepSeek AI Hangzhou, China