🎯
Focusing
Interested in AI for systems and efficient LLM training and serving!
- Ph.D. Candidate @ CUHK-MMLab, B.E. @ UCAS
- Hong Kong
- https://jf-d.github.io/
Highlights
- Pro
Stars
33 stars written in C++
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies) with DLRM (Deep Learning Recommendation Model)