🎯
Focusing
Interested in AI for system, efficient LLM training and serving!
-
Ph.D. Candidate@CUHK-MMLab, B.E.@ UCAS
- HongKong
- https://jf-d.github.io/
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
8
results
for sponsorable starred repositories
written in Python
Clear filter
A high-throughput and memory-efficient inference and serving engine for LLMs
The definitive Web UI for local AI, with powerful features and easy setup.
Accessible large language models via k-bit quantization for PyTorch.
PyTorch Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)
Official repository for LongChat and LongEval
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
Measure and optimize the energy consumption of your AI applications!