Works at Tencent-WXG. Focuses on model inference optimization, including inference engines and model compression.
- Shanghai
Stars: 1 star, written in Rust
A Datacenter Scale Distributed Inference Serving Framework