😮💨
I am a Ph.D. graduate from South China University of Technology, with research interests in OCR, text image processing,and document understanding.
-
SCUT
- Guangzhou
-
06:47
(UTC +08:00) - https://scholar.google.com/citations?user=dW7AgfgAAAAJ&hl=zh-CN
Stars
5
results
for source starred repositories
written in C++
Clear filter
FlashMLA: Efficient Multi-head Latent Attention Kernels
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
A tensorflow implementation of EAST text detector
Geometric Augmentation for Text Image