PhD student at HKUST-GZ
- HKUST-GZ
- Guangzhou, China
- (UTC +08:00)
- https://dblp.org/pid/301/6349.html
Highlights
- Pro
Stars
2 results for sponsorable starred repositories
Due to the huge vocabulary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this project provides a Tokenizer vocabulary shearing solution for Qwen…
Perform data science on data that remains in someone else's server