default search action
"Length-Adaptive Distillation: Customizing Small Language Model for Dynamic ..."
Chang Liu et al. (2023)
- Chang Liu, Chongyang Tao, Jianxin Liang, Jiazhan Feng, Tao Shen, Quzhe Huang, Dongyan Zhao:
Length-Adaptive Distillation: Customizing Small Language Model for Dynamic Token Pruning. EMNLP (Findings) 2023: 4452-4463
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.