Fuxun Yu, Shawn Bray, Di Wang, Longfei Shangguan, Xulong Tang, Chenchen Liu, Xiang Chen: Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU. ICCAD 2021: 1-9