default search action

combined dblp search
author search
venue search
publication search

ask others

Weihao Cui

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tc/GuoXLQGCCG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tc/GuoXLQGCCG24
Cong Guo, Fengchen Xue, Jingwen Leng, Yuxian Qiu, Yue Guan, Weihao Cui, Quan Chen, Minyi Guo:
Accelerating Sparse DNNs Based on Tiled GEMM. IEEE Trans. Computers 73(5): 1275-1289 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10876
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10876
Cong Guo, Fengchen Xue, Jingwen Leng, Yuxian Qiu, Yue Guan, Weihao Cui, Quan Chen, Minyi Guo:
Accelerating Sparse DNNs Based on Tiled GEMM. CoRR abs/2402.10876 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-16125
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-16125
Chunyu Xue, Weihao Cui, Han Zhao, Quan Chen, Shulai Zhang, Pengyu Yang, Jing Yang, Shaobo Li, Minyi Guo:
A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters. CoRR abs/2403.16125 (2024)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-14691
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-14691
Han Zhao, Weihao Cui, Quan Chen, Shulai Zhang, Zijun Li, Jingwen Leng, Chao Li, Deze Zeng, Minyi Guo:
Towards Fast Setup and High Throughput of GPU Serverless Computing. CoRR abs/2404.14691 (2024)
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-11299
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-11299
Pai Zeng, Zhenyu Ning, Jieru Zhao, Weihao Cui, Mengwei Xu, Liwei Guo, Xusheng Chen, Yizhou Shan:
The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving. CoRR abs/2405.11299 (2024)
[i2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-01075
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-01075
Yangjie Zhou, Honglin Zhu, Qian Qiu, Weihao Cui, Zihan Liu, Cong Guo, Siyuan Feng, Jintao Meng, Haidong Lan, Jingwen Leng, Wenxi Zhu, Minwen Deng:
Vortex: Efficient Sample-Free Dynamic Tensor Program Optimization via Hardware-aware Strategy Space Hierarchization. CoRR abs/2409.01075 (2024)
2023
[j4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tc/ZhaoCCG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tc/ZhaoCCG23
Han Zhao, Weihao Cui, Quan Chen, Minyi Guo:
ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-Grained Resource Management. IEEE Trans. Computers 72(5): 1473-1487 (2023)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tc/ZhaoCCLZG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tc/ZhaoCCLZG23
Han Zhao, Weihao Cui, Quan Chen, Jingwen Leng, Deze Zeng, Minyi Guo:
Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation. IEEE Trans. Computers 72(12): 3458-3472 (2023)
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/cf/0001SL0C0000G23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cf/0001SL0C0000G23
Yangjie Zhou, Yaoxu Song, Jingwen Leng, Zihan Liu, Weihao Cui, Zhendong Zhang, Cong Guo, Quan Chen, Li Li, Minyi Guo:
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs. CF 2023: 52-62
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/cloud/Chen0CHZCLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cloud/Chen0CHZCLG23
Binghao Chen, Han Zhao, Weihao Cui, Yifu He, Shulai Zhang, Quan Chen, Zijun Li, Minyi Guo:
Maximizing the Utilization of GPUs Used by Cloud Gaming through Adaptive Co-location with Combo. SoCC 2023: 265-280
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icpads/ChengZLCCG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpads/ChengZLCCG23
Jiagan Cheng, Yilong Zhao, Zijun Li, Quan Chen, Weihao Cui, Minyi Guo:
Microless: Cost-Efficient Hybrid Deployment of Microservices on IaaS VMs and Serverless. ICPADS 2023: 2303-2310
[c9]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/osdi/CuiHOWZM00XQZ0T23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/osdi/CuiHOWZM00XQZ0T23
Weihao Cui, Zhenhua Han, Lingji Ouyang, Yichuan Wang, Ningxin Zheng, Lingxiao Ma, Yuqing Yang, Fan Yang, Jilong Xue, Lili Qiu, Lidong Zhou, Quan Chen, Haisheng Tan, Minyi Guo:
Optimizing Dynamic Neural Networks with Brainstorm. OSDI 2023: 797-815
[i1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17408
Yangjie Zhou, Yaoxu Song, Jingwen Leng, Zihan Liu, Weihao Cui, Zhendong Zhang, Cong Guo, Quan Chen, Li Li, Minyi Guo:
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs. CoRR abs/2305.17408 (2023)
2022
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tc/ZhangCZCFG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tc/ZhangCZCFG22
Wei Zhang, Quan Chen, Ningxin Zheng, Weihao Cui, Kaihua Fu, Minyi Guo:
Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs. IEEE Trans. Computers 71(4): 866-879 (2022)
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/ZhaoCCZLLLG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/ZhaoCCZLLLG22
Han Zhao, Weihao Cui, Quan Chen, Youtao Zhang, Yanchao Lu, Chao Li, Jingwen Leng, Minyi Guo:
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS. HPCA 2022: 800-813
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/ics/ZhangCCZGLLG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ics/ZhangCCZGLLG22
Shulai Zhang, Weihao Cui, Quan Chen, Zhengnian Zhang, Yue Guan, Jingwen Leng, Chao Li, Minyi Guo:
PAME: precision-aware multi-exit DNN serving for reducing latencies of batched inferences. ICS 2022: 37:1-37:12
[c6]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/usenix/Cui00WLZ0G22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/Cui00WLZ0G22
Weihao Cui, Han Zhao, Quan Chen, Hao Wei, Zirui Li, Deze Zeng, Chao Li, Minyi Guo:
DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on GPUs. USENIX ATC 2022: 183-198
2021
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tpds/CuiCZWTG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tpds/CuiCZWTG21
Weihao Cui, Quan Chen, Han Zhao, Mengze Wei, Xiaoxin Tang, Minyi Guo:
E²bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services. IEEE Trans. Parallel Distributed Syst. 32(6): 1307-1321 (2021)
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/iccd/0005CCZLG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccd/0005CCZLG21
Han Zhao, Weihao Cui, Quan Chen, Jieru Zhao, Jingwen Leng, Minyi Guo:
Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks. ICCD 2021: 290-298
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/Cui0CZLZSMYLG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/Cui0CZLZSMYLG21
Weihao Cui, Han Zhao, Quan Chen, Ningxin Zheng, Jingwen Leng, Jieru Zhao, Zhuo Song, Tao Ma, Yong Yang, Chao Li, Minyi Guo:
Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction. SC 2021: 15
2020
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icdcs/0005CCLYZ0G20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdcs/0005CCLYZ0G20
Han Zhao, Weihao Cui, Quan Chen, Jingwen Leng, Kai Yu, Deze Zeng, Chao Li, Minyi Guo:
CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs. ICDCS 2020: 853-863

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iccd/CuiWCTLLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccd/CuiWCTLLG19
Weihao Cui, Mengze Wei, Quan Chen, Xiaoxin Tang, Jingwen Leng, Li Li, Mingyi Guo:
Ebird: Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services. ICCD 2019: 497-505
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ics/ZhangCFCMWLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ics/ZhangCFCMWLG19
Wei Zhang, Weihao Cui, Kaihua Fu, Quan Chen, Daniel Edward Mawhirter, Bo Wu, Chao Li, Minyi Guo:
Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters. ICS 2019: 58-68

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.