default search action
Yanghua Peng
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c15]Hanpeng Hu, Junwei Su, Juntao Zhao, Yanghua Peng, Yibo Zhu, Haibin Lin, Chuan Wu:
CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs. EuroSys 2024: 1054-1074 - [c14]Juntao Zhao, Borui Wan, Yanghua Peng, Haibin Lin, Yibo Zhu, Chuan Wu:
QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices. IPDPS 2024: 193-204 - [c13]Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu, Zhe Li, Xiaoying Jia, Jianxi Ye, Xin Jin, Xin Liu:
MegaScale: Scaling Large Language Model Training to More Than 10, 000 GPUs. NSDI 2024: 745-760 - [c12]Juntao Zhao, Borui Wan, Chuan Wu, Yanghua Peng, Haibin Lin:
POSTER: LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization. PPoPP 2024: 460-462 - [i12]Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu, Zhe Li, Xiaoying Jia, Jianxi Ye, Xin Jin, Xin Liu:
MegaScale: Scaling Large Language Model Training to More Than 10, 000 GPUs. CoRR abs/2402.15627 (2024) - [i11]Juntao Zhao, Borui Wan, Yanghua Peng, Haibin Lin, Chuan Wu:
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization. CoRR abs/2403.01136 (2024) - [i10]Juntao Zhao, Borui Wan, Yanghua Peng, Haibin Lin, Yibo Zhu, Chuan Wu:
QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices. CoRR abs/2407.02327 (2024) - [i9]Borui Wan, Mingji Han, Yiyao Sheng, Zhichao Lai, Mofan Zhang, Junda Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu:
ByteCheckpoint: A Unified Checkpointing System for LLM Development. CoRR abs/2407.20143 (2024) - [i8]Weiqi Feng, Yangrui Chen, Shaoyu Wang, Yanghua Peng, Haibin Lin, Minlan Yu:
Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation. CoRR abs/2408.03505 (2024) - [i7]Guangming Sheng, Chi Zhang, Zilingfeng Ye, Xibin Wu, Wang Zhang, Ru Zhang, Yanghua Peng, Haibin Lin, Chuan Wu:
HybridFlow: A Flexible and Efficient RLHF Framework. CoRR abs/2409.19256 (2024) - 2023
- [j4]Yangrui Chen, Jiaxuan You, Jun He, Yuan Lin, Yanghua Peng, Chuan Wu, Yibo Zhu:
SP-GNN: Learning structure and position information from graphs. Neural Networks 161: 505-514 (2023) - [j3]Yixin Bao, Yanghua Peng, Chuan Wu:
Deep Learning-Based Job Placement in Distributed Machine Learning Clusters With Heterogeneous Workloads. IEEE/ACM Trans. Netw. 31(2): 634-647 (2023) - [c11]Tianfeng Liu, Yangrui Chen, Dan Li, Chuan Wu, Yibo Zhu, Jun He, Yanghua Peng, Hongzheng Chen, Hongzhi Chen, Chuanxiong Guo:
BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing. NSDI 2023: 103-118 - [i6]Hanpeng Hu, Junwei Su, Juntao Zhao, Yanghua Peng, Yibo Zhu, Haibin Lin, Chuan Wu:
CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs. CoRR abs/2311.09690 (2023) - 2022
- [c10]Hanpeng Hu, Chenyu Jiang, Yuchen Zhong, Yanghua Peng, Chuan Wu, Yibo Zhu, Haibin Lin, Chuanxiong Guo:
dPRO: A Generic Performance Diagnosis and Optimization Toolkit for Expediting Distributed DNN Training. MLSys 2022 - [c9]Yangrui Chen, Cong Xie, Meng Ma, Juncheng Gu, Yanghua Peng, Haibin Lin, Chuan Wu, Yibo Zhu:
SAPipe: Staleness-Aware Pipeline for Data Parallel DNN Training. NeurIPS 2022 - [c8]Yihao Zhao, Yuanqiang Liu, Yanghua Peng, Yibo Zhu, Xuanzhe Liu, Xin Jin:
Multi-resource interleaving for deep learning training. SIGCOMM 2022: 428-440 - [i5]Hanpeng Hu, Chenyu Jiang, Yuchen Zhong, Yanghua Peng, Chuan Wu, Yibo Zhu, Haibin Lin, Chuanxiong Guo:
dPRO: A Generic Profiling and Optimization System for Expediting Distributed DNN Training. CoRR abs/2205.02473 (2022) - 2021
- [j2]Yanghua Peng, Yixin Bao, Yangrui Chen, Chuan Wu, Chen Meng, Wei Lin:
DL2: A Deep Learning-Driven Scheduler for Deep Learning Clusters. IEEE Trans. Parallel Distributed Syst. 32(8): 1947-1960 (2021) - [i4]Tianfeng Liu, Yangrui Chen, Dan Li, Chuan Wu, Yibo Zhu, Jun He, Yanghua Peng, Hongzheng Chen, Hongzhi Chen, Chuanxiong Guo:
BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing. CoRR abs/2112.08541 (2021) - 2020
- [c7]Yangrui Chen, Yanghua Peng, Yixin Bao, Chuan Wu, Yibo Zhu, Chuanxiong Guo:
Elastic parameter server load distribution in deep learning clusters. SoCC 2020: 507-521 - [c6]Yixin Bao, Yanghua Peng, Yangrui Chen, Chuan Wu:
Preemptive All-reduce Scheduling for Expediting Distributed DNN Training. INFOCOM 2020: 626-635
2010 – 2019
- 2019
- [c5]Yixin Bao, Yanghua Peng, Chuan Wu:
Deep Learning-based Job Placement in Distributed Machine Learning Clusters. INFOCOM 2019: 505-513 - [c4]Yanghua Peng, Yibo Zhu, Yangrui Chen, Yixin Bao, Bairen Yi, Chang Lan, Chuan Wu, Chuanxiong Guo:
A generic communication scheduler for distributed DNN training acceleration. SOSP 2019: 16-29 - [i3]Yanghua Peng, Yixin Bao, Yangrui Chen, Chuan Wu, Chen Meng, Wei Lin:
DL2: A Deep Learning-driven Scheduler for Deep Learning Clusters. CoRR abs/1909.06040 (2019) - 2018
- [c3]Yanghua Peng, Yixin Bao, Yangrui Chen, Chuan Wu, Chuanxiong Guo:
Optimus: an efficient dynamic resource scheduler for deep learning clusters. EuroSys 2018: 3:1-3:14 - [c2]Yixin Bao, Yanghua Peng, Chuan Wu, Zongpeng Li:
Online Job Scheduling in Distributed Machine Learning Clusters. INFOCOM 2018: 495-503 - [i2]Yixin Bao, Yanghua Peng, Chuan Wu, Zongpeng Li:
Online Job Scheduling in Distributed Machine Learning Clusters. CoRR abs/1801.00936 (2018) - 2017
- [j1]Jingpu Duan, Chuan Wu, Franck Le, Alex X. Liu, Yanghua Peng:
Dynamic Scaling of Virtualized, Distributed Service Chains: A Case Study of IMS. IEEE J. Sel. Areas Commun. 35(11): 2501-2511 (2017) - [c1]Yanghua Peng, Ji Yang, Chuan Wu, Chuanxiong Guo, Chengchen Hu, Zongpeng Li:
deTector: a Topology-aware Monitoring System for Data Center Networks. USENIX ATC 2017: 55-68 - [i1]Jingpu Duan, Chuan Wu, Franck Le, Alex X. Liu, Yanghua Peng:
Dynamic Scaling of Virtualized, Distributed Service Chains: A Case Study of IMS. CoRR abs/1702.02853 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-12 21:56 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint