default search action
IEEE Transactions on Parallel and Distributed Systems, Volume 33
Volume 33, Number 1, January 2022
- J. Rubén Titos Gil, Ricardo Fernández Pascual, Alberto Ros, Manuel E. Acacio:
DeTraS: Delaying Stores for Friendly-Fire Mitigation in Hardware Transactional Memory. 1-13 - Haozhao Wang, Song Guo, Zhihao Qu, Ruixuan Li, Ziming Liu:
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment. 14-25 - Georgios Andreadis, Fabian Mastenbroek, Vincent van Beek, Alexandru Iosup:
Capelin: Data-Driven Compute Capacity Procurement for Cloud Datacenters Using Portfolios of Scenarios. 26-39 - Yibo Jin, Zhuzhong Qian, Song Guo, Sheng Zhang, Lei Jiao, Sanglu Lu:
$run$ runData: Re-Distributing Data via Piggybacking for Geo-Distributed Data Analytics Over Edges. 40-55 - Si Wu, Zhirong Shen, Patrick P. C. Lee, Yinlong Xu:
Optimal Repair-Scaling Trade-off in Locally Repairable Codes: Analysis and Evaluation. 56-69 - Gangzhao Lu, Weizhe Zhang, Zheng Wang:
Optimizing Depthwise Separable Convolution Operations on GPUs. 70-87 - Gingfung Yeung, Damian Borowiec, Renyu Yang, Adrian Friday, Richard Harper, Peter Garraghan:
Horus: Interference-Aware and Prediction-Based Scheduling in Deep Learning Systems. 88-100 - Shreshth Tuli, Shivananda R. Poojara, Satish Narayana Srirama, Giuliano Casale, Nicholas R. Jennings:
COSCO: Container Orchestration Using Co-Simulation and Gradient Based Optimization for Fog Computing Environments. 101-116 - Dipika Deb, Rohith M. K., John Jose:
FlitZip: Effective Packet Compression for NoC in MultiProcessor System-on-Chip. 117-128 - Tongfeng Weng, Xu Zhou, Kenli Li, Peng Peng, Keqin Li:
Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs. 129-143 - Yidi Wu, Kaihao Ma, Xiao Yan, Zhi Liu, Zhenkun Cai, Yuzhen Huang, James Cheng, Han Yuan, Fan Yu:
Elastic Deep Learning in Multi-Tenant GPU Clusters. 144-158 - Zhen Xie, Guangming Tan, Weifeng Liu, Ninghui Sun:
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures. 159-175 - Federico Magnanini, Luca Ferretti, Michele Colajanni:
Scalable, Confidential and Survivable Software Updates. 176-191 - Yuhao Zhou, Qing Ye, Jiancheng Lv:
Communication-Efficient Federated Learning With Compensated Overlap-FedAvg. 192-205 - Saad Zia Sheikh, Muhammad Adeel Pasha:
Energy-Efficient Cache-Aware Scheduling on Heterogeneous Multicore Systems. 206-217 - Huan Wang, Guoming Tang, Kui Wu, Jianping Wang:
PLVER: Joint Stable Allocation and Content Replication for Edge-Assisted Live Video Delivery. 218-230 - Amelie Chi Zhou, Weilin Xue, Yao Xiao, Bingsheng He, Shadi Ibrahim, Reynold Cheng:
Taming System Dynamics on Resource Optimization for Data Processing Workflows: A Probabilistic Approach. 231-248
Volume 33, Number 2, February 2022
- Shutong Chen, Lei Jiao, Fangming Liu, Lin Wang:
EdgeDR: An Online Mechanism Design for Demand Response in Edge Clouds. 343-358 - Ping Gao, Xiaohui Duan, Bertil Schmidt, Wusheng Zhang, Lin Gan, Haohuan Fu, Wei Xue, Weiguo Liu, Guangwen Yang:
Optimization of Reactive Force Field Simulation: Refactor, Parallelization, and Vectorization for Interactions. 359-373 - Yipei Niu, Panpan Jin, Jian Guo, Yikai Xiao, Rong Shi, Fangming Liu, Chen Qian, Yang Wang:
PostMan: Rapidly Mitigating Bursty Traffic via On-Demand Offloading of Packet Processing. 374-387 - Konstantinos Iliakis, Sotirios Xydis, Dimitrios Soudris:
Repurposing GPU Microarchitectures with Light-Weight Out-Of-Order Execution. 388-402 - Li Chen, Shuhao Liu, Baochun Li:
Optimizing Network Transfers for Data Analytic Jobs Across Geo-Distributed Datacenters. 403-414 - Limei Lin, Yanze Huang, Li Xu, Sun-Yuan Hsieh:
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity. 415-428 - Nhut-Minh Ho, Weng-Fai Wong:
Tensorox: Accelerating GPU Applications via Neural Approximation on Unused Tensor Cores. 429-443 - Abdurrahman Yasar, Sivasankaran Rajamanickam, Jonathan W. Berry, Ümit V. Çatalyürek:
A Block-Based Triangle Counting Algorithm on Heterogeneous Environments. 444-458 - Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression. 459-475 - Jie Cui, Bei Li, Hong Zhong, Geyong Min, Yan Xu, Lu Liu:
A Practical and Efficient Bidirectional Access Control Scheme for Cloud-Edge Data Sharing. 476-488 - Scott Pakin, Christof Teuscher, Catherine D. Schuman:
Guest Editorial: Special Section on Parallel and Distributed Computing Techniques for Non-Von Neumann Technologies. 249-250 - Chang Hyun Kim, Won Jun Lee, Yoonah Paik, Kiyong Kwon, Seok Young Kim, Il Park, Seon Wook Kim:
Silent-PIM: Realizing the Processing-in-Memory Computing With Standard Memory Requests. 251-262 - Purab Ranjan Sutradhar, Sathwika Bavikadi, Mark Connolly, Savankumar Prajapati, Mark A. Indovina, Sai Manoj Pudukotai Dinakarrao, Amlan Ganguly:
Look-up-Table Based Processing-in-Memory Architecture With Programmable Precision-Scaling for Deep Learning Applications. 263-275 - Leonid Yavits, Roman Kaplan, Ran Ginosar:
GIRAF: General Purpose In-Storage Resistive Associative Framework. 276-287 - Twisha Titirsha, Shihao Song, Anup Das, Jeffrey L. Krichmar, Nikil D. Dutt, Nagarajan Kandasamy, Francky Catthoor:
Endurance-Aware Mapping of Spiking Neural Networks to Neuromorphic Hardware. 288-301 - Kyle Henke, Garrett T. Kenyon, Ben Migliori:
Fast Post-Hoc Normalization for Brain Inspired Sparse Coding on a Neuromorphic Device. 302-309 - Elijah Pelofske, Georg Hahn, Hristo N. Djidjev:
Inferring the Dynamics of the State Evolution During Quantum Annealing. 310-321 - Karolos-Alexandros Tsakalos, Georgios Ch. Sirakoulis, Andrew Adamatzky, Jim Smith:
Protein Structured Reservoir Computing for Spike-Based Pattern Recognition. 322-331 - Bosheng Song, Kenli Li, Xiangxiang Zeng:
Monodirectional Evolutional Symport Tissue P Systems With Promoters and Cell Division. 332-342
Volume 33, Number 3, March 2022
- Shixiong Zhao, Fanxin Li, Xusheng Chen, Xiuxian Guan, Jianyu Jiang, Dong Huang, Yuhao Qing, Sen Wang, Peng Wang, Gong Zhang, Cheng Li, Ping Luo, Heming Cui:
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training. 489-506 - Yishu Du, Loris Marchal, Guillaume Pallez, Yves Robert:
Optimal Checkpointing Strategies for Iterative Applications. 507-522 - Marcin Copik, Tobias Grosser, Torsten Hoefler, Paolo Bientinesi, Benjamin Berkels:
Work-Stealing Prefix Scan: Addressing Load Imbalance in Large-Scale Image Registration. 523-535 - Wei Yang Bryan Lim, Jer Shyuan Ng, Zehui Xiong, Jiangming Jin, Yang Zhang, Dusit Niyato, Cyril Leung, Chunyan Miao:
Decentralized Edge Intelligence: A Dynamic Resource Allocation Framework for Hierarchical Federated Learning. 536-550 - Yiwen Gao, Jia Xu, Hongbing Wang:
cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope. 551-568 - Hui Cai, Fan Ye, Yuanyuan Yang, Yanmin Zhu, Jie Li, Fu Xiao:
Online Pricing and Trading of Private Data in Correlated Queries. 569-585 - Yuan Wang, Hideaki Ishii, François Bonnet, Xavier Défago:
Resilient Real-Valued Consensus in Spite of Mobile Malicious Agents on Directed Graphs. 586-603 - Oliver Giersch, Jörg Nolte:
Fast and Portable Concurrent FIFO Queues With Deterministic Memory Reclamation. 604-616 - Chavit Denninnart, Mohsen Amini Salehi:
Harnessing the Potential of Function-Reuse in Multimedia Cloud Systems. 617-629 - Jed Mills, Jia Hu, Geyong Min:
Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing. 630-641 - John Gounley, Madhurima Vardhan, Erik W. Draeger, Pedro Valero-Lara, Shirley V. Moore, Amanda Randles:
Propagation Pattern for Moment Representation of the Lattice Boltzmann Method. 642-653 - Quan Zheng, Tao Yang, Yuanzhi Kan, Xiaobin Tan, Jian Yang, Xiaofeng Jiang:
On the Analysis of Cache Invalidation With LRU Replacement. 654-666 - Tayebeh Bahreini, Hossein Badri, Daniel Grosu:
Mechanisms for Resource Allocation and Pricing in Mobile Edge Computing Systems. 667-682 - Xing Chen, Jianshan Zhang, Bing Lin, Zheyi Chen, Katinka Wolter, Geyong Min:
Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments. 683-697 - Anandarup Mukherjee, Pallav Kumar Deb, Sudip Misra:
Timed Loops for Distributed Storage in Wireless Networks. 698-709 - Umar Ibrahim Minhas, Roger F. Woods, Dimitrios S. Nikolopoulos, Georgios Karakonstantis:
Efficient, Dynamic Multi-Task Execution on FPGA-Based Computing Systems. 710-722 - Linsong Cheng, Jiliang Wang, Yinghui Li:
ViTrack: Efficient Tracking on the Edge for Commodity Video Surveillance Systems. 723-735
Volume 33, Number 4, April 2022
- Sadaf R. Alam, Lois Curfman McInnes, Kengo Nakajima:
IEEE Special Issue on Innovative R&D Toward the Exascale Era. 736-738 - Andrea Borghesi, Martin Molan, Michela Milano, Andrea Bartolini:
Anomaly Detection and Anticipation in High Performance Computing Systems. 739-750 - Yiming Wang, Weizhe Zhang, Meng Hao, Zheng Wang:
Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach. 751-764 - Chao Chen, Greg Eisenhauer, Santosh Pande:
Near-Zero Downtime Recovery From Transient-Error-Induced Crashes. 765-778 - Juan M. Cebrian, Thibaud Balem, Adrián Barredo, Marc Casas, Miquel Moretó, Alberto Ros, Alexandra Jimborean:
Compiler-Assisted Compaction/Restoration of SIMD Instructions. 779-791 - Lazaros Papadopoulos, Dimitrios Soudris, Christoph W. Kessler, August Ernstsson, Johan Ahlqvist, Nikos Vasilas, Athanasios I. Papadopoulos, Panos Seferlis, Charles Prouveur, Matthieu Haefele, Samuel Thibault, Athanasios Salamanis, Theodoros Ioakimidis, Dionysios D. Kehagias:
EXA2PRO: A Framework for High Development Productivity on Heterogeneous Computing Systems. 792-804 - Christian R. Trott, Damien Lebrun-Grandié, Daniel Arndt, Jan Ciesko, Vinh Q. Dang, Nathan D. Ellingwood, Rahulkumar Gayatri, Evan Harvey, Daisy S. Hollman, Dan Ibanez, Nevin Liber, Jonathan R. Madsen, Jeff Miles, David Poliakoff, Amy Powell, Sivasankaran Rajamanickam, Mikael Simberg, Dan Sunderland, Bruno Turcksin, Jeremiah J. Wilke:
Kokkos 3: Programming Model Extensions for the Exascale Era. 805-817 - André Merzky, Matteo Turilli, Mikhail Titov, Aymen Al-Saadi, Shantenu Jha:
Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms. 818-829 - Jonas H. Müller Korndörfer, Ahmed Eleliemy, Ali Mohammed, Florina M. Ciorba:
LB4OMP: A Dynamic Load Balancing Library for Multithreaded Applications. 830-841 - Junchao Zhang, Jed Brown, Satish Balay, Jacob Faibussowitsch, Matthew G. Knepley, Oana Marin, Richard Tran Mills, Todd S. Munson, Barry F. Smith, Stefano Zampini:
The PetscSF Scalable Communication Layer. 842-853 - Keren Zhou, Xiaozhu Meng, Ryuichi Sai, Dejan Grubisic, John M. Mellor-Crummey:
An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications. 854-865 - Ivy Bo Peng, Maya B. Gokhale, Karim Youssef, Keita Iwabuchi, Roger Pearce:
Enabling Scalable and Extensible Memory-Mapped Datastores in Userspace. 866-877 - Lipeng Wan, Axel Huebl, Junmin Gu, Franz Poeschel, Ana Gainaru, Ruonan Wang, Jieyang Chen, Xin Liang, Dmitry Ganyushin, Todd S. Munson, Ian T. Foster, Jean-Luc Vay, Norbert Podhorszki, Kesheng Wu, Scott Klasky:
Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization. 878-890 - Houjun Tang, Quincey Koziol, John Ravi, Suren Byna:
Transparent Asynchronous Parallel I/O Using Background Threads. 891-902 - Jérome Soumagne, Jordan Henderson, Mohamad Chaarawi, Neil Fortner, M. Scot Breitenfeld, Songyu Lu, Dana Robinson, Elena Pourmal, Johann Lombardi:
Accelerating HDF5 I/O for Exascale Using DAOS. 903-914 - Sayan Ghosh, Nathan R. Tallent, Mahantesh Halappanavar:
Characterizing Performance of Graph Neighborhood Communication Patterns. 915-928 - Arindam Khanda, Sriram Srinivasan, Sanjukta Bhowmick, Boyana Norris, Sajal K. Das:
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in Large-Scale Dynamic Networks. 929-940 - Xinbiao Gan, Yiming Zhang, Ruibo Wang, Tiejun Li, Tiaojie Xiao, Ruigeng Zeng, Jie Liu, Kai Lu:
TianheGraph: Customizing Graph Search for Graph500 on Tianhe Supercomputer. 941-951 - Robert F. Bird, Nigel Tan, Scott V. Luedtke, Stephen Lien Harrell, Michela Taufer, Brian J. Albright:
VPIC 2.0: Next Generation Particle-in-Cell Simulations. 952-963 - Sameh Abdulah, Qinglei Cao, Yu Pei, George Bosilca, Jack J. Dongarra, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun:
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC. 964-976 - Stephen Hudson, Jeffrey Larson, John-Luke Navarro, Stefan M. Wild:
libEnsemble: A Library to Coordinate the Concurrent Evaluation of Dynamic Ensembles of Calculations. 977-988 - Ariful Azad, Oguz Selvitopi, Md Taufique Hussain, John R. Gilbert, Aydin Buluç:
Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems. 989-1001 - Gordon Euhyun Moon, Hyoukjun Kwon, Geonhwa Jeong, Prasanth Chatarasi, Sivasankaran Rajamanickam, Tushar Krishna:
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication. 1002-1014 - Anil Gaihre, Xiaoye Sherry Li, Hang Liu:
gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs. 1015-1026 - Neil Lindquist, Piotr Luszczek, Jack J. Dongarra:
Accelerating Restarted GMRES With Mixed Precision Arithmetic. 1027-1037
Volume 33, Number 5, May 2022
- Fabio Montagna, Stefan Mach, Simone Benatti, Angelo Garofalo, Gianmarco Ottavi, Luca Benini, Davide Rossi, Giuseppe Tagliavini:
A Low-Power Transprecision Floating-Point Cluster for Efficient Near-Sensor Data Analytics. 1038-1053 - Fei Lei, Dezun Dong, Xiangke Liao:
Exploring the Galaxyfly Family to Build Flexible-Scale Interconnection Networks. 1054-1068 - Zongyi Zhao, Xingang Shi, Zhiliang Wang, Qing Li, Han Zhang, Xia Yin:
Efficient and Accurate Flow Record Collection With HashFlow. 1069-1083 - Jiantong Jiang, Zeyi Wen, Ze-ke Wang, Bingsheng He, Jian Chen:
Parallel and Distributed Structured SVM Training. 1084-1096 - Jian Liu, Peilun Li, Raymond Cheng, N. Asokan, Dawn Song:
Parallel and Asynchronous Smart Contract Execution. 1097-1108 - Jiaqi Liu, Shiyue Huang, Deng Li, Sheng Wen, Hui Liu:
Addictive Incentive Mechanism in Crowdsensing From the Perspective of Behavioral Economics. 1109-1127 - Shaoqi Wang, Aidi Pi, Xiaobo Zhou:
Elastic Parameter Server: Accelerating ML Training With Scalable Resource Scheduling. 1128-1143 - Xiaoyu Xia, Feifei Chen, Qiang He, Guangming Cui, John C. Grundy, Mohamed Almorsy Abdelrazek, Xiaolong Xu, Hai Jin:
Data, User and Power Allocations for Caching in Multi-Access Edge Computing. 1144-1155 - Bruno Donassolo, Arnaud Legrand, Panayotis Mertikopoulos, Ilhem Fajjari:
Online Reconfiguration of IoT Applications in the Fog: The Information-Coordination Trade-Off. 1156-1172 - Lipeng Wang, Qiong Luo, Shengen Yan:
DIESEL+: Accelerating Distributed Deep Learning Tasks on Image Datasets. 1173-1184 - Zhi Ma, Sheng Zhang, Zhiqi Chen, Tao Han, Zhuzhong Qian, Mingjun Xiao, Ning Chen, Jie Wu, Sanglu Lu:
Towards Revenue-Driven Multi-User Online Task Offloading in Edge Computing. 1185-1198 - Jing Li, Weifa Liang, Wenzheng Xu, Zichuan Xu, Xiaohua Jia, Wanlei Zhou, Jin Zhao:
Maximizing User Service Satisfaction for Delay-Sensitive IoT Applications in Edge Computing. 1199-1212 - Takuya Kojima, Ayaka Ohwada, Hideharu Amano:
Mapping-Aware Kernel Partitioning Method for CGRAs Assisted by Deep Learning. 1213-1230 - YuAng Chen, Yeh-Ching Chung:
Workload Balancing via Graph Reordering on Multicore Systems. 1231-1245 - Junsong Fu, Na Wang, Baojiang Cui, Bharat K. Bhargava:
A Practical Framework for Secure Document Retrieval in Encrypted Cloud File Systems. 1246-1261 - Zhu Jin, Wen-Kang Jia:
DH-SVRF: A Reconfigurable Unicast/Multicast Forwarding for High-Performance Packet Forwarding Engines. 1262-1275
Volume 33, Number 6, June 2022
- Kiril Dichev, Daniele De Sensi, Dimitrios S. Nikolopoulos, Kirk W. Cameron, Ivor T. A. Spence:
Power Log'n'Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols. 1276-1288 - Feng Zhang, Erkang Xue, Ruixin Guo, Guangzhi Qu, Gansen Zhao, Albert Y. Zomaya:
DS-ADMM++: A Novel Distributed Quantized ADMM to Speed up Differentially Private Matrix Factorization. 1289-1302 - Tsung-Wei Huang, Dian-Lun Lin, Chun-Xun Lin, Yibo Lin:
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System. 1303-1320 - John Augustine, Keerti Choudhary, Avi Cohen, David Peleg, Sumathi Sivasubramaniam, Suman Sourav:
Distributed Graph Realizations. 1321-1337 - Maciej Kokocinski, Tadeusz Kobus, Pawel T. Wojciechowski:
On Mixing Eventual and Strong Consistency: Acute Cloud Types. 1338-1356 - Yuxuan Li, Lin Gan, Mingcheng Chen, Yaojian Chen, Haitian Lu, Chao-Yang Lu, Jian-Wei Pan, Haohuan Fu, Guangwen Yang:
Benchmarking 50-Photon Gaussian Boson Sampling on the Sunway TaihuLight. 1357-1372 - Peijin Cong, Zhixing Zhang, Junlong Zhou, Xin Liu, Yao Liu, Tongquan Wei:
Customer Adaptive Resource Provisioning for Long-Term Cloud Profit Maximization under Constrained Budget. 1373-1392 - Haotian Wu, Zhe Peng, Songtao Guo, Yuanyuan Yang, Bin Xiao:
VQL: Efficient and Verifiable Cloud Query Services for Blockchain Systems. 1393-1406 - Kwangsung Oh, Minmin Zhang, Abhishek Chandra, Jon B. Weissman:
Network Cost-Aware Geo-Distributed Data Analytics System. 1407-1420 - Ziliang Wang, Xiaohong Zhang, Meng Yan, Ling Xu, Dan Yang:
HSA-Net: Hidden-State-Aware Networks for High-Precision QoS Prediction. 1421-1435 - Brian R. Tauro, Conghao Liu, Kyle C. Hale:
Modeling Speedup in Multi-OS Environments. 1436-1450 - Chen Zhao, Wu Gao, Feiping Nie, Huiyang Zhou:
A Survey of GPU Multitasking Methods Supported by Hardware Architecture. 1451-1463 - Ronan-Alexandre Cherrueau, Marie Delavergne, Alexandre van Kempen, Adrien Lebre, Dimitri Pertin, Javier Rojas Balderrama, Anthony Simonet, Matthieu Simonin:
EnosLib: A Library for Experiment-Driven Research in Distributed Computing. 1464-1477 - Abhishek Kumar Jain, Douglas L. Maskell, Suhaib A. Fahmy:
Coarse Grained FPGA Overlay for Rapid Just-In-Time Accelerator Compilation. 1478-1490 - Qingzhi Liu, Tiancong Xia, Long Cheng, Merijn van Eijk, Tanir Ozcelebi, Ying Mao:
Deep Reinforcement Learning for Load-Balancing Aware Network Control in IoT Edge Systems. 1491-1502 - Yan Ding, Kenli Li, Chubo Liu, Keqin Li:
A Potential Game Theoretic Approach to Computation Offloading Strategy Optimization in End-Edge-Cloud Computing. 1503-1519
Volume 33, Number 7, July 2022
- Hoon Sung Chwa, Hyeongboo Baek, Jinkyu Lee:
Necessary Feasibility Analysis for Mixed-Criticality Real-Time Embedded Systems. 1520-1537 - Jiahui Li, Hao Wu, Jiapei Chen, Qiang He, Ching-Hsien Hsu:
Topology-Aware Neural Model for Highly Accurate QoS Prediction. 1538-1552 - Zaifeng Pan, Feng Zhang, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
Exploring Data Analytics Without Decompression on Embedded GPU Systems. 1553-1568 - Zhaorui Zhang, Cho-Li Wang:
SaPus: Self-Adaptive Parameter Update Strategy for DNN Training on Multi-GPU Clusters. 1569-1580 - Yu Chen, Sheng Zhang, Yibo Jin, Zhuzhong Qian, Mingjun Xiao, Jidong Ge, Sanglu Lu:
LOCUS: User-Perceived Delay-Aware Service Placement and User Allocation in MEC Environment. 1581-1592 - Yanze Huang, Limei Lin, Sun-Yuan Hsieh:
A Fast $f(r, k+1)/k$f(r, k+1)/k-Diagnosis for Interconnection Networks Under MM* Model. 1593-1604 - Woojoong Kim, Chan-Hyun Youn:
Cooperative Scheduling Schemes for Explainable DNN Acceleration in Satellite Image Analysis and Retraining. 1605-1618 - Zihang Yao, Rong Chen, Binyu Zang, Haibo Chen:
Wukong+G: Fast and Concurrent RDF Query Processing Using RDMA-Assisted GPU Graph Exploration. 1619-1635 - Zecheng Li, Haotian Wu, Ricky Lap-Hou Lao, Songtao Guo, Yuanyuan Yang, Bin Xiao:
Pistis: Issuing Trusted and Authorized Certificates With Distributed Ledger and TEE. 1636-1649 - Sheng Yue, Ju Ren, Nan Qiao, Yongmin Zhang, Hongbo Jiang, Yaoxue Zhang, Yuanyuan Yang:
TODG: Distributed Task Offloading With Delay Guarantees for Edge Computing. 1650-1665 - Yinjin Fu, Yutong Lu, Zhiguang Chen, Yang Wu, Nong Xiao:
Design and Simulation of Content-Aware Hybrid DRAM-PCM Memory System. 1666-1677 - Sepideh Safari, Heba Khdr, Pourya Gohari-Nazari, Mohsen Ansari, Shaahin Hessabi, Jörg Henkel:
TherMa-MiCs: Thermal-Aware Scheduling for Fault-Tolerant Mixed-Criticality Systems. 1678-1694 - Muhammed Tawfiqul Islam, Shanika Karunasekera, Rajkumar Buyya:
Performance and Cost-Efficient Spark Job Scheduling Based on Deep Reinforcement Learning in Cloud Computing Environments. 1695-1710 - Martin Kleppmann, Dominic P. Mulligan, Victor B. F. Gomes, Alastair R. Beresford:
A Highly-Available Move Operation for Replicated Trees. 1711-1724 - Hyungmin Cho, Jeesoo Lee, Jaejin Lee:
FARNN: FPGA-GPU Hybrid Acceleration Platform for Recurrent Neural Networks. 1725-1738 - Limei Lin, Yanze Huang, Yuhang Lin, Sun-Yuan Hsieh, Li Xu:
FFNLFD: Fault Diagnosis of Multiprocessor Systems at Local Node With Fault-Free Neighbors Under PMC Model and MM* Model. 1739-1751 - Yuxing Yang, Lingling Zhang:
Hamiltonian Paths of $k$k-ary $n$n-cubes Avoiding Faulty Links and Passing Through Prescribed Linear Forests. 1752-1760
Volume 33, Number 8, August 2022
- Qi Zhang, Yi Liu, Tao Liu:
iBalancer: Load-Aware in-Server Flow Scheduling for Sub-Millisecond Tail Latency. 1761-1774 - Fahao Chen, Peng Li, Toshiaki Miyazaki, Celimuge Wu:
FedGraph: Federated Graph Learning With Intelligent Sampling. 1775-1786 - Dixit Bhatta, Lena Mashayekhy:
A Bifactor Approximation Algorithm for Cloudlet Placement in Edge Computing. 1787-1798 - Yusheng Hua, Xuanhua Shi, Kang He, Hai Jin, Wei Xie, Ligang He, Yong Chen:
LoomIO: Object-Level Coordination in Distributed File Systems. 1799-1810 - Bingting Jiang, Zhuo Tang, Xiong Xiao, Jing Yao, Ronghui Cao, Kenli Li:
Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment. 1811-1824 - Kaihua Fu, Wei Zhang, Quan Chen, Deze Zeng, Minyi Guo:
Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum. 1825-1840 - Xiaojie Wang, Zhaolong Ning, Lei Guo, Song Guo, Xinbo Gao, Guoyin Wang:
Online Learning for Distributed Computation Offloading in Wireless Powered Mobile Edge Computing Networks. 1841-1855 - Qinglei Cao, George Bosilca, Nuria Losada, Wei Wu, Dong Zhong, Jack J. Dongarra:
Evaluating Data Redistribution in PaRSEC. 1856-1872 - Liang Yuan, Qiang He, Feifei Chen, Jun Zhang, Lianyong Qi, Xiaolong Xu, Yang Xiang, Yun Yang:
CSEdge: Enabling Collaborative Edge Storage for Multi-Access Edge Computing Based on Blockchain. 1873-1887 - Mingchuan Wu, Yangjun Wu, Honghui Shang, Ying Liu, Huimin Cui, Fang Li, Xiaohui Duan, Yunquan Zhang, Xiaobing Feng:
Scaling Poisson Solvers on Many Cores via MMEwald. 1888-1901 - Xiao-Wen Qin, Rong-Xia Hao, Jie Wu:
Construction of Dual-CISTs on an Infinite Class of Networks. 1902-1910 - Zheyi Chen, Jia Hu, Geyong Min, Chunbo Luo, Tarek A. El-Ghazawi:
Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning. 1911-1923 - Daniele Bianchi, Federico Avanzini, Adriano Baratè, Luca A. Ludovico, Giorgio Presti:
A GPU-Oriented Application Programming Interface for Digital Audio Workstations. 1924-1938 - Xiao-Yan Li, Wanling Lin, Ximeng Liu, Cheng-Kuan Lin, Kung-Jui Pai, Jou-Ming Chang:
Completely Independent Spanning Trees on BCCC Data Center Networks With an Application to Fault-Tolerant Routing. 1939-1952 - Kai Lu, Nannan Zhao, Jiguang Wan, Changhong Fei, Wei Zhao, Tongliang Deng:
TridentKV: A Read-Optimized LSM-Tree Based KV Store via Adaptive Indexing and Space-Efficient Partitioning. 1953-1966 - Zhenkun Cai, Xiao Yan, Kaihao Ma, Yidi Wu, Yuzhen Huang, James Cheng, Teng Su, Fan Yu:
TensorOpt: Exploring the Tradeoffs in Distributed DNN Training With Auto-Parallelism. 1967-1981 - Tao Shi, Hui Ma, Gang Chen, Sven Hartmann:
Cost-Effective Web Application Replication and Deployment in Multi-Cloud Environment. 1982-1995 - Yongheng Deng, Feng Lyu, Ju Ren, Huaqing Wu, Yuezhi Zhou, Yaoxue Zhang, Xuemin Shen:
AUCTION: Automated and Quality-Aware Client Selection Framework for Efficient Federated Learning. 1996-2009
Volume 33, Number 9, September 2022
- Manish Parashar:
EiC Editorial - Advancing Reproducibility in Parallel and Distributed Systems Research. 2010 - Stephen Lien Harrell, Scott Michael, Carlos Maltzahn:
Advancing Adoption of Reproducibility in HPC: A Preface to the Special Section. 2011-2013 - Mert Hidayetoglu, Tekin Biçer, Simon Garcia de Gonzalo, Bin Ren, Doga Gürsoy, Rajkumar Kettimuthu, Ian T. Foster, Wen-Mei W. Hwu:
MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging. 2014-2031 - Zejia Fan, Yuchen Gu, Zhewen Hao, Yueyang Pan, Pengcheng Xu, Yuxuan Yan, Fangyuan Yang, Zhenxin Fu, Yun Liang:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Peking University. 2032-2034 - Nicole Prindle, Ali Kazmi, Aman Jain, Albert Chen, Marissa Sorkin, Sudhanshu Agarwal, Richard W. Vuduc, Vijay Thakkar:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Georgia Tech. 2035-2038 - Jan Kleine, Rahul Steiger, Simon Wachter, Emir Isman, Simon Jacob, Dario Romaniello:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From ETH Zürich. 2039-2042 - Xiaochen Li, Maximilian Apodaca, Arunav Gupta, Zihao Kong, Hongyi Pan, Hongyu Zhou, Mary P. Thomas, Martin Kandes, Zhaoyi Li, Mahidhar Tatineni, Lewis Carroll:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From University of California San Diego. 2043-2046 - Yuchen Liu, Yixuan Meng, Kaiyuan Xu, Zijun Xu, Tianyuan Wu, Yiwei Yang, Shu Yin:
Reproducibility: Performance Evaluation of MemXCT on Azure CycleCloud Platform. 2047-2049 - Runxin Zhong, Jiajie Chen, Chen Zhang, Mingshu Zhai, Zeyu Song, Yutian Wang, Wentao Han, Lin Gan, Jidong Zhai:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Tsinghua University. 2050-2053 - Griffin Dube, Cavender Holt, John Hollowell, Sarah Placke, Sansriti Ranjan, Nikolas Heitzig, Jon Calhoun:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Clemson University. 2054-2057 - Shenggui Li, Bu-Sung Lee:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Nanyang Technological University. 2058-2061 - Brock Davis, Juan Paez, Jack Gaither, Joe A. Garcia:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From the University of Texas at Austin. 2062-2065 - Yuan Meng, Sanmukh R. Kuppannagari, Rajgopal Kannan, Viktor K. Prasanna:
PPOAccel: A High-Throughput Acceleration Framework for Proximal Policy Optimization. 2066-2078 - Xiaoyong Tang, Wenbiao Cao, Huiya Tang, Tan Deng, Jing Mei, Yi Liu, Cheng Shi, Meng Xia, Zeng Zeng:
Cost-Efficient Workflow Scheduling Algorithm for Applications With Deadline Constraint on Heterogeneous Clouds. 2079-2092 - Xu Zhang, Zhengnan Qi, Geyong Min, Wang Miao, Qilin Fan, Zhan Ma:
Cooperative Edge Caching Based on Temporal Convolutional Networks. 2093-2105 - Antonis Papaioannou, Kostas Magoutis:
Addressing the Read-Performance Impact of Reconfigurations in Replicated Key-Value Stores. 2106-2119 - Meng Ma, Jingbin Zhang, Ping Wang:
DePo: Dynamically Offload Expensive Event Processing to the Edge of Cyber-Physical Systems. 2120-2132 - Aldenio Burgos, Eduardo Alchieri, Fernando Luís Dotti, Fernando Pedone:
Exploiting Concurrency in Sharded Parallel State Machine Replication. 2133-2147 - Joshua Mack, Samet E. Arda, Ümit Y. Ogras, Ali Akoglu:
Performant, Multi-Objective Scheduling of Highly Interleaved Task Graphs on Heterogeneous System on Chip Devices. 2148-2162 - Minghao Zhao, Zhenhua Li, Wei Liu, Jian Chen, Xingyao Li:
UFC2: User-Friendly Collaborative Cloud. 2163-2182 - Huifang Li, Danjing Wang, MengChu Zhou, Yushun Fan, Yuanqing Xia:
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud. 2183-2197 - Zhenli He, Kenli Li, Keqin Li:
Cost-Efficient Server Configuration and Placement for Mobile Edge Computing. 2198-2212 - Penghao Zhang, Heng Pan, Zhenyu Li, Penglai Cui, Ru Jia, Peng He, Zhibin Zhang, Gareth Tyson, Gaogang Xie:
NetSHa: In-Network Acceleration of LSH-Based Distributed Search. 2213-2229 - Thomas Faingnaert, Tim Besard, Bjorn De Sutter:
Flexible Performant GEMM Kernels on GPUs. 2230-2248 - Jiamin Cao, Ying Liu, Yu Zhou, Lin He, Chen Sun, Yangyang Wang, Mingwei Xu:
CoFilter: High-Performance Switch-Accelerated Stateful Packet Filter for Bare-Metal Servers. 2249-2262
Volume 33, Number 10, October 2022
- Yuan Li, Ahmed Louri, Avinash Karanth:
SPRINT: A High-Performance, Energy-Efficient, and Scalable Chiplet-Based Accelerator With Photonic Interconnects for CNN Inference. 2332-2345 - Huawei Huang, Zhengyu Yue, Xiaowen Peng, Liuding He, Wuhui Chen, Hong-Ning Dai, Zibin Zheng, Song Guo:
Elastic Resource Allocation Against Imbalanced Transaction Assignments in Sharding-Based Permissioned Blockchains. 2372-2385 - Haowei Chen, Shuiguang Deng, Hongze Zhu, Hailiang Zhao, Rong Jiang, Schahram Dustdar, Albert Y. Zomaya:
Mobility-Aware Offloading and Resource Allocation for Distributed Services Collaboration. 2428-2443 - Dimitri Kagaris, Sourav Dutta, Stijn Eyerman:
Execution Time Estimation of Multithreaded Programs With Critical Sections. 2470-2481 - Chuan Pham, Duong Tuan Nguyen, Nguyen Hoang Tran, Kim Khoa Nguyen, Mohamed Cheriet:
Dynamic Controller/Switch Mapping: A Service Oriented Assignment Approach. 2482-2495 - Seyed Morteza Nabavinejad, Sherief Reda, Masoumeh Ebrahimi:
Coordinated Batching and DVFS for DNN Inference on GPU Accelerators. 2496-2508 - Hanlin Lu, Ting He, Shiqiang Wang, Changchang Liu, Mehrdad Mahdavi, Vijaykrishnan Narayanan, Kevin S. Chan, Stephen Pasteris:
Communication-Efficient $k$k-Means for Edge-Based Machine Learning. 2509-2523 - Francesca Fossati, Stéphane Rovedakis, Stefano Secci:
Distributed Algorithms for Multi-Resource Allocation. 2524-2539 - Lei Yang, Fulin Wen, Jiannong Cao, Zhenyu Wang:
EdgeTB: A Hybrid Testbed for Distributed Machine Learning at the Edge With High Fidelity. 2540-2553 - Wen Xia, Can Wei, Zhenhua Li, Xuan Wang, Xiangyu Zou:
NetSync: A Network Adaptive and Deduplication-Inspired Delta Synchronization Approach for Cloud Storage Services. 2554-2570 - Yi Qiao, Menghao Zhang, Yu Zhou, Xiao Kong, Han Zhang, Mingwei Xu, Jun Bi, Jilong Wang:
NetEC: Accelerating Erasure Coding Reconstruction With In-Network Aggregation. 2571-2583 - Deke Guo, Bangbang Ren, Guoming Tang, Lailong Luo, Tao Chen, Xiaoming Fu:
Optimal Embedding of Aggregated Service Function Tree. 2584-2596 - Xianzhi Zhang, Yipeng Zhou, Di Wu, Miao Hu, James Xi Zheng, Min Chen, Song Guo:
Optimizing Video Caching at the Edge: A Hybrid Multi-Point Process Approach. 2597-2611 - Miao Hu, Di Wu, Yipeng Zhou, Xu Chen, Min Chen:
Incentive-Aware Autonomous Client Participation in Federated Learning. 2612-2627 - Sijie Shen, Xingda Wei, Rong Chen, Haibo Chen, Binyu Zang:
DrTM+B: Replication-Driven Live Reconfiguration for Fast and General Distributed Transaction Processing. 2628-2643 - Geyao Cheng, Deke Guo, Lailong Luo, Junxu Xia, Siyuan Gu:
LOFS: A Lightweight Online File Storage Strategy for Effective Data Deduplication at Network Edge. 2263-2276 - Zhonghua Wang, Ting Yao, Jiguang Wan, Hong Jiang, Qiu Cui, Liu Tang, Yiwen Zhang, Qiuyang Zhang:
ComboTree: A Persistent Indexing Structure With Universal Operational Efficiency and Scalability. 2277-2290 - Anran Li, Lan Zhang, Junhao Wang, Feng Han, Xiang-Yang Li:
Privacy-Preserving Efficient Federated-Learning Model Debugging. 2291-2303 - Mostafa Hadizadeh, Elham Cheshmikhani, Maysam Rahmanpour, Onur Mutlu, Hossein Asadi:
CoPA: Cold Page Awakening to Overcome Retention Failures in STT-MRAM Based I/O Buffers. 2304-2317 - TianZhang He, Adel Nadjaran Toosi, Rajkumar Buyya:
CAMIG: Concurrency-Aware Live Migration Management of Multiple Virtual Machines in SDN-Enabled Clouds. 2318-2331 - Shuiguang Deng, Hailiang Zhao, Zhengzhe Xiang, Cheng Zhang, Rong Jiang, Ying Li, Jianwei Yin, Schahram Dustdar, Albert Y. Zomaya:
Dependent Function Embedding for Distributed Serverless Edge Computing. 2346-2357 - Yuanyuan Zeng, Kenli Li, Xu Zhou, Wensheng Luo, Yunjun Gao:
An Efficient Index-Based Approach to Distributed Set Reachability on Small-World Graphs. 2358-2371 - Liangliang Xu, Min Lyu, Qiliang Li, Lingjiang Xie, Cheng Li, Yinlong Xu:
SelectiveEC: Towards Balanced Recovery Load on Erasure-Coded Storage Systems. 2386-2400 - Jun Li, Yumeng Shao, Kang Wei, Ming Ding, Chuan Ma, Long Shi, Zhu Han, H. Vincent Poor:
Blockchain Assisted Decentralized Federated Learning (BLADE-FL): Performance Analysis and Resource Allocation. 2401-2415 - Panagiotis Gkikopoulos, Valerio Schiavoni, Josef Spillner:
Decentralised Data Quality Control in Ground Truth Production for Autonomic Decisions. 2416-2427 - Renping Liu, Zhenhua Tan, Linbo Long, Yu Wu, Yujuan Tan, Duo Liu:
Improving Fairness for SSD Devices through DRAM Over-Provisioning Cache Management. 2444-2454 - Fan Jiang, Rafael K. V. Maeda, Jun Feng, Shixi Chen, Lin Chen, Xiao Li, Jiang Xu:
Fast and Accurate Statistical Simulation of Shared-Memory Applications on Multicore Systems. 2455-2469 - Dajia Peng, Yunlong Feng, Yong Liu, Xin Liu, Wei Xue, Dexun Chen, Jiawei Song, Zuoning Chen:
Jdebug: A Fast, Non-Intrusive and Scalable Fault Locating Tool for Ten-Million-Scale Parallel Applications. 3491-3504 - Xiaoyu He, Zibin Zheng, Chuan Chen, Yuren Zhou, Chuan Luo, Qingwei Lin:
Distributed Evolution Strategies for Black-Box Stochastic Optimization. 3718-3731 - Bing Zhang, Tevfik Kosar:
SMURF: Efficient and Scalable Metadata Access for Distributed Applications. 3915-3928 - Zhenxin Li, Bing Jiao, Shuibing He, Weikuan Yu:
PhaST: Hierarchical Concurrent Log-Free Skip List for Persistent Memory. 3929-3941 - Mozhengfu Liu, Xueyan Tang:
Busy-Time Scheduling on Heterogeneous Machines: Algorithms and Analysis. 3942-3958 - Mostafa Rizk, Kevin J. M. Martin, Jean-Philippe Diguet:
Run-Time Remapping Algorithm of Dataflow Actors on NoC-Based Heterogeneous MPSoCs. 3959-3976 - Elmira Karimi, Nicolas Bohm Agostini, Shi Dong, David R. Kaeli:
VCSR: An Efficient GPU Memory-Aware Sparse Format. 3977-3989 - Antonis Psistakis, Nikos Chrysos, Fabien Chaix, Marios Asiminakis, Michalis Gianioudis, Pantelis Xirouchakis, Vassilis Papaefstathiou, Manolis Katevenis:
Optimized Page Fault Handling During RDMA. 3990-4005 - Jie Zhang, Song Guo, Zhihao Qu, Deze Zeng, Haozhao Wang, Qifeng Liu, Albert Y. Zomaya:
Adaptive Vertical Federated Learning on Unbalanced Features. 4006-4018 - Shuai Zhao, Xiaotian Dai, Iain Bate:
DAG Scheduling and Analysis on Multi-Core Systems by Modelling Parallelism and Dependency. 4019-4038 - Shusen Yang, Zhanhua Zhang, Cong Zhao, Xin Song, Siyan Guo, Hailiang Li:
CNNPC: End-Edge-Cloud Collaborative CNN Inference With Joint Model Partition and Compression. 4039-4056 - Jiesong Liu, Feng Zhang, Hourun Li, Dalin Wang, Weitao Wan, Xiaokun Fang, Jidong Zhai, Xiaoyong Du:
Exploring Query Processing on CPU-GPU Integrated Edge Device. 4057-4070 - Murat Akpinar, Ece Güran Schmidt, Klaus Werner Schmidt:
Highly Accurate Clock Synchronization With Drift Correction for the Controller Area Network. 4071-4082 - Qiang Wang, Xinxin Mei, Hai Liu, Yiu-Wing Leung, Zongpeng Li, Xiaowen Chu:
Energy-Aware Non-Preemptive Task Scheduling With Deadline Constraint in DVFS-Enabled Heterogeneous Clusters. 4083-4099 - Xu Jiang, Nan Guan, Maolin Yang, Yang Wang, Yue Tang, Wang Yi:
Real-Time Scheduling of Parallel Task Graphs With Critical Sections Across Different Vertices. 4117-4133 - Ana Gainaru, Lipeng Wan, Ruonan Wang, Eric Suchyta, Jieyang Chen, Norbert Podhorszki, James Kress, David Pugmire, Scott Klasky:
Understanding the Impact of Data Staging for Coupled Scientific Workflows. 4134-4147 - Lanlan Rui, Dai Song, Shiyou Chen, Yingtai Yang, Yang Yang, Zhipeng Gao:
Content Collaborative Caching Strategy in the Edge Maintenance of Communication Network: A Joint Download Delay and Energy Consumption Method. 4148-4163 - João Gonçalves, Miguel Matos, Rodrigo Rodrigues:
SconeKV: A Scalable, Strongly Consistent Key-Value Store. 4164-4175 - Chen Zhang, Yinbin Miao, Qingyuan Xie, Yu Guo, Hongwei Du, Xiaohua Jia:
Privacy-Preserving Deduplication of Sensor Compressed Data in Distributed Fog Computing. 4176-4191 - Shengan Zheng, Jingyu Wang, Dongliang Xue, Jiwu Shu, Linpeng Huang:
Hydra: A Decentralized File System for Persistent Memory and RDMA Networks. 4192-4206 - Song Yang, Lei Jiao, Ramin Yahyapour, Jiannong Cao:
Online Orchestration of Collaborative Caching for Multi-Bitrate Videos in Edge Computing. 4207-4220 - Vincenzo Gulisano, Hannaneh Najdataei, Yiannis Nikolakopoulos, Alessandro V. Papadopoulos, Marina Papatriantafilou, Philippas Tsigas:
STRETCH: Virtual Shared-Nothing Parallelism for Scalable and Elastic Stream Processing. 4221-4238 - Jidong Zhai, Liyan Zheng, Feng Zhang, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen:
Detecting Performance Variance for Parallel Applications Without Source Code. 4239-4255 - Chen Tian, Yi Wang, Bingchuan Tian, Yang Zhao, Yuhang Zhou, Chenxu Wang, Hao-Ran Guan, Wanchun Dou, Guihai Chen:
PushBox: Making Use of Every Bit of Time to Accelerate Completion of Data-Parallel Jobs. 4256-4269 - Xiaoyu Xia, Feifei Chen, Qiang He, John C. Grundy, Mohamed Almorsy Abdelrazek, Jun Shen, Athman Bouguettaya, Hai Jin:
Formulating Cost-Effective Data Distribution Strategies Online for Edge Cache Systems. 4270-4281 - Abeda Sultana, Md. Mainul Haque, Li Chen, Fei Xu, Xu Yuan:
Eiffel: Efficient and Fair Scheduling in Adaptive Federated Learning. 4282-4294 - Fei Chen, Zhipeng Li, Changkun Jiang, Tao Xiang, Yuanyuan Yang:
Cloud Object Storage Synchronization: Design, Analysis, and Implementation. 4295-4310 - Yuepeng Li, Deze Zeng, Lin Gu, Quan Chen, Song Guo, Albert Y. Zomaya, Minyi Guo:
Efficient and Secure Deep Learning Inference in Trusted Processor Enabled Edge Clouds. 4311-4325 - Zihao Zeng, Chubo Liu, Zhuo Tang, Kenli Li, Keqin Li:
AccTFM: An Effective Intra-Layer Model Parallelization Strategy for Training Large-Scale Transformer-Based Models. 4326-4338 - Chen Ding, Ting Yao, Hong Jiang, Qiu Cui, Liu Tang, Yiwen Zhang, Jiguang Wan, Zhi-hu Tan:
TriangleKV: Reducing Write Stalls and Write Amplification in LSM-Tree Based KV Stores With Triangle Container in NVM. 4339-4352 - Zhe Qu, Rui Duan, Lixing Chen, Jie Xu, Zhuo Lu, Yao Liu:
Context-Aware Online Client Selection for Hierarchical Federated Learning. 4353-4367 - Andreas Kurth, Björn Forsberg, Luca Benini:
HEROv2: Full-Stack Open-Source Research Platform for Heterogeneous Computing. 4368-4382 - Ali Mohammed, Jonas H. Müller Korndörfer, Ahmed Eleliemy, Florina M. Ciorba:
Automated Scheduling Algorithm Selection and Chunk Parameter Calculation in OpenMP. 4383-4394 - Shaonan Ma, Teng Ma, Kang Chen, Yongwei Wu:
A Survey of Storage Systems in the RDMA Era. 4395-4409 - Mohsen Ansari, Sepideh Safari, Heba Khdr, Pourya Gohari-Nazari, Jörg Henkel, Alireza Ejlali, Shaahin Hessabi:
Power-Aware Checkpointing for Multicore Embedded Systems. 4410-4424 - Konstantinos Iliakis, Helga Timko, Sotirios Xydis, Panagiotis Tsapatsaris, Dimitrios Soudris:
Enabling Large Scale Simulations for Particle Accelerators. 4425-4439 - Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian T. Foster, Franck Cappello:
Optimizing Error-Bounded Lossy Compression for Scientific Data With Diverse Constraints. 4440-4457 - Kai Xu, Jinxiao Zhang, Xiaohui Duan, Xiaobo Wan, Niu Huang, Bertil Schmidt, Weiguo Liu, Guangwen Yang:
Redesigning and Optimizing UCSF DOCK3.7 on Sunway TaihuLight. 4458-4471 - Liangchen Guo, Kai Zhang, X. Sean Wang:
Gaviss : Boosting the Performance of GPU-Accelerated NFV Systems via Data Sharing. 4472-4483 - Ping Chen, Shuibing He, Xuechen Zhang, Shuaiben Chen, Peiyi Hong, Yanlong Yin, Xian-He Sun:
Accelerating Tensor Swapping in GPUs With Self-Tuning Compression. 4484-4498 - Jing Wu, Lin Wang, Qiangyu Pei, Xingqi Cui, Fangming Liu, Tingting Yang:
HiTDL: High-Throughput Deep Learning Inference at the Hybrid Mobile Edge. 4499-4514 - Yongheng Deng, Feng Lyu, Ju Ren, Yi-Chao Chen, Peng Yang, Yuezhi Zhou, Yaoxue Zhang:
Improving Federated Learning With Quality-Aware User Incentive and Auto-Weighted Model Aggregation. 4515-4529 - Shumpei Shiina, Kenjiro Taura:
Improving Cache Utilization of Nested Parallel Programs by Almost Deterministic Work Stealing. 4530-4546 - Xinye Cai, Haiyang Xu, Xiaoping Li, Kang Wang, Long Chen, Rubén Ruiz García, Qingfu Zhang:
A Bi-Objective Learn-and-Deploy Scheduling Method for Bursty and Stochastic Requests on Heterogeneous Cloud Servers. 4547-4562 - Xinchang Zhang, Tianyi Wang:
Elastic and Reliable Bandwidth Reservation Based on Distributed Traffic Monitoring and Control. 4563-4580
Volume 33, Number 11, November 2022
- Jidong Zhai, Min Si, Antonio J. Peña:
Guest Editorial. 2644-2647 - Canh T. Dinh, Nguyen Hoang Tran, Tuan Dung Nguyen, Wei Bao, Amir Rezaei Balef, Bing Bing Zhou, Albert Y. Zomaya:
DONE: Distributed Approximate Newton-type Method for Federated Edge Learning. 2648-2660 - Moming Duan, Duo Liu, Xinyuan Ji, Yu Wu, Liang Liang, Xianzhang Chen, Yujuan Tan, Ao Ren:
Flexible Clustered Federated Learning for Client-Level Data Distribution Shift. 2661-2674 - Jer Shyuan Ng, Wei Yang Bryan Lim, Zehui Xiong, Xianbin Cao, Jiangming Jin, Dusit Niyato, Cyril Leung, Chunyan Miao:
Reputation-Aware Hedonic Coalition Formation for Efficient Serverless Hierarchical Federated Learning. 2675-2686 - Jie Feng, Lei Liu, Qingqi Pei, Keqin Li:
Min-Max Cost Optimization for Efficient Hierarchical Federated Learning in Wireless Edge Networks. 2687-2700 - Jialin Guo, Jie Wu, Anfeng Liu, Neal N. Xiong:
LightFed: An Efficient and Secure Federated Edge Learning System on Model Splitting. 2701-2713 - Yiming Zeng, Yixuan Lin, Yuanyuan Yang, Ji Liu:
Differentially Private Federated Temporal Difference Learning. 2714-2726 - Abolfazl Hashemi, Anish Acharya, Rudrajit Das, Haris Vikalo, Sujay Sanghavi, Inderjit S. Dhillon:
On the Benefits of Multiple Gossip Steps in Communication-Constrained Decentralized Federated Learning. 2727-2739 - Michael Mitzenmacher, Matteo Dell'Amico:
The Supermarket Model With Known and Predicted Service Times. 2740-2751 - Saiqin Long, Wen Wen, Zhetao Li, Kenli Li, Rong Yu, Jiang Zhu:
A Global Cost-Aware Container Scheduling Strategy in Cloud Data Centers. 2752-2766 - Yuan Yao, Shuangyang Liu, Sikai Wu, Jinyu Wang, Jinting Ni, Gang Yang, Yu Zhang:
WAMP$^2$2S: Workload-Aware GPU Performance Model Based Pseudo-Preemptive Real-Time Scheduling for the Airborne Embedded System. 2767-2780 - Zhisheng Ye, Peng Sun, Wei Gao, Tianwei Zhang, Xiaolin Wang, Shengen Yan, Yingwei Luo:
Astraea: A Fair Deep Learning Scheduler for Multi-Tenant GPU Clusters. 2781-2793 - Shreshth Tuli, Giuliano Casale, Nicholas R. Jennings:
MCDS: AI Augmented Workflow Scheduling in Mobile Edge Cloud Computing Systems. 2794-2807 - Rong Gu, Yuquan Chen, Shuai Liu, Haipeng Dai, Guihai Chen, Kai Zhang, Yang Che, Yihua Huang:
Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters. 2808-2820 - Shreshth Tuli, Giuliano Casale, Nicholas R. Jennings:
GOSH: Task Scheduling Using Deep Surrogate Models in Fog Computing Environments. 2821-2833 - Jiajun Li, Hao Zheng, Ke Wang, Ahmed Louri:
SGCNAX: A Scalable Graph Convolutional Neural Network Accelerator With Workload Balancing. 2834-2845 - Mingfan Li, Junshi Chen, Qian Xiao, Fei Wang, Qingcai Jiang, Xuncheng Zhao, Rongfen Lin, Hong An, Xiao Liang, Lixin He:
Bridging the Gap between Deep Learning and Frustrated Quantum Spin System for Extreme-Scale Simulations on New Generation of Sunway Supercomputer. 2846-2859 - Rui Xu, Sheng Ma, Yaohua Wang, Yang Guo, Dongsheng Li, Yuran Qiao:
Heterogeneous Systolic Array Architecture for Compact CNNs Hardware Accelerators. 2860-2871 - Hai Jin, Cong Liu, Haikun Liu, Ruikun Luo, Jiahong Xu, Fubing Mao, Xiaofei Liao:
ReHy: A ReRAM-Based Digital/Analog Hybrid PIM Architecture for Accelerating CNN Training. 2872-2884 - Jintao Meng, Chen Zhuang, Peng Chen, Mohamed Wahib, Bertil Schmidt, Xiao Wang, Haidong Lan, Dou Wu, Minwen Deng, Yanjie Wei, Shengzhong Feng:
Automatic Generation of High-Performance Convolution Kernels on ARM CPUs for Deep Learning. 2885-2899 - Zhuojin Li, Marco Paolieri, Leana Golubchik, Sung-Han Lin, Wumo Yan:
Predicting Throughput of Distributed Stochastic Gradient Descent. 2900-2912 - Ariel Keller Rorabaugh, Silvina Caíno-Lores, Travis Johnston, Michela Taufer:
Building High-Throughput Neural Architecture Search Workflows via a Decoupled Fitness Prediction Engine. 2913-2926 - Qinglong Zhang, Rui Han, Gaofeng Xin, Chi Harold Liu, Guoren Wang, Lydia Y. Chen:
Lightweight and Accurate DNN-Based Anomaly Detection at Edge. 2927-2942 - Farui Wang, Weizhe Zhang, Shichao Lai, Meng Hao, Zheng Wang:
Dynamic GPU Energy Optimization for Machine Learning Training Workloads. 2943-2954 - Amelie Chi Zhou, Jianming Lao, Zhoubin Ke, Yi Wang, Rui Mao:
FarSpot: Optimizing Monetary Cost for HPC Applications in the Cloud Spot Market. 2955-2967 - Wenkai Lv, Quan Wang, Pengfei Yang, Yunqing Ding, Bijie Yi, Zhenyi Wang, Chengmin Lin:
Microservice Deployment in Edge Computing Based on Deep Q Learning. 2968-2978 - Kai Zhong, ZhiBang Yang, Guoqing Xiao, Xingpei Li, Wangdong Yang, Kenli Li:
An Efficient Parallel Reinforcement Learning Approach to Cross-Layer Defense Mechanism in Industrial Control Systems. 2979-2990 - Jing Zeng, Ding Ding, Kaixuan Kang, Huamao Xie, Qian Yin:
Adaptive DRL-Based Virtual Machine Consolidation in Energy-Efficient Cloud Data Center. 2991-3002 - Yuanhao Yang, Hong Shen:
Deep Reinforcement Learning Enhanced Greedy Optimization for Online Scheduling of Batched Tasks in Cloud HPC Systems. 3003-3014 - Zaifeng Pan, Feng Zhang, Hourun Li, Chenyang Zhang, Xiaoyong Du, Dong Deng:
G-SLIDE: A GPU-Based Sub-Linear Deep Learning Engine via LSH Sparsification. 3015-3027 - Nabil Abubaker, M. Ozan Karsavuran, Cevdet Aykanat:
Scalable Unsupervised ML: Latency Hiding in Distributed Sparse Tensor Decomposition. 3028-3040 - Dian-Lun Lin, Tsung-Wei Huang:
Accelerating Large Sparse Neural Network Inference Using GPU Task Graph Parallelism. 3041-3052 - Zhaorui Zhang, Cho-Li Wang:
MIPD: An Adaptive Gradient Sparsification Framework for Distributed DNNs Training. 3053-3066 - Jianqiang Huang, Haojie Wang, Xiang Fei, Xiaoying Wang, Wenguang Chen:
$TC-Stream$TC-Stream: Large-Scale Graph Triangle Counting on a Single Machine Using GPUs. 3067-3078 - Yunlong Mao, Wenbo Hong, Boyu Zhu, Zhifei Zhu, Yuan Zhang, Sheng Zhong:
Secure Deep Neural Network Models Publishing Against Membership Inference Attacks Via Training Task Parallelism. 3079-3091 - Amro Alabsi Aljundi, Taha Atahan Akyildiz, Kamer Kaya:
Boosting Graph Embedding on a Single GPU. 3092-3105 - Jingwei Sun, Tao Yan, Hao Sun, Huancheng Lin, Guangzhong Sun:
Lossy Compression of Communication Traces Using Recurrent Neural Networks. 3106-3116 - Jiamin Chen, Jianliang Gao, Yibo Chen, Babatounde Moctard Oloulade, Tengfei Lyu, Zhao Li:
Auto-GNAS: A Parallel Graph Neural Architecture Search Framework. 3117-3128 - Kun Li, Liang Yuan, Yunquan Zhang, Gongwei Chen:
An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering. 3129-3140 - Zexi Chen, Ting Wang, Haibin Cai, Subrota Kumar Mondal, Jyoti Prakash Sahoo:
BLB-gcForest: A High-Performance Distributed Deep Forest With Adaptive Sub-Forest Splitting. 3141-3152 - Shaoduo Gan, Akhil Mathur, Anton Isopoussu, Fahim Kawsar, Nadia Berthouze, Nicholas D. Lane:
FRuDA: Framework for Distributed Adversarial Domain Adaptation. 3153-3164 - Danyang Xiao, Chengang Yang, Weigang Wu:
Mixing Activations and Labels in Distributed Training for Split Learning. 3165-3177 - Cheng Gong, Ye Lu, Kunpeng Xie, Zongming Jin, Tao Li, Yanzhi Wang:
Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks. 3178-3193 - Pyeongsu Park, Jaewoon Lee, Heetaek Jeong, Jangwoo Kim:
DLS: A Fast and Flexible Neural Network Training System With Fine-grained Heterogeneous Device Orchestration. 3194-3206 - Daniel Gerlinghoff, Zhehui Wang, Xiaozhe Gu, Rick Siow Mong Goh, Tao Luo:
E3NE: An End-to-End Framework for Accelerating Spiking Neural Networks With Emerging Neural Encoding on FPGAs. 3207-3219 - Size Zheng, Renze Chen, Yicheng Jin, Anjiang Wei, Bingyang Wu, Xiuhong Li, Shengen Yan, Yun Liang:
NeoFlow: A Flexible Framework for Enabling Efficient Compilation for High Performance DNN Training. 3220-3232 - Yuxuan Zhou, Wanzhong Chen, Linlin Li, Linlin Gong, Tao Zhang:
Heterogeneous Multi-Agent System for Brain-Computer Interaction in Routing and Forwarding With Memristive Neuron Networks. 3233-3248 - Maolin Wang, Seyedramin Rasoulinezhad, Philip H. W. Leong, Hayden Kwok-Hay So:
NITI: Training Integer Neural Networks Using Integer-Only Arithmetic. 3249-3261
Volume 33, Number 12, December 2022
- Zoltán Ádám Mann:
Decentralized Application Placement in Fog Computing. 3262-3273 - Junchang Wang, Dunwei Liu, Xiong Fu, Fu Xiao, Chen Tian:
DHash: Dynamic Hash Tables With Non-Blocking Regular Operations. 3274-3290 - Zihao Zhou, Yanan Li, Xuebin Ren, Shusen Yang:
Towards Efficient and Stable K-Asynchronous Federated Learning With Unbounded Stale Gradients on Non-IID Data. 3291-3305 - Murugaraj Odiathevar, Winston K. G. Seah, Marcus Frean:
A Bayesian Approach To Distributed Anomaly Detection In Edge AI Networks. 3306-3320 - Nimish Shah, Wannes Meert, Marian Verhelst:
GraphOpt: Constrained-Optimization-Based Parallelization of Irregular Graphs. 3321-3332 - Hai Zhou, Dan Feng, Yuchong Hu:
Bandwidth-Aware Scheduling Repair Techniques in Erasure-Coded Clusters: Design and Analysis. 3333-3348 - Jianshu Liu, Shungeng Zhang, Qingyang Wang, Jinpeng Wei:
Coordinating Fast Concurrency Adapting With Autoscaling for SLO-Oriented Web Applications. 3349-3362 - Pedro J. Martínez-Ferrer, A. N. Yzelman, Vicenç Beltran:
A Native Tensor-Vector Multiplication Algorithm for High Performance Computing. 3363-3374 - Chenlei Tang, Jiguang Wan, Changsheng Xie:
FenceKV: Enabling Efficient Range Query for Key-Value Separation. 3375-3386 - Hongxiang Fan, Martin Ferianc, Zhiqiang Que, Xinyu Niu, Miguel R. D. Rodrigues, Wayne Luk:
Accelerating Bayesian Neural Networks via Algorithmic and Hardware Optimizations. 3387-3399 - Xiaolu Li, Keyun Cheng, Zhirong Shen, Patrick P. C. Lee:
Fast Proactive Repair in Erasure-Coded Storage: Analysis, Design, and Implementation. 3400-3414 - Sina Darabi, Ehsan Yousefzadeh-Asl-Miandoab, Negar Akbarzadeh, Hajar Falahati, Pejman Lotfi-Kamran, Mohammad Sadrosadati, Hamid Sarbazi-Azad:
OSM: Off-Chip Shared Memory for GPUs. 3415-3429 - Yu Zhan, Liguo Zhou, Baocang Wang, Pu Duan, Benyu Zhang:
Efficient Function Queryable and Privacy Preserving Data Aggregation Scheme in Smart Grid. 3430-3441 - Miao Zhang, Yong Peng, Jiancheng Zhu, Quanjun Yin:
Efficient Flow-Based Scheduling for Geo-Distributed Simulation Tasks in Collaborative Edge and Cloud Environments. 3442-3459 - Shuo Wang, Surya Nepal, Kristen Moore, Marthie Grobler, Carsten Rudolph, Alsharif Abuadbba:
OCTOPUS: Overcoming Performance and Privatization Bottlenecks in Distributed Learning. 3460-3477 - Sathyanarayanan Srinivasan, Ramesh Kandukoori:
Solving Consensus in True Partial Synchrony. 3478-3490 - Robert Underwood, Jon C. Calhoun, Sheng Di, Amy W. Apon, Franck Cappello:
OptZConfig: Efficient Parallel Optimization of Lossy Compression Configuration. 3505-3519 - Nicolas Blin, Edwin Carlinet, Florian Lemaitre, Lionel Lacassagne, Thierry Géraud:
Max-Tree Computation on GPUs. 3520-3531 - Rory Hector, Ramachandran Vaidyanathan, Gokarna Sharma, Jerry L. Trahan:
Optimal Convex Hull Formation on a Grid by Asynchronous Robots With Lights. 3532-3545 - Hemanta Sapkota, Engin Arslan:
Reliable Wide-Area Data Transfers for Streaming Workflows. 3546-3557 - Jidong Zhai, Liyan Zheng, Jinghan Sun, Feng Zhang, Xiongchao Tang, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen, Weimin Zheng:
Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems. 3558-3574 - J. C. S. Kadupitiya, Vikram Jadhao, Prateek Sharma:
SciSpot: Scientific Computing On Temporally Constrained Cloud Preemptible VMs. 3575-3588 - Hatem Elshazly, Jorge Ejarque, Rosa M. Badia:
Storage-Heterogeneity Aware Task-based Programming Models to Optimize I/O Intensive Applications. 3589-3599 - Jiuchuan Jiang, Kai Di, Bo An, Yichuan Jiang, Zhan Bu, Jie Cao:
Batch Crowdsourcing for Complex Tasks Based on Distributed Team Formation in E-Markets. 3600-3615 - J. Gregory Pauloski, Lei Huang, Weijia Xu, Kyle Chard, Ian T. Foster, Zhao Zhang:
Deep Neural Network Training With Distributed K-FAC. 3616-3627 - Jiaqing Dong, Lijuan Tan, Chen Tian, Yuhang Zhou, Yi Wang, Wanchun Dou, Guihai Chen:
Meet: Rack-Level Pooling Based Load Balancing in Datacenter Networks. 3628-3639 - Rafal Krawczyk, Tommaso Colombo, Niko Neufeld, Flavio Pisani, Sébastien Valat:
Ethernet for High-Throughput Computing at CERN. 3640-3650 - Michael P. Lingg, Stephen M. Hughey, Balasubramaniam Shanker, Hasan Metin Aktulga:
High Performance Evaluation of Helmholtz Potentials Using the Multi-Level Fast Multipole Algorithm. 3651-3666 - Shengguo Li, Hao Jiang, Dezun Dong, Chun Huang, Jie Liu, Xia Liao, Xuguang Chen:
Efficient Data Redistribution Algorithms From Irregular to Block Cyclic Data Distribution. 3667-3677 - Hua Kang, Zhiyang Li, Qian Zhang:
Communicational and Computational Efficient Federated Domain Adaptation. 3678-3689 - Xu Ma, Xiaoqian Sun, Yuduo Wu, Zheli Liu, Xiaofeng Chen, Changyu Dong:
Differentially Private Byzantine-Robust Federated Learning. 3690-3701 - Huizhang Luo, Junqi Wang, Qing Liu, Jieyang Chen, Scott Klasky, Norbert Podhorszki:
zMesh: Theories and Methods to Exploring Application Characteristics to Improve Lossy Compression Ratio for Adaptive Mesh Refinement. 3702-3717 - Jianhua Gao, Weixing Ji, Zhaonian Tan, Yizhuo Wang, Feng Shi:
TaiChi: A Hybrid Compression Format for Binary Sparse Matrix-Vector Multiplication on GPU. 3732-3745 - Guangqiang Luan, Pu Pang, Quan Chen, Shuai Xue, Zhuo Song, Minyi Guo:
Online Thread Auto-Tuning for Performance Improvement and Resource Saving. 3746-3759 - Lu Zhao, Bo Li, Wenan Tan, Guangming Cui, Qiang He, Xiaolong Xu, Lida Xu, Yun Yang:
Joint Coverage-Reliability for Budgeted Edge Application Deployment in Mobile Edge Computing Environment. 3760-3771 - Tian Wang, Junlong Zhou, Liying Li, Gongxuan Zhang, Keqin Li, Xiaobo Sharon Hu:
Deadline and Reliability Aware Multiserver Configuration Optimization for Maximizing Profit. 3772-3786 - Yamin Lei, Zhicheng Cai, Xiaoping Li, Rajkumar Buyya:
State Space Model and Queuing Network Based Cloud Resource Provisioning for Meshed Web Systems. 3787-3799 - Vasilios I. Kelefouras, Georgios Keramidas:
Design and Implementation of 2D Convolution on x86/x64 Processors. 3800-3815 - Ruitao Xie, Junhong Fang, Junmei Yao, Kai Liu, Xiaohua Jia, Kaishun Wu:
QoS-Aware Scheduling of Remote Rendering for Interactive Multimedia Applications in Edge Computing. 3816-3832 - Jananie Jarachanthan, Li Chen, Fei Xu, Bo Li:
Astrea: Auto-Serverless Analytics Towards Cost-Efficiency and QoS-Awareness. 3833-3849 - Hao Dai, Yang Wang, Kenneth B. Kent, Lingfang Zeng, Cheng-Zhong Xu:
The State of the Art of Metadata Managements in Large-Scale Distributed File Systems - Scalability, Performance and Availability. 3850-3869 - Matthieu Nicolas, Gérald Oster, Olivier Perrin:
Efficient Renaming in Sequence CRDTs. 3870-3885 - Haiying Shen, Liuhua Chen:
A Resource-Efficient Predictive Resource Provisioning System in Cloud Systems. 3886-3900 - Shutian Luo, Huanle Xu, Chengzhi Lu, Kejiang Ye, Guoyao Xu, Liping Zhang, Jian He, Cheng-Zhong Xu:
An In-Depth Study of Microservice Call Graph and Runtime Performance. 3901-3914 - Tao Li, Yaozheng Fang, Ye Lu, Jinni Yang, Zhaolong Jian, Zhiguo Wan, Yusen Li:
SmartVM: A Smart Contract Virtual Machine for Fast On-Chain DNN Computations. 4100-4116 - Rui Li, Zhi Zhou, Xiaoxi Zhang, Xu Chen:
Joint Application Placement and Request Routing Optimization for Dynamic Edge Computing Service Management. 4581-4596 - Bo Yi, Xingwei Wang, Min Huang, Sajal K. Das, Keqin Li:
Fairness-Aware VNF Sharing and Rate Coordination for High Efficient Service Scheduling. 4597-4611 - Zhao-Wei Qiu, Kun-Sheng Liu, Ya-Shu Chen:
BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription. 4612-4624 - A-Long Jin, Wenchao Xu, Song Guo, Bing Hu, Kwan Lawrence Yeung:
PS+: A Simple yet Effective Framework for Fast Training on Parameter Server. 4625-4637 - Qinlong Huang, Lixuan Chen, Chao Wang:
A Parallel Secure Flow Control Framework for Private Data Sharing in Mobile Edge Cloud. 4638-4653 - Huaqing Tu, Gongming Zhao, Hongli Xu, Xianjin Fang:
Tenant-Grained Request Scheduling in Software-Defined Cloud Computing. 4654-4671 - Gokul Madathupalyam Chinnappan, Bharadwaj Veeravalli:
Theoretical Analysis of an Adaptive Periodic Multi Installment Scheduling With Result Retrieval for SAR Image Processing. 4672-4683 - Xiaoqing Liu, Shuming Zhou, Sun-Yuan Hsieh, Hong Zhang:
Robustness of Subsystem Reliability of $k$k-Ary $n$n-Cube Networks Under Probabilistic Fault Model. 4684-4693 - Xiaodong Yi, Shiwei Zhang, Lansong Diao, Chuan Wu, Zhen Zheng, Shiqing Fan, Siyu Wang, Jun Yang, Wei Lin:
Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion. 4694-4706 - Kexin Liu, Chen Tian, Qingyue Wang, Yanqing Chen, Bingchuan Tian, Wenhao Sun, Ke Meng, Long Yan, Lei Han, Jie Fu, Wanchun Dou, Guihai Chen:
PayDebt: Reduce Buffer Occupancy Under Bursty Traffic on Large Clusters. 4707-4722 - Jiaquan Gao, Xinyue Chu, Xiaotong Wu, Jun Wang, Guixia He:
Parallel Dynamic Sparse Approximate Inverse Preconditioning Algorithm on GPU. 4723-4737 - Zhijun Ding, Binbin Feng, Changjun Jiang:
COIN: A Container Workload Prediction Model Focusing on Common and Individual Changes in Workloads. 4738-4751 - Xin Chen, Yingxiang Gao, Honghui Shang, Fang Li, Zhiqian Xu, Xin Liu, Dexun Chen:
Increasing the Efficiency of Massively Parallel Sparse Matrix-Matrix Multiplication in First-Principles Calculation on the New-Generation Sunway Supercomputer. 4752-4766 - Dewen Qiao, Songtao Guo, Defang Liu, Saiqin Long, Pengzhan Zhou, Zhetao Li:
Adaptive Federated Deep Reinforcement Learning for Proactive Content Caching in Edge Computing. 4767-4782 - Chunjiang Che, Xiaoli Li, Chuan Chen, Xiaoyu He, Zibin Zheng:
A Decentralized Federated Learning Framework via Committee Mechanism With Convergence Guarantee. 4783-4800 - Libin Liu, Hong Xu, Zhixiong Niu, Jingzong Li, Wei Zhang, Peng Wang, Jiamin Li, Chun Jason Xue, Cong Wang:
ScaleFlux: Efficient Stateful Scaling in NFV. 4801-4817 - Jianyong Zhu, Renyu Yang, Xiaoyang Sun, Tianyu Wo, Chunming Hu, Hao Peng, Junqing Xiao, Albert Y. Zomaya, Jie Xu:
QoS-Aware Co-Scheduling for Distributed Long-Running Applications on Shared Clusters. 4818-4834 - Chang Xu, Yu Jia, Liehuang Zhu, Chuan Zhang, Guoxie Jin, Kashif Sharif:
TDFL: Truth Discovery Based Byzantine Robust Federated Learning. 4835-4848 - Chao Fu, Li Wan, Jun Han:
LosaTM: A Hardware Transactional Memory Integrated With a Low-Overhead Scenario-Awareness Conflict Manager. 4849-4862 - Zhengjie Yang, Wei Bao, Dong Yuan, Nguyen Hoang Tran, Albert Y. Zomaya:
Federated Learning With Nesterov Accelerated Gradient. 4863-4873 - Xuebing Li, Yang Chen, Mengying Zhou, Tiancheng Guo, Chenhao Wang, Yu Xiao, Junjie Wan, Xin Wang:
Artemis: A Latency-Oriented Naming and Routing System. 4874-4890 - Shaoxian Xu, Zhiyuan Shao, Ci Yang, Xiaofei Liao, Hai Jin:
Accelerating Backward Aggregation in GCN Training With Execution Path Preparing on GPUs. 4891-4902 - Yuping Fan, Boyang Li, Dustin Favorite, Naunidh Singh, John Taylor Childers, Paul Rich, William E. Allcock, Michael E. Papka, Zhiling Lan:
DRAS: Deep Reinforcement Learning for Cluster Scheduling in High Performance Computing. 4903-4917 - Penglai Cui, Heng Pan, Zhenyu Li, Penghao Zhang, Tianhao Miao, Jianer Zhou, Hongtao Guan, Gaogang Xie:
Enabling In-Network Floating-Point Arithmetic for Efficient Computation Offloading. 4918-4934 - Xuedong Zhang, Zhuo Tang, Xiantao Zhang, Kenli Li:
Co-Concurrency Mechanism for Multi-GPUs in Distributed Heterogeneous Environments. 4935-4947 - Zhuozhao Li, Ryan Chard, Yadu N. Babuji, Ben Galewsky, Tyler J. Skluzacek, Kirill Nagaitsev, Anna Woodard, Ben Blaiszik, Josh Bryan, Daniel S. Katz, Ian T. Foster, Kyle Chard:
$f$funcX: Federated Function as a Service for Science. 4948-4963 - Xiang Fu, Huaimin Wang, Peichang Shi:
Votes-as-a-Proof (VaaP): Permissioned Blockchain Consensus Protocol Made Simple. 4964-4973 - Andrey A. Chusov:
Outperforming Sequential Full-Word Long Addition With Parallelization and Vectorization. 4974-4985
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.