27th Euro-Par 2021: Lisbon, Portugal
- Leonel Sousa, Nuno Roma, Pedro Tomás:
Euro-Par 2021: Parallel Processing - 27th International Conference on Parallel and Distributed Computing, Lisbon, Portugal, September 1-3, 2021, Proceedings. Lecture Notes in Computer Science 12820, Springer 2021, ISBN 978-3-030-85664-9
Compilers, Tools and Environments
- Daniel Maier, Biagio Cosenza, Ben H. H. Juurlink:
ALONA: Automatic Loop Nest Approximation with Reconstruction and Space Pruning. 3-18
- Peter Arzt, Yannic Fischler, Jan-Patrick Lehr, Christian H. Bischof:
Automatic Low-Overhead Load-Imbalance Detection in MPI Applications. 19-34
Performance and Power Modeling, Prediction and Evaluation
- Yannis Sfakianakis, Eleni Kanellou, Manolis Marazakis, Angelos Bilas:
Trace-Based Workload Generation and Execution. 37-54
- Anne Benoit, Louis-Claude Canon, Redouane Elghazi, Pierre-Cyrille Héam:
Update on the Asymptotic Optimality of LPT. 55-69
- Burak Aksar, Benjamin Schwaller, Omar Aaziz, Vitus J. Leung, Jim M. Brandt, Manuel Egele, Ayse K. Coskun:
E2EWatch: An End-to-End Anomaly Diagnosis Framework for Production HPC Systems. 70-85
Scheduling and Load Balancing
- Zhuoran Ji, Cho-Li Wang:
Collaborative GPU Preemption via Spatial Multitasking for Efficient GPU Sharing. 89-104
- Ning Tang, Alix Munier Kordon:
A Fixed-Parameter Algorithm for Scheduling Unit Dependent Tasks with Unit Communication Delays. 105-119
- Jan Kopanski, Krzysztof Rzadca:
Plan-Based Job Scheduling for Supercomputers with Shared Burst Buffers. 120-135
- Sonia Ben Mokhtar, Louis-Claude Canon, Anthony Dugois, Loris Marchal, Etienne Rivière:
Taming Tail Latency in Key-Value Stores: A Scheduling Perspective. 136-150
- Adrian Naruszko, Bartlomiej Przybylski, Krzysztof Rzadca:
A Log-Linear (2 +5/6)-Approximation Algorithm for Parallel Machine Scheduling with a Single Orthogonal Resource. 151-166
- Maria Predari, Charilaos Tzovas, Christian Schulz, Henning Meyerhenke:
An MPI-based Algorithm for Mapping Complex Networks onto Hierarchical Architectures. 167-182
- Olivier Beaumont, Lionel Eyraud-Dubois, Alena Shilova:
Pipelined Model Parallelism: Complexity Results and Memory Considerations. 183-198
Data Management, Analytics and Machine Learning
- Haoran Wang, Chong Li, Thibaut Tachon, Hongxing Wang, Sheng Yang, Sébastien Limet, Sophie Robert:
Efficient and Systematic Partitioning of Large and Deep Neural Networks for Parallelization. 201-216
- Kyusik Choi, Hoeseok Yang:
A GPU Architecture Aware Fine-Grain Pruning Technique for Deep Neural Networks. 217-231
- Zhongyi Lin, Evangelos Georganas, John D. Owens:
Towards Flexible and Compiler-Friendly Layer Fusion for CNNs on Multicore CPUs. 232-248
- Tiago Lopes, Miguel E. Coimbra, Luís Veiga:
Smart Distributed DataSets for Stream Processing. 249-265
Cluster, Cloud and Edge Computing
- Francesc Lordan, Daniele Lezzi, Rosa M. Badia:
Colony: Parallel Functions as a Service on the Cloud-Edge Continuum. 269-284
- David Delande, Patricia Stolf, Raphaël Féraud, Jean-Marc Pierson, André Bottaro:
Horizontal Scaling in Cloud Using Contextual Bandits. 285-300
- Ronan-Alexandre Cherrueau, Marie Delavergne, Adrien Lèbre:
Geo-distribute Cloud Applications at the Edge. 301-316
- Rafaela C. Brum, Walisson P. Sousa, Alba C. M. A. Melo, Cristiana Bentes, Maria Clicia Stelling de Castro, Lúcia Maria de A. Drummond:
A Fault Tolerant and Deadline Constrained Sequence Alignment Application on Cloud-Based Spot GPU Instances. 317-333
- Sophie Cerf, Raphaël Bleuse, Valentin Reis, Swann Perarnau, Éric Rutten:
Sustaining Performance While Reducing Energy Consumption: A Control Theory Approach. 334-349
Theory and Algorithms for Parallel and Distributed Processing
- Rezaul Chowdhury, Francesco Silvestri, Flavio Vella:
Algorithm Design for Tensor Units. 353-367
- Jeremy Buhler, Thomas Lavastida, Kefu Lu, Benjamin Moseley:
A Scalable Approximation Algorithm for Weighted Longest Common Subsequence. 368-384
- Adones Rukundo, Philippas Tsigas:
TSLQueue: An Efficient Lock-Free Design for Priority Queues. 385-401
- Bryan Rowe, Rajiv Gupta:
G-Morph: Induced Subgraph Isomorphism Search of Labeled Graphs on a GPU. 402-417
Parallel and Distributed Programming, Interfaces, and Languages
- Catalina Munoz Morales, Rafael Murari, Joao P. L. de Carvalho, Bruno Chinelato Honorio, Alexandro Baldassin, Guido Araujo:
Accelerating Graph Applications Using Phased Transactional Memory. 421-434
- Dian-Lun Lin, Tsung-Wei Huang:
Efficient GPU Computation Using Task Graph Parallelism. 435-450
- Nicolas M. Morales, Keita Teranishi, Bogdan Nicolae, Christian Trott, Franck Cappello:
Towards High Performance Resilience Using Performance Portable Abstractions. 451-465
- Thomas Dionisi, Stéphane Bouhrour, Julien Jaeger, Patrick Carribault, Marc Pérache:
Enhancing Load-Balancing of MPI Applications with Workshare. 466-481
- Nicolas L. Guidotti, Pedro Ceyrat, João Barreto, José Monteiro, Rodrigo Rodrigues, Ricardo Fonseca, Xavier Martorell, Antonio J. Peña:
Particle-In-Cell Simulation Using Asynchronous Tasking. 482-498
Multicore and Manycore Parallelism
- Raúl Nozal, José Luis Bosque:
Exploiting Co-execution with OneAPI: Heterogeneity from a Modern Perspective. 501-516
Parallel Numerical Methods and Applications
- Yuankun Fu, Fengguang Song:
Designing a 3D Parallel Memory-Aware Lattice Boltzmann Algorithm on Manycore Systems. 519-535
- Camille Coti, Laure Petrucci, Daniel Alberto Torres González:
Fault-Tolerant LU Factorization Is Low Cost. 536-549
- Fritz Göbel, Thomas Grützmacher, Tobias Ribizel, Hartwig Anzt:
Mixed Precision Incomplete and Factorized Sparse Approximate Inverse Preconditioning on GPUs. 550-564
- Yuxi Hong, El Houcine Bergou, Nicolas Doucet, Hao Zhang, Jesse Cranney, Hatem Ltaief, Damien Gratadour, François Rigaut, David E. Keyes:
Outsmarting the Atmospheric Turbulence for Ground-Based Telescopes Using the Stochastic Levenberg-Marquardt Method. 565-579
- Adam Smelko, Miroslav Kratochvíl, Martin Krulis, Tomás Sieger:
GPU-Accelerated Mahalanobis-Average Hierarchical Clustering Analysis. 580-595
High Performance Architectures and Accelerators
- Vladimir Dimic, Miquel Moretó, Marc Casas, Mateo Valero:
PrioRAT: Criticality-Driven Prioritization Inside the On-Chip Memory Hierarchy. 599-615
- Alberto Zeni, Kenneth O'Brien, Michaela Blott, Marco D. Santambrogio:
Optimized Implementation of the HPCG Benchmark on Reconfigurable Hardware. 616-630