default search action
IPDPS 2015: Hyderabad, India - Workshops
- 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, IPDPS 2015, Hyderabad, India, May 25-29, 2015. IEEE Computer Society 2015, ISBN 978-1-4673-7684-6
Workshop 1: HCW - Heterogeneity in Computing Workshop
- Shoukat Ali, Denis Trystram:
HCW Introduction. 1-2 - Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 3 - Denis Trystram:
Message from the HCW Program Committee Chair. 4 - Andrew S. Grimshaw:
HCW 2014 Keynote Talk. 5
Session 1: Scheduling and Load Balancing
- Nathanael Cheriere, Erik Saule:
Considerations on Distributed Load Balancing for Fully Heterogeneous Machines: Two Particular Cases. 6-16 - Tarun Beri, Sorav Bansal, Subodh Kumar:
ProSteal: A Proactive Work Stealer for Bulk Synchronous Tasks Distributed on a Cluster of Heterogeneous Machines with Multiple Accelerators. 17-26 - Safia Kedad-Sidhoum, Florence Monna, Denis Trystram:
Scheduling Tasks with Precedence Constraints on Hybrid Multi-core Machines. 27-33
Session 2: Applications
- Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, Julien Herrmann, Suraj Kumar, Loris Marchal, Samuel Thibault:
Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms. 34-45 - Md. Tarikul Islam, Hien Nguyen, Jaspal Subhlok, Edgar Gabriel:
Efficient Message Logging to Support Process Replicas in a Volunteer Computing Environment. 46-56 - Subhash Saini, Haoqiang Jin, Dennis C. Jespersen, Samson Cheung, M. Jahed Djomehri, Johnny Chang, Robert Hood:
Early Multi-node Performance Evaluation of a Knights Corner (KNC) Based NASA Supercomputer. 57-67
Workshop 2: RAW - Reconfigurable Architectures Workshop
- Jürgen Becker, Ken Eguro, Diana Göhringer, Wayne Luk, Marco D. Santambrogio, Ramachandran Vaidyanathan, Steven J. E. Wilton:
RAW Introduction and Committees. 68-69 - Viktor K. Prasanna:
RAW 2015 Keynote. 70
Session 1 - Runtime and Tools for Partially Reconfigurable FPGA-Based Systems
- Tian Xia, Jean-Christophe Prévotet, Fabienne Nouvel:
Mini-NOVA: A Lightweight ARM-based Virtualization Microkernel Supporting Dynamic Partial Reconfiguration. 71-80 - Berend H. J. Dekens, Marco Jan Gerrit Bekooij, Gerard J. M. Smit:
Real-Time Multiprocessor Architecture for Sharing Stream Processing Accelerators. 81-89 - Aurelio Morales-Villanueva, Ann Gordon-Ross:
Partial Region and Bitstream Cost Models for Hardware Multitasking on Partially Reconfigurable FPGAs. 90-96 - Marco Rabozzi, Riccardo Cattaneo, Tobias Becker, Wayne Luk, Marco D. Santambrogio:
Relocation-Aware Floorplanning for Partially-Reconfigurable FPGA-Based Systems. 97-104
Session 2 - Applications and Special Purpose Architectures with Reconfigurable Hardware
- Da Tong, Shijie Zhou, Viktor K. Prasanna:
High-Throughput Online Hash Table on FPGA. 105-112 - Nachiket Kapre, Han Jianglei, Andrew Bean, Pradeep Moorthy, Siddhartha:
GraphMMU: Memory Management Unit for Sparse Graph Accelerators. 113-120 - Omer Arap, Martin Swany, Geoffrey Brown, Bryce Himebaugh:
Adaptive Recursive Doubling Algorithm for Collective Communication. 121-128 - Shijie Zhou, Charalampos Chelmis, Viktor K. Prasanna:
Accelerating Large-Scale Single-Source Shortest Path on FPGA. 129-136
Session 3 - New Architectures and Performance Evaluation for Reconfigurable Computing
- Nicklas Bo Jensen, Pascal Schleuniger, Andreas Erik Hindborg, Maxwell Walter, Sven Karlsson:
Experiences with Compiler Support for Processors with Exposed Pipelines. 137-143 - Arash Ashrafi, Ramachandran Vaidyanathan:
An Architecture for Configuring an Effcient Scan Path for a Subset of Elements. 144-153 - Shreyas G. Singapura, Anand V. Panangadan, Viktor K. Prasanna:
Performance Modeling of Matrix Multiplication on 3D Memory Integrated FPGA. 154-162 - Lim Hui Hui, Nachiket Kapre:
Enhancing Speedups for FPGA Accelerated SPICE through Frequency Scaling and Precision Reduction. 163-169
Short Papers
- Rohit Kumar, Ann Gordon-Ross:
An Automated High-Level Design Framework for Partially Reconfigurable FPGAs. 170-175 - Marc-André Daigneault, Jean-Pierre David:
Intermediate-Level Synthesis of a Gauss-Jordan Elimination Linear Solver. 176-181 - Riccardo Cattaneo, Mahdi Badie Moradmand, Donatella Sciuto, Marco D. Santambrogio:
K-Ways Partitioning of Polyhedral Process Networks: A Multi-level Approach. 182-189 - Christian Herglotz, Jürgen Seiler, André Kaup, Arne Hendricks, Marc Reichenbach, Dietmar Fey:
Estimation of Non-functional Properties for Embedded Hardware with Application to Image Processing. 190-195 - Kartik V. Hegde, Vadiraj Kulkarni, R. Harshavardhan, David S. Sumam:
Adaptive Reconfigurable Architecture for Image Denoising. 196-201
Workshop 3: HIPS-Workshop on High-Level Parallel Programming Models and Supportive Environments and LSPP-Workshop on Large-Scale Parallel Processing
- Sriram Krishnamoorthy, Tobias Hilbrich, Darren J. Kerbyson, Ramakrishnan Rajamony, Charles C. Weems:
HIPS-LSPP Introduction and Committees. 202-203 - Torsten Hoefler, Laxmikant V. Kalé:
HIPS-LSPP Keynotes. 204
Session I: Performance Analysis and Optimization
- Matthias Weber, Ronald Geisler, Holger Brunst, Wolfgang E. Nagel:
Folding Methods for Event Timelines in Performance Analysis. 205-214 - Tim Cramer, Robert Dietrich, Christian Terboven, Matthias S. Müller, Wolfgang E. Nagel:
Performance Analysis for Target Devices with the OpenMP Tools Interface. 215-224 - Jian Lin, Khaled Hamidouche, Xiaoyi Lu, Mingzhe Li, Dhabaleswar K. Panda:
High-Performance Coarray Fortran Support with MVAPICH2-X: Initial Experience and Evaluation. 225-234 - Sourav Chakraborty, Hari Subramoni, Jonathan L. Perkins, Ammar Ahmad Awan, Dhabaleswar K. Panda:
On-demand Connection Management for OpenSHMEM and OpenSHMEM+MPI. 235-244
Session II: Parallelization
- Aravind Sukumaran-Rajam, Luis Esteban Campostrini, Juan Manuel Martinez Caamaño, Philippe Clauss:
Speculative Runtime Parallelization of Loop Nests: Towards Greater Scope and Efficiency. 245-254
Session III: Application-Specific Studies
- Daniel G. Chavarría-Miranda, Mahantesh Halappanavar, Sriram Krishnamoorthy, Joseph B. Manzano, Abhinav Vishnu, Adolfy Hoisie:
On the Impact of Execution Models: A Case Study in Computational Chemistry. 255-264 - Nishant Saurabh, Ana Lucia Varbanescu, Gyan Ranjan:
Computing the Pseudo-Inverse of a Graph's Laplacian Using GPUs. 265-274
Workshop 4: NIDISC - Workshop on Nature Inspired Distributed Computing
- Pascal Bouvry, Grégoire Danoy, Franciszek Seredynski, El-Ghazali Talbi, Albert Y. Zomaya:
NIDISC Introduction and Committees. 275
Session 1: Applications of Bio-Inspired Algorithms
- Jakub Gasior, Franciszek Seredynski:
Dynamic Job Scheduling in the Cloud Using Slowdown Optimization and Sandpile Cellular Automata Model. 276-285 - Francois Legillon, Nouredine Melab, Didier Renard, El-Ghazali Talbi:
A Multi-objective Evolutionary Algorithm for Cloud Platform Reconfiguration. 286-291 - Raed Alkharboush, Robson Eduardo De Grande, Azzedine Boukerche:
A Genetic Algorithm Approach for Adjusting Time Series Based Load Prediction. 292-298
Session 2: Parallel, Distributed, and Adaptive Algorithms
- Omar Andrés Carmona Cortes, Mônica Sakuray Pais, Filipo Novo Mor, Andrew Rau-Chaplin, César Augusto Missio Marcon:
Differential Evolution on a GPGPU: The Influence of Parameters on Speedup and the Quality of Solutions. 299-306 - Jakub Muszynski, Sébastien Varrette, Bernabé Dorronsoro Díaz, Pascal Bouvry:
Distributed Cellular Evolutionary Algorithms in a Byzantine Environment. 307-313 - Amir Nakib, Bernard Thibault, Patrick Siarry:
Bayesian Based Metaheuristic for Large Scale Continuous Optimization. 314-322 - Ajay Pratap, Rajiv Misra:
Firefly Inspired Improved Distributed Proximity Algorithm for D2D Communication. 323-328
Workshop 5: HiCOMB - Workshop on High Performance Computational Biology
- Sanguthevar Rajasekaran, Srinivas Aluru, David A. Bader:
HiCOMB Introduction and Committees. 329-330 - Ramesh Hariharan, Ananth Kalyanaraman, Michela Taufer, Trilce Estrada, Pietro Cicotti, Pavan Balaji:
HiCOMB 2015 Keynote and Invited Talks. 331
HiCOMB Session 1
- Tuan Tu Tran, Mathieu Giraud, Jean-Stéphane Varré:
Perfect Hashing Structures for Parallel Similarity Searches. 332-341 - Basavaraj Talawar:
A Crossbar Interconnection Network in DNA. 342-345 - Denis Trystram:
Handling Heterogeneity for Efficient Implementations: A Case Study on Sequence Comparison. 346-349 - G. M. Siddesh, K. G. Srinivasa, Ishank Mishra, Abhinav Anurag, Eklavya Uppal:
Phylogenetic Analysis Using MapReduce Programming Model. 350-356
HiCOMB Session 2
- Wajeeta Lohana, Jawwad A. Shamsi, Tahir Q. Syed, Farrukh Hasan:
Towards Context-Aware DNA Sequence Compression for Efficient Data Exchange. 357-366
HiCOMB Session 3
- Solon P. Pissis, Ahmad Retha:
Generalised Implementation for Fixed-Length Approximate String Matching under Hamming Distance and Applications. 367-374 - Hanyu Jiang, Narayan Ganesan:
Fine-Grained Acceleration of HMMER 3.0 via Architecture-Aware Optimization on Massively Parallel Processors. 375-383
Workshop 6: APDCM - Advances in Parallel and Distributed Computing Models
- Oscar H. Ibarra, Koji Nakano, Akihiro Fujiwara, Susumu Matsumae:
APDCM Introduction and Committees. 384
Session 1: Parallel Algorithms and Applications
- Toru Fujita, Koji Nakano, Yasuaki Ito:
Bulk GCD Computation Using a GPU to Break Weak RSA Keys. 385-394 - Meher Chaitanya, Kishore Kothapalli:
A Simple Parallel Algorithm for Biconnected Components in Sparse Graphs. 395-404 - Marc Aurel Kiefer, Korbinian Molitorisz, Jochen Bieler, Walter F. Tichy:
Parallelizing a Real-Time Audio Application - A Case Study in Multithreaded Software Engineering. 405-414 - Ajay Kattepur, Manoj Nambiar:
Performance Modeling of Multi-tiered Web Applications with Varying Service Demands. 415-424
Session 2: Parallel Computing Systems
- Abhishek Bansal, Sambhav Gupta, Turbo Majumder:
Efficient Estimation of Non-stationary Traffic Parameters on Networks-on-Chip. 425-433 - Daniel Dauwe, Eric Jonardi, Ryan D. Friese, Sudeep Pasricha, Anthony A. Maciejewski, David A. Bader, Howard Jay Siegel:
A Methodology for Co-Location Aware Application Performance Modeling in Multicore Computing. 434-443 - Shounak Chakraborty, Shirshendu Das, Hemangee K. Kapoor:
Performance Constrained Static Energy Reduction Using Way-Sharing Target-Banks. 444-453 - Ke Gao, Dongrui Fan, Jie Wu, Zhiyong Liu:
Decoupling Contention with Victim Row-Buffer on Multicore Memory Systems. 454-463
Session 3: Distributed Algorithms and Computing
- Manmohan Chaubey, Erik Saule:
Replicated Data Placement for Uncertain Scheduling. 464-472 - Guillaume Aupy, Anne Benoit, Henri Casanova, Yves Robert:
Scheduling Computational Workflows on Failure-Prone Platforms. 473-482 - Nicolas Braud-Santoni, Swan Dubois, Mohamed-Hamza Kaaouachi, Franck Petit:
A Generic Framework for Impossibility Results in Time-Varying Graphs. 483-489 - Ajoy Kumar Datta, Anissa Lamani, Lawrence L. Larmore, Franck Petit:
Enabling Ring Exploration with Myopic Oblivious Robots. 490-499
Session 4: Wireless Networks and Distributed Systems
- Jian Tang, Mikel Larrea, Sergio Arévalo, Ernesto Jiménez:
Implementing Uniform Reliable Broadcast in Anonymous Distributed Systems with Fair Lossy Channels. 500-508 - Min Shen, Ajay D. Kshemkalyani, Ta Yuan Hsu:
Causal Consistency for Geo-Replicated Cloud Storage under Partial Replication. 509-518 - Lucas Rodrigues Costa, Lucas Saad N. Nunes, Jacir Luiz Bordim, Koji Nakano:
Asterisk PBX Capacity Evaluation. 519-524 - Marcos Fagundes Caetano, Jacir Luiz Bordim:
A Fair Randomized Contention Resolution Protocol for Wireless Nodes without Collision Detection Capabilities. 525-533
Workshop 7: HPBC - High Performance Big Data and Cloud Computing Workshop and HPDIC - High Performance Data Intensive Computing
- Eric E. Aubanel, Virendrakumar C. Bhavsar, Michael A. Frumkin:
HPBC Introduction and Committees. 534 - Tim Mattson:
HPBC Keynote. 535 - Christophe Cérin, R. K. Shyamasundar, Yuqing Gao, Congfeng Jiang:
HPDIC Introduction and Committees. 536
Session 1: Big Data and Cloud Computing: Storage, Analytics and Data Transfer
- Lars Lundberg, Håkan Grahn, Dragos Ilie, Christian Melander:
Cache Support in a High Performance Fault-Tolerant Distributed Storage System for Cloud and Big Data. 537-546 - Madhushi Niluka Bandara, Rajitha Madhushan Ranasinghe, Rashmi Woranga Mudugamuwa Arachchi, Channa Gayan Somathilaka, Srinath Perera, Daya Chinthana Wimalasuriya:
A Complex Event Processing Toolkit for Detecting Technical Chart Patterns. 547-556 - Eun-Sung Jung, Rajkumar Kettimuthu:
High-Performance Serverless Data Transfer over Wide-Area Networks. 557-564
Session 2: High Performance Data Intensive Computing
- E. Wes Bethel, David Camp, David Donofrio, Mark Howison:
Improving Performance of Structured-Memory, Data-Intensive Applications on Multi-core Platforms via a Space-Filling Curve Memory Layout. 565-574 - Bhavik Shah, Trupti Padiya, Minal Bhise:
Query Execution for RDF Data Using Structure Indexed Vertical Partitioning. 575-584 - Medha Abhijeet Shah, Dinesh B. Kulkarni:
Storm Pub-Sub: High Performance, Scalable Content Based Event Matching System Using Storm. 585-590
Workshop 8: ASHES - Accelerators and Hybrid Exascale Systems
- James Dinan, Wenguang Chen, Xiaosong Ma, Pavan Balaji, Satoshi Matsuoka, Jiayuan Meng, Yunquan Zhang:
AsHES Introduction and Committees. 591-592 - Michela Taufer:
AsHES Keynote. 593
Session 1: Accelerating Analytics
- Sina Meraji, John Keenleyside, Sunil Kamath, Bob Blainey:
Towards a Combined Grouping and Aggregation Algorithm for Fast Query Processing in Columnar Databases with GPUs. 594-603 - Dipanjan Sengupta, Kapil Agarwal, Shuaiwen Leon Song, Karsten Schwan:
GraphReduce: Large-Scale Graph Analytics on Accelerator-Based HPC Systems. 604-609 - Shuai Che, Gregory Rodgers, Bradford M. Beckmann, Steven K. Reinhardt:
Graph Coloring on the GPU and Some Techniques to Improve Load Imbalance. 610-617
Session 2: Algorithm Design for Heterogeneous Systems
- Sushil K. Prasad, Michael McDermott, Xi He, Satish Puri:
GPU-based Parallel R-tree Construction and Querying. 618-627 - Aditya Deshpande, P. J. Narayanan:
Fast Burrows Wheeler Compression Using All-Cores. 628-636 - Kiran Raj Ramamoorthy, Dip Sankar Banerjee, Kannan Srinathan, Kishore Kothapalli:
A Novel Heterogeneous Algorithm for Multiplying Scale-Free Sparse Matrices. 637-646 - Kazuya Matsumoto, Toshihiro Hanawa, Yuetsu Kodama, Hisafumi Fujii, Taisuke Boku:
Implementation of CG Method on GPU Cluster with Proprietary Interconnect TCA for GPU Direct Communication. 647-655
Workshop 9: PLC - Programming Models, Languages, and Compilers for Manycore and Heterogeneous Architectures
- Sunita Chandrasekaran:
PLC Introduction and Committees. 656-657 - Michael Gschwind:
PLC Keynote. 658
Session I: Programming and Compilation Techniques for Heterogeneous and Multicore Systems
- Meghana Gupta, Dibyendu Das, Prakash Raghavendra, Tony Tye, Leonid Lobachev, Amit Agarwal, Ravish Hegde:
Implementing Cross-Device Atomics in Heterogeneous Processors. 659-668 - Rajesh Kumar, Kishore Kothapalli:
A Novel Heterogeneous Framework for Local Dependency Dynamic Programming Problems. 669-678 - Peng Sun, Sunita Chandrasekaran, Barbara M. Chapman:
OpenMP-MCA: Leveraging Multiprocessor Embedded Systems Using Industry Standards. 679-688
Session II: Parallel Programming Experiences and Lessons Learned
- Guido Juckeland, Alexander Grund, Wolfgang E. Nagel:
Performance Portable Applications for Hardware Accelerators: Lessons Learned from SPEC ACCEL. 689-698 - Suttinee Sawadsitang, James Lin, Simon See, François Bodin, Satoshi Matsuoka:
Understanding Performance Portability of OpenACC for Supercomputers. 699-707
Session III: Novel Approaches for Emerging Platforms
- Deepak Majeti, Vivek Sarkar:
Heterogeneous Habanero-C (H2C): A Portable Programming Model for Heterogeneous Processors. 708-717 - Gil Rapaport, Ayal Zaks, Yosi Ben-Asher:
Streamlining Whole Function Vectorization in C Using Higher Order Vector Semantics. 718-727
Workshop 10: EduPar - NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Andrew Lumsdaine, Sushil K. Prasad, Martina Barnas:
EduPar Introduction and Committees. 728-729 - Geoffrey Charles Fox:
EduPar Keynote. 730
Session 1: Methods and Tools
- Jörg Hilpert, Rüdiger Berlich, Peter Lürßen, Almut Zwölfer, Jochen Barwind:
Teaching Simulations and High Performance Computing at Secondary Schools in the German State of Baden-Württemberg. 731-738 - Nasser Giacaman, Simar Kalra, Oliver Sinnen:
The Active classroom: Students and Instructors Parallel Programming in Parallel. 739-745 - Ian Finlayson, Jerome Mueller, Shehan Rajapakse, Daniel Easterling:
Introducing Tetra: An Educational Parallel Programming System. 746-751 - Joel C. Adams:
Patternlets: A Teaching Tool for Introducing Students to Parallel Design Patterns. 752-759
Session 2: Course Design
- Julio Sahuquillo, Salvador Petit, Vicent Selfa, María Engracia Gómez:
A Research-Oriented Course on Advanced Multicore Architecture. 760-765 - Karen L. Karavanic, Daniel Leblanc:
Updating an Introductory Performance Course with PDC Topics. 766-771 - Jawwad A. Shamsi, Nouman M. Durrani, Nadeem Kafi Khan:
Novelties in Teaching High Performance Computing. 772-778
Session 3: Curriculum Integration
- Ali Abu El Humos, Sungbum Hong, Jacqueline M. Jackson, Xuejun Liang, Tzusheng Pei, Bernard Aldrich:
Incorporating PDC Modules Into Computer Science Courses at Jackson State University. 779-781 - Guoming Lu, Jie Xu, Jieyan Liu, Bo Dai, Shenglin Gui, Siyu Zhan:
Integrating Parallel and Distributed Computing Topics into an Undergraduate CS Curriculum at UESTC. 782-787 - Ali Ebnenasir, Jean Mayo:
Fault-Tolerant Parallel and Distributed Computing for Software Engineering Undergraduates. 788-794
Workshop 11: GABB - Graph Algorithms Building Blocks
- Tim Mattson:
GABB Introduction and Committees. 795
GABB Session 1
- Marcin Zalewski, Nicholas Gerard Edmonds, Andrew Lumsdaine:
Declarative Patterns for Imperative Distributed Graph Algorithms. 796-803 - Ariful Azad, Aydin Buluç, John R. Gilbert:
Parallel Triangle Counting and Enumeration Using Matrix Algebra. 804-811
GABB Session 2
- Anil N. Hirani, Kaushik Kalyanaraman, Seth Watts:
Graph Laplacians and Least Squares on Graphs. 812-821 - Vijay Gadepally, Jake Bolewski, Dan Hook, Dylan Hutchison, Benjamin A. Miller, Jeremy Kepner:
Graphulo: Linear Algebra Graph Kernels for NoSQL Databases. 822-830 - Jeremiah Willcock, Andrew Lumsdaine:
A Unifying Programming Model for Parallel Graph Algorithms. 831-840 - Carl Yang, Yangzihao Wang, John D. Owens:
Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU. 841-847
Workshop 12: HPPAC - High-Performance, Power-Aware Computing
- Wu-chun Feng, Barry Rountree:
HPPAC Introduction and Committees. 848
Session 1: Provisioning and Management
- Akhil Langer, Harshit Dokania, Laxmikant V. Kalé, Udatta S. Palekar:
Analyzing Energy-Time Tradeoff in Power Overprovisioned HPC Data Centers. 849-854 - Daniel Balouek-Thomert, Eddy Caron, Laurent Lefèvre:
Energy-Aware Server Provisioning by Introducing Middleware-Level Dynamic Green Scheduling. 855-862 - Yiannis Georgiou, David Glesser, Denis Trystram:
Adaptive Resource and Job Management for Limited Power Consumption. 863-870
Session 2: Measurement, Modeling, and Optimization
- Rubasri Kalidas, Mayank Daga, Konstantinos Krommydas, Wu-chun Feng:
On the Performance, Energy, and Power of Data-Access Methods in Heterogeneous Computing Systems. 871-879 - Vignesh Adhinarayanan, Wu-chun Feng, Jonathan Woodring, David H. Rogers, James P. Ahrens:
On the Greenness of In-Situ and Post-Processing Visualization Pipelines. 880-887 - Nirmal Prajapati, Waruna Ranasinghe, Vamshi Tandrapati, Rumen Andonov, Hristo N. Djidjev, Sanjay V. Rajopadhye:
Energy Modeling and Optimization for Tiled Nested-Loop Codes. 888-895
Session 3: Efficiency
- Daniel Hackenberg, Robert Schöne, Thomas Ilsche, Daniel Molka, Joseph Schuchart, Robin Geyer:
An Energy Efficiency Feature Survey of the Intel Haswell Processor. 896-904 - Rogelio Long, Shirley Moore, Barry Rountree:
Iso-Power-Efficiency: An Approach to Scaling Application Codes with a Power Budget. 905-910 - Sridutt Bhalachandra, Allan Porterfield, Jan F. Prins:
Using Dynamic Duty Cycle Modulation to Improve Energy Efficiency in High Performance Computing. 911-918
Workshop 13: PDSEC-Workshop on Parallel and Distributed Scientific and Engineering Computing
- Peter E. Strazdins, Raphaël Couturier, Keita Teranishi, John O'Donnell, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
PDSEC Introduction and Committees. 919-920 - Naoya Maruyama:
PDSEC Keynote. 921
Session 1: Best Paper
- Jean-Claude Charr, Raphaël Couturier, Ahmed Fanfakh, Arnaud Giersch:
Energy Consumption Reduction with DVFS for Message Passing Iterative Applications on Heterogeneous Architectures. 922-931
Session 2: Performance
- Steven A. Wright, Stephen A. Jarvis:
Quantifying the Effects of Contention on Parallel File Systems. 932-940 - Peter E. Strazdins, Md. Mohsin Ali, Brendan Harding:
Highly Scalable Algorithms for the Sparse Grid Combination Technique. 941-950 - Ananta Tiwari, Martin Schulz, Laura Carrington:
Predicting Optimal Power Allocation for CPU and DRAM Domains. 951-959
Session 3: Linear Algebra
- Takeshi Fukaya, Toshiyuki Imamura:
Performance Evaluation of the Eigen Exa Eigensolver on Oakleaf-FX: Tridiagonalization Versus Pentadiagonalization. 960-969 - Sara S. Hamouda, Josh Milthorpe, Peter E. Strazdins, Vijay A. Saraswat:
A Resilient Framework for Iterative Linear Algebra Applications in X10. 970-979 - Massimiliano Fasi, Yves Robert, Bora Uçar:
Combining Backward and Forward Recovery to Cope with Silent Errors in Iterative Solvers. 980-989 - Raphaël Couturier, Lilia Ziane Khodja, Christophe Guyeux:
TSIRM: A Two-Stage Iteration with Least-Squares Residual Minimization Algorithm to Solve Large Sparse Linear Systems. 990-997
Session 4: GPUs and Manycore
- Jiayuan Meng, Thomas D. Uram, Vitali A. Morozov, Venkatram Vishwanath, Kalyan Kumaran:
Modeling Cooperative Threads to Project GPU Performance for Adaptive Parallelism. 998-1007 - Takuro Udagawa, Masakazu Sekijima:
GPU Accelerated Molecular Dynamics with Method of Heterogeneous Load Balancing. 1008-1013 - Paolo Spallaccini, Farbod Kayhan, Stefano Chinnici, Guido Montorsi:
Parallel Methods for Optimizing High Order Constellations on GPUs. 1014-1023
Workshop 14: DPDNS - Dependable Parallel, Distributed, and Network-Centric Systems
- Dimiter R. Avresky, Erik Maehle, Nectarios Koziris, Anastassios Nanos:
DPDNS Introduction and Committees. 1024
Session 1: Reliability and Threat-Detection
- Sanem Arslan, Haluk Rahmi Topcuoglu, Mahmut Taylan Kandemir, Oguz Tosun:
Performance and Energy Efficient Asymmetrically Reliable Caches for Multicore Architectures. 1025-1032 - Marc Eduard Frîncu:
Distributed Scheduling Algorithm for Highly Available Component Based Applications. 1033-1041 - Paul Wood, Saurabh Bagchi, Alefiya Hussain:
Optimizing Defensive Investments in Energy-Based Cyber-Physical Systems. 1042-1051
Session 2: Fault Tolerance
- Nentawe Gurumdimma, Arshad Jhumka, Maria Liakata, Edward Chuah, James C. Browne:
Towards Detecting Patterns in Failure Logs of Large-Scale Distributed Systems. 1052-1061 - Salem Saker, Adnan Agbaria:
Communication Pattern-Based Distributed Snapshots in Large-Scale Systems. 1062-1071 - Alessandro Pellegrini, Pierangelo di Sanzo, Dimiter R. Avresky:
A Machine Learning-Based Framework for Building Application Failure Prediction Models. 1072-1081
Session 3: Algorithms, Protocols, and Topologies
- Théodore Jean Richard Relaza, Jacques Jorda, Abdelaziz Mzoughi:
Trapezoid Quorum Protocol Dedicated to Erasure Resilient Coding Based Schemes. 1082-1088 - Brendan Benshoof, Andrew Rosen, Anu G. Bourgeois, Robert W. Harrison:
A Distributed Greedy Heuristic for Computing Voronoi Tessellations with Applications Towards Peer-to-Peer Networks. 1089-1096 - Kaliappa Ravindran:
Dependability Modeling and Assessment of Complex Adaptive Networked Systems. 1097-1105
Workshop 15: PCO - Parallel Computing and Optimization
- Didier El Baz, Bora Uçar:
PCO Introduction and Committees. 1106-1107 - Alex Pothen:
PCO Keynote. 1108
Session 1: Optimization Techniques for Parallel or Distributed Architectures
- Christian Toinard, Timothee Ravier, Christophe Cérin, Yanik Ngoko:
The Promethee Method for Cloud Brokering with Trust and Assurance Criteria. 1109-1118 - Maximilian Odendahl, Andres Goens, Rainer Leupers, Gerd Ascheid, Tomas Henriksson:
Buffer Allocation Based On-Chip Memory Optimization for Many-Core Platforms. 1119-1124
Session 2: Combinatorial Scientific Computing and Parallel Optimization Algorithms
- Enver Kayaaslan, Bora Uçar, Cevdet Aykanat:
Semi-two-dimensional Partitioning for Parallel Sparse Matrix-Vector Multiplication. 1125-1134 - Didier El Baz, Moussa Elkihel:
Parallel Asynchronous Modified Newton Methods for Network Flows. 1135-1142 - Prashant Palkar, Ashutosh Mahajan:
A Branch-and-Estimate Heuristic Procedure for Solving Nonconvex Integer Optimization Problems. 1143-1151
Workshop 16: ParLearning - Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Sutanay Choudhury, Arindam Pal, Anand V. Panangadan, Yinglong Xia:
ParLearning Introduction and Committees. 1152-1153 - David A. Bader, Yihua Huang, Ananth Kalyanaraman:
ParLearning Keynotes. 1154-1156 - Tao Luo, Yin Liao, Yurong Chen, Jianguo Li, Victor Lee:
LFRTrainer: Large-Scale Face Recognition Training System. 1157-1165 - Charith Wickramaarachchi, Charalampos Chelmis, Viktor K. Prasanna:
Empowering Fast Incremental Computation over Large Scale Dynamic Graphs. 1166-1171 - M. Sai Rajeswar, Adepu Ravi Sankar, Vineeth N. Balasubramanian, C. D. Sudheer:
Scaling Up the Training of Deep CNNs for Human Action Recognition. 1172-1177 - Yusuke Nishioka, Kenjiro Taura:
Scalable Task-Parallel SGD on Matrix Factorization in Multicore Architectures. 1178-1184 - Ravikant Dindokar, Neel Choudhury, Yogesh L. Simmhan:
Analysis of Subgraph-Centric Distributed Shortest Path Algorithm. 1185-1190 - Bing Lin, Wenzhong Guo, Guolong Chen, Naixue Xiong, Rongrong Li:
Cost-Driven Scheduling for Deadline-Constrained Workflow on Multi-clouds. 1191-1198
Workshop 17: JSSPP - Workshop on Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai:
JSSPP Introduction and Committees. 1199
Workshop 18: iWAPT - International Workshop on Automatic Performance Tuning
- Yusaku Yamamoto, Weichung Wang:
iWAPT Introduction and Committees. 1200-1201 - Ponnuswamy Sadayappan, Ray-Bing Chen:
iWAPT Invited Talks. 1202-1203
iWAPT Session 1
- Youcef Barigou, Vishwanath Venkatesan, Edgar Gabriel:
Auto-tuning Non-blocking Collective Communication Operations. 1204-1213 - Tomohiro Suzuki:
Improved Internode Communication for Tile QR Decomposition for Multicore Cluster Systems. 1214-1220
iWAPT Session 2
- Takahiro Katagiri, Satoshi Ohshima, Masaharu Matsumoto:
Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi. 1221-1230 - Thomas L. Falch, Anne C. Elster:
Machine Learning Based Auto-Tuning for Enhanced OpenCL Performance Portability. 1231-1240 - Martin Kong, Louis-Noël Pouchet, Ponnuswamy Sadayappan:
A Roofline-Based Performance Estimator for Distributed Matrix-Multiply on Intel CnC. 1241-1250
iWAPT Session 3
- Shajulin Benedict, R. S. Rejitha, Philipp Gschwandtner, Radu Prodan, Thomas Fahringer:
Energy Prediction of OpenMP Applications Using Random Forest Modeling Approach. 1251-1260 - Sanath Jayasena, Milinda Fernando, Tharindu Rusira, Chalitha Perera, Chamara Philips:
Auto-Tuning the Java Virtual Machine. 1261-1270
Workshop 19: Julia-Invited Workshop: A New Approach to High Performance Technical Computing
- Alan Edelman:
Julia Introduction. 1271
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.