default search action
HPEC 2021: Waltham, MA, USA
- 2021 IEEE High Performance Extreme Computing Conference, HPEC 2021, Waltham, MA, USA, September 20-24, 2021. IEEE 2021, ISBN 978-1-6654-2369-4
- Amanda Bienz, Luke N. Olson, William D. Gropp, Shelby Lockhart:
Modeling Data Movement Performance on Heterogeneous Architectures. 1-7 - Yijia Zhang, Burak Aksar, Omar Aaziz, Benjamin Schwaller, Jim M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun:
Using Monitoring Data to Improve HPC Performance via Network-Data-Driven Allocation. 1-7 - Chinmaya Patnayak, James E. McClure, Ryan K. Williams:
WASP: A Wearable Super-Computing Platform for Distributed Intelligence in Multi-Agent Systems. 1-7 - Zhili Xiao, Roger D. Chamberlain, Anthony M. Cabrera:
HLS Portability from Intel to Xilinx: A Case Study. 1-8 - Upasana Sridhar, Tze Meng Low, Martin D. Schatz:
Fusing Non Element-wise Layers in DNNs. 1-2 - Bill Bergeron, Matthew Hubbell, Dylan Sequeira, Winter Williams, William Arcand, David Bestor, Chansup Byun, Vijay Gadepally, Michael Houle, Michael Jones, Anna Klien, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner:
3D Real-Time Supercomputer Monitoring. 1-7 - Kaira Samuel, Vijay Gadepally, David Jacobs, Michael Jones, Kyle McAlpin, Kyle Palko, Ben Paulk, Sid Samsi, Ho Chit Siu, Charles Yee, Jeremy Kepner:
Maneuver Identification Challenge. 1-7 - Michel Pelletier, Will Kimmerer, Timothy A. Davis, Timothy G. Mattson:
The GraphBLAS in Julia and Python: the PageRank and Triangle Centralities. 1-7 - Jeremy Kepner, Michael Jones, Daniel Andersen, Aydin Buluç, Chansup Byun, Kimberly C. Claffy, Timothy Davis, William Arcand, Jonathan Bernays, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Chad R. Meiners, Lauren Milechin, Julie Mullen, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Doug Stetson, Adam Tse, Charles Yee, Peter Michaleas:
Spatial Temporal Analysis of 40, 000, 000, 000, 000 Internet Darkspace Packets. 1-8 - Jie Xin, Xianqi Ye, Long Zheng, Qinggang Wang, Yu Huang, Pengcheng Yao, Linchen Yu, Xiaofei Liao, Hai Jin:
Fast Sparse Deep Neural Network Inference with Flexible SpMM Optimization Space Exploration. 1-7 - Runze Wang, Linchen Yu, Qinggang Wang, Jie Xin, Long Zheng:
Productive High-Performance k-Truss Decomposition on GPU Using Linear Algebra. 1-7 - Julia Wei, Matthew Harper Langston, Pierre-David Letourneau, Matthew J. Morse, Larry Weintraub, Aimee Nogoy, Noah Amsel, Richard Lethin:
Boundary Integral Solver Approaches for Particle Accelerator Simulation Problems and Deployment on NERSC Hardware. 1-6 - Anthony M. Cabrera, Seth Hitefield, Jungwon Kim, Seyong Lee, Narasinga Rao Miniskar, Jeffrey S. Vetter:
Toward Performance Portable Programming for Heterogeneous Systems on a Chip: A Case Study with Qualcomm Snapdragon SoC. 1-7 - Jeremy M. Myers, Daniel M. Dunlavy:
Using Computation Effectively for Scalable Poisson Tensor Factorization: Comparing Methods Beyond Computational Efficiency. 1-7 - Yingbo Ma, Vaibhav Dixit, Michael J. Innes, Xingjian Guo, Christopher Rackauckas:
A Comparison of Automatic Differentiation and Continuous Sensitivity Analysis for Derivatives of Differential Equation Solutions. 1-9 - Dennis Milechin, Ahmed Aly, Josh Bevan, Charlie Jahnke, Yun Shen, Brian Gregor:
Pragmatic Benchmarking for Research Computing. 1-6 - Qizhe Wu, Linfeng Tao, Huawen Liang, Wei Yuan, Teng Tian, Shuang Xue, Xi Jin:
Software-Hardware Co-Optimization on Partial-Sum Problem for PIM-based Neural Network Accelerator. 1-7 - Trevor Steil, Geoffrey Sanders, Roger Pearce:
Towards Distributed Square Counting in Large Graphs. 1-7 - Kyle Denney, Robert Lychev, Donato Kava, Alice Lee, Michael Vai, Nick Evancich, Richard Clark, David Lide, Kyung Joon Kwak, Jason H. Li, Michael Lynch, Kyle Tillotson, Walt Tirenin, Douglas Schafer:
A Novel Approach to Cyber Situational Awareness in Embedded Systems. 1-5 - Yi-Chien Lin, Bingyi Zhang, Viktor K. Prasanna:
GCN Inference Acceleration using High-Level Synthesis. 1-6 - Jeremy Kepner, Tim Davis, Chansup Byun, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Lauren Milechin, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas:
Vertical, Temporal, and Horizontal Scaling of Hierarchical Hypersparse GraphBLAS Matrices. 1-6 - Ben Burnett, Sigal Gottlieb, Zachary J. Grant, Alfa R. H. Heryudono:
Performance Evaluation of Mixed-Precision Runge-Kutta Methods. 1-6 - Abhishek Kumar Jain, Sharan Kumar, Aashish Tripathi, Dinesh Gaitonde:
Sparse Deep Neural Network Acceleration on HBM-Enabled FPGA Platform. 1-7 - Dennis Bautembach, Iason Oikonomidis, Antonis A. Argyros:
Even Faster SNN Simulation with Lazy+Event-driven Plasticity and Shared Atomics. 1-8 - Fuhuan Li, David A. Bader:
A GraphBLAS Implementation of Triangle Centrality. 1-2 - Hafsah Shahzad, Ahmed Sanaullah, Martin C. Herbordt:
Survey and Future Trends for FPGA Cloud Architectures. 1-10 - Kaira Samuel, Jeremy Kepner, Michael Jones, Lauren Milechin, Vijay Gadepally, William Arcand, David Bestor, William Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle, Anna Klein, Victor Lopez, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Sid Samsi, Charles Yee, Peter Michaleas:
Supercomputing Enabled Deployable Analytics for Disaster Response. 1-5 - Mark Bolding, Saul Crumpton, David Ediger, George Samo:
Performance of a GPU-Based Radar Processor. 1-5 - Zhihui Du, Oliver Alvarado Rodriguez, David A. Bader:
Large Scale String Analytics in Arkouda. 1-7 - Daniel Sharp, Miroslav Stoyanov, Stanimire Tomov, Jack J. Dongarra:
A More Portable HeFFTe: Implementing a Fallback Algorithm for Scalable Fourier Transforms. 1-5 - Akanksha Soni, Jeetendra Kumar Soni, Surabhi Hota:
Detection of Multiple Crop Diseases using Image Processing Techniques. 1-6 - Goutham Kalikrishna Reddy Kuncham, Rahul Vaidya, Mahesh Barve:
Performance Study of GPU applications using SYCL and CUDA on Tesla V100 GPU. 1-7 - Scott Mionis, Franz Franchetti, Jason Larkin:
Optimized Quantum Circuit Generation with SPIRAL. 1-7 - Lauren Milechin, Javier Lopez-Contreras, Ferran Alet:
Efficiently Building a Large Scale Dataset for Program Induction. 1-7 - Vasileios Kalantzis, Anshul Gupta, Lior Horesh, Tomasz Nowicki, Mark S. Squillante, Chai Wah Wu, Tayfun Gokmen, Haim Avron:
Solving sparse linear systems with approximate inverse preconditioners on analog devices. 1-7 - Zach Hansen, Brody Williams, John D. Leidel, Xi Wang, Yong Chen:
DMM-GAPBS: Adapting the GAP Benchmark Suite to a Distributed Memory Model. 1-8 - Mark Barnell, Courtney Raymond, Anthony Salmin, Dan Brown, Darrek Isereau:
Model Quantization and Synthetic Aperture Data Analyses Increasing Throughput and Energy Efficiency. 1-5 - Jessie M. Henderson, Daniel O'Malley, Hari S. Viswanathan:
Interrogating the performance of quantum annealing for the solution of steady-state subsurface flow. 1-6 - Kevin Brady, Pooya Khorrami, Lars Gjesteby, Laura J. Brattain:
Instance Segmentation of Neuronal Nuclei Leveraging Domain Adaptation. 1-5 - Bingyi Zhang, Sanmukh R. Kuppannagari, Rajgopal Kannan, Viktor K. Prasanna:
Efficient Neighbor-Sampling-based GNN Training on CPU-FPGA Heterogeneous Platform. 1-7 - Brian A. Page, Peter M. Kogge:
Deluge: Achieving Superior Efficiency, Throughput, and Scalability with Actor Based Streaming on Migrating Threads. 1-6 - Mark P. Blanco, Scott McMillan, Tze Meng Low:
Delayed Asynchronous Iterative Graph Algorithms. 1-7 - Joseph McDonald, Siddharth Samsi, Daniel Edelman, Chansup Byun, Jeremy Kepner, Vijay Gadepally:
Improved Compression for Word Embeddings by Scaling Principal Components. 1-7 - Karim Youssef, Keita Iwabuchi, Wu-Chun Feng, Roger Pearce:
Privateer: Multi-versioned Memory-mapped Data Stores for High-Performance Data Science. 1-7 - Dimitri Leggas, Muthu Manikandan Baskaran, James R. Ezick, Brendan von Hofe:
Filtered Tensor Construction and Decomposition for Drug Repositioning. 1-7 - Teresa M. Ranadive, Muthu Manikandan Baskaran:
An All-at-Once CP Decomposition Method for Count Tensors. 1-8 - Spencer Nelson, Wassim Khalil, SangYun Kim, Jia Di, Zhe Zhou, Zhihang Yuan, Guang-Yu Sun:
Rapid Configuration of Asynchronous Recurrent Neural Networks for ASIC Implementations. 1-6 - Alan Ehret, Eliakin Del Rosario, Carsten Schwicking, Karen Gettings, Michel A. Kinsy:
Reconfigurable Hardware Root-of-Trust for Secure Edge Processing. 1-7 - Yasunori Futamura, Ryota Wakaki, Tetsuya Sakurai:
Spectral Graph Partitioning Using Geodesic Distance-based Projection. 1-7 - Ryan Kabrick, John D. Leidel, David Donofrio:
Toward HDL Extensions for Rapid AI/ML Accelerator Generation. 1-6 - Tiancheng Liu, Dimitris Floros, Nikos Pitsianis, Xiaobai Sun:
Digraph Clustering by the BlueRed Method. 1-7 - Chandler Bernard, William Bryant, Richard Becker, Jia Di:
Design of Asynchronous Polymorphic Logic Gates for Hardware Security. 1-5 - Ahsen J. Uppal, Jaeseok Choi, Thomas B. Rolinger, H. Howie Huang:
Faster Stochastic Block Partition Using Aggressive Initial Merging, Compressed Representation, and Parallelism Control. 1-7 - Benedikt Mayr, Alexander Weinrauch, Mathias Parger, Markus Steinberger:
Are van Emde Boas trees viable on the GPU? 1-7 - Chunshu Wu, Sahan Bandara, Tong Geng, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
System-Level Modeling of GPU/FPGA Clusters for Molecular Dynamics Simulations. 1-8 - Sandeep Pisharody, Jonathan Bernays, Vijay Gadepally, Michael Jones, Jeremy Kepner, Chad R. Meiners, Peter Michaleas, Adam Tse, Doug Stetson:
Realizing Forward Defense in the Cyber Domain. 1-7 - Deniz Gurevin, Chris J. Michael, Omer Khan:
An Efficient Algorithm for the Construction of Dynamically Updating Trajectory Networks. 1-7 - Marcin Rogowski, Lisandro Dalcín, Matteo Parsani, David E. Keyes:
Implications of Reduced Communication Precision in a Collocated Discontinuous Galerkin Finite Element Framework. 1-7 - F. Patricia Medina, Randy C. Paffenroth:
Classification frameworks comparison on 3D point clouds. 1-6 - Nicholas Grabill, Kai Pinckard, Dirk Colbry:
Scaling of Evolutionary Search of Algorithm Space to Speed-Up Scientific Image Understanding Workflows. 1-6 - Cannada Lewis, Eric T. Phipps:
Low-Communication Asynchronous Distributed Generalized Canonical Polyadic Tensor Decomposition. 1-5 - Longlong Li, Hu Chen, Ping Li, Jie Han, Guanghui Wang, Gong Zhang:
The K-Core Decomposition Algorithm Under the Framework of GraphBLAS. 1-7 - Lisa J. K. Durbeck, Peter Athanas:
DPGS Graph Summarization Preserves Community Structure. 1-9 - Pouya Haghi, Anqi Guo, Tong Geng, Anthony Skjellum, Martin C. Herbordt:
Workload Imbalance in HPC Applications: Effect on Performance of In-Network Processing. 1-8 - Salman Abdul Khaliq, Usman Ali, Omer Khan:
Timing-based side-channel attack and mitigation on PCIe connected distributed embedded systems. 1-7 - Siddharth Samsi, Matthew L. Weiss, David Bestor, Baolin Li, Michael Jones, Albert Reuther, Daniel Edelman, William Arcand, Chansup Byun, John Holodnack, Matthew Hubbell, Jeremy Kepner, Anna Klein, Joseph McDonald, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia S. Mullen, Charles Yee, Benjamin Price, Andrew Prout, Antonio Rosa, Allan Vanterpool, Lindsey McEvoy, Anson Cheng, Devesh Tiwari, Vijay Gadepally:
The MIT Supercloud Dataset. 1-8 - Sasindu Wijeratne, Rajgopal Kannan, Viktor K. Prasanna:
Reconfigurable Low-latency Memory System for Sparse Matricized Tensor Times Khatri-Rao Product on FPGA. 1-7 - Wesley Brewer, Chris Geyer, Dardo Kleiner, Connor Horne:
Streaming Detection and Classification Performance of a POWER9 Edge Supercomputer. 1-7 - Craig Walker, Braeden Slade, Gavin Bailey, Nicklaus Przybylski, Nathan DeBardeleben, William M. Jones:
Exploring the Tradeoff Between Reliability and Performance in HPC Systems. 1-7 - Andrew Wood, Moshik Hershcovitch, Daniel G. Waddington, Sarel Cohen, Meredith Wolf, Hongjun Suh, Weiyu Zong, Peter Chin:
Non-Volatile Memory Accelerated Geometric Multi-Scale Resolution Analysis. 1-7 - Connor Imes, Tzu-Mao Li, Mark Glines, Rishi Khan, John Paul Walters:
Distributed and Heterogeneous SAR Backprojection with Halide. 1-9 - Mohammad Almasri, Neo Vasudeva, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu:
HyKernel: A Hybrid Selection of One/Two-Phase Kernels for Triangle Counting on GPUs. 1-7 - Amin Norollah, Zahra Kazemi, Niloufar Sayadi, Hakem Beitollahi, Mahdi Fazeli, David Hély:
Efficient Scheduling of Dependent Tasks in Many-Core Real-Time System Using a Hardware Scheduler. 1-7 - Mitesh Kothari, Richard W. Vuduc:
An interface for multidimensional arrays in Arkouda. 1-2 - Dimitri Leggas, Christopher J. Coley, Teresa M. Ranadive:
Knowledge-guided Tensor Decomposition for Baselining and Anomaly Detection. 1-7 - Zhihui Du, Oliver Alvarado Rodriguez, David A. Bader:
Enabling Exploratory Large Scale Graph Analytics through Arkouda. 1-7 - Mike H. M. Teodorescu, Xinyu Yao:
Machine Learning Fairness is Computationally Difficult and Algorithmically Unsatisfactorily Solved. 1-8 - Andrew J. Weinert, Marc Brittain, Ngaire Underhill, Christine Serres:
Benchmarking the Processing of Aircraft Tracks with Triples Mode and Self-Scheduling. 1-8 - Baolin Li, Vijay Gadepally, Siddharth Samsi, Mark S. Veillette, Devesh Tiwari:
Serving Machine Learning Inference Using Heterogeneous Hardware. 1-8 - Oded Green:
Inverse-Deletion BFS - Revisiting Static Graph BFS Traversals with Dynamic Graph Operations. 1-7 - Sadasivan Sadas Shankar:
Lessons from Nature for Computing: Looking beyond Moore's Law with Special Purpose Computing and Co-design*. 1-8 - Md Abdul Motaleb Faysal, Shaikh Arifuzzaman, Cy P. Chan, Maximilian H. Bremer, Doru Popovici, John Shalf:
HyPC-Map: A Hybrid Parallel Community Detection Algorithm Using Information-Theoretic Approach. 1-8 - Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner:
AI Accelerator Survey and Trends. 1-9 - Haleh Khojasteh, Hirad Tabatabaei:
A Survey and Taxonomy of Blockchain-based Payment Channel Networks. 1-8 - Stephen L. Olivier, Nathan D. Ellingwood, Jonathan W. Berry, Daniel M. Dunlavy:
Performance Portability of an SpMV Kernel Across Scientific Computing and Data Science Applications. 1-8 - Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner:
Node-Based Job Scheduling for Large Scale Simulations of Short Running Jobs. 1-7 - Michael E. Franusich, Franz Franchetti:
Graph Embedding and Field Based Detection of Non-Local Webs in Large Scale Free Networks. 1-7 - Yasin Zamani, Tsung-Wei Huang:
A High-Performance Heterogeneous Critical Path Analysis Framework. 1-7 - Jungwon Kim, Seyong Lee, Beau Johnston, Jeffrey S. Vetter:
IRIS: A Portable Runtime System Exploiting Multiple Heterogeneous Programming Systems. 1-8 - Ruiwen Shan, Sheng Di, Jon C. Calhoun, Franck Cappello:
Towards Combining Error-bounded Lossy Compression and Cryptography for Scientific Data. 1-7 - Luca Piccolboni, Giuseppe Di Guglielmo, Simha Sethumadhavan, Luca P. Carloni:
HARDROID: Transparent Integration of Crypto Accelerators in Android. 1-8 - Michael Parker:
Embedded Compute Matrix Processing and FFTs using Floating Point FPGAs. 1-5 - Tong Geng, Chunshu Wu, Cheng Tan, Chenhao Xie, Anqi Guo, Pouya Haghi, Sarah Yuan He, Jiajia Li, Martin C. Herbordt, Ang Li:
A Survey: Handling Irregularities in Neural Network Acceleration with FPGAs. 1-8 - Dean G. Chester, Taylor L. Groves, Simon D. Hammond, Tim Law, Steven A. Wright, Richard P. Smedley-Stevenson, Suhaib A. Fahmy, Gihan R. Mudalidge, Stephen A. Jarvis:
StressBench: A Configurable Full System Network and I/O Benchmark Framework.
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.