default search action
ACM Transactions on Architecture and Code Optimization, Volume 19
Volume 19, Number 1, March 2022
- Aditya Ukarande, Suryakant Patidar, Ram Rangan:
Locality-Aware CTA Scheduling for Gaming Applications. 1:1-1:26 - Hongzhi Liu, Jie Luo, Ying Li, Zhonghai Wu:
Iterative Compilation Optimization Based on Metric Learning and Collaborative Filtering. 2:1-2:25 - Muhammad Aditya Sasongko, Milind Chabbi, Mandana Bagheri-Marzijarani, Didem Unat:
ReuseTracker: Fast Yet Accurate Multicore Reuse Distance Analyzer. 3:1-3:25 - Yaosheng Fu, Evgeny Bolotin, Niladrish Chatterjee, David W. Nellans, Stephen W. Keckler:
GPU Domain Specialization via Composable On-Package Architecture. 4:1-4:23 - Daeyeal Lee, Bill Lin, Chung-Kuan Cheng:
SMT-Based Contention-Free Task Mapping and Scheduling on 2D/3D SMART NoC with Mixed Dimension-Order Routing. 5:1-5:21 - Prasanth Chatarasi, Hyoukjun Kwon, Angshuman Parashar, Michael Pellauer, Tushar Krishna, Vivek Sarkar:
Marvel: A Data-Centric Approach for Mapping Deep Learning Operators on Spatial Accelerators. 6:1-6:26 - Dennis Rieber, Axel Acosta, Holger Fröning:
Joint Program and Layout Transformations to Enable Convolutional Operators on Specialized Hardware Based on Constraint Programming. 7:1-7:26 - Mengya Lei, Fan Li, Fang Wang, Dan Feng, Xiaomin Zou, Renzhi Xiao:
SecNVM: An Efficient and Write-Friendly Metadata Crash Consistency Scheme for Secure NVM. 8:1-8:26 - Bang Di, Daokun Hu, Zhen Xie, Jianhua Sun, Hao Chen, Jinkui Ren, Dong Li:
TLB-pilot: Mitigating TLB Contention Attack on GPUs with Microarchitecture-Aware Scheduling. 9:1-9:23 - Gururaj Saileshwar, Rick Boivie, Tong Chen, Benjamin Segal, Alper Buyuktosunoglu:
HeapCheck: Low-cost Hardware Support for Memory Safety. 10:1-10:24 - Muhammad Waqar Azhar, Miquel Pericàs, Per Stenström:
Task-RM: A Resource Manager for Energy Reduction in Task-Parallel Applications under Quality of Service Constraints. 11:1-11:26 - Cesar Gomes, Maziar Amiraski, Mark Hempstead:
CASHT: Contention Analysis in Shared Hierarchies with Thefts. 12:1-12:27 - Yufei Wang, Xiaoshe Dong, Longxiang Wang, Weiduo Chen, Xingjun Zhang:
Optimizing Small-Sample Disk Fault Detection Based on LSTM-GAN Model. 13:1-13:24 - Franyell Silfa, José-María Arnau, Antonio González:
E-BATCH: Energy-Efficient and High-Throughput RNN Batching. 14:1-14:23 - Chen Ding, Dong Chen, Fangzhou Liu, Benjamin Reber, Wesley Smith:
CARL: Compiler Assigned Reference Leasing. 15:1-15:28
Volume 19, Number 2, June 2022
- Christof Schlaak, Tzung-Han Juang, Christophe Dubach:
Memory-Aware Functional IR for Higher-Level Synthesis of Accelerators. 16:1-16:26 - Kartik Lakshminarasimhan, Ajeya Naithani, Josué Feliu, Lieven Eeckhout:
The Forward Slice Core: A High-Performance, Yet Low-Complexity Microarchitecture. 17:1-17:25 - Sharanyan Srikanthan, Sayak Chakraborti, Princeton Ferro, Sandhya Dwarkadas:
MAPPER: Managing Application Performance via Parallel Efficiency Regulation*. 18:1-18:26 - Athanasios Tziouvaras, Georgios Dimitriou, Georgios I. Stamoulis:
Low-power Near-data Instruction Execution Leveraging Opcode-based Timing Analysis. 19:1-19:26 - Xingguo Jia, Jin Zhang, Boshi Yu, Xingyue Qian, Zhengwei Qi, Haibing Guan:
GiantVM: A Novel Distributed Hypervisor for Resource Aggregation with DSM-aware Optimizations. 20:1-20:27 - Mehrzad Nejat, Madhavan Manivannan, Miquel Pericàs, Per Stenström:
Cooperative Slack Management: Saving Energy of Multicore Processors by Trading Performance Slack Between QoS-Constrained Applications. 21:1-21:27 - Hugo Pompougnac, Ulysse Beaugnon, Albert Cohen, Dumitru Potop-Butucaru:
Weaving Synchronous Reactions into the Fabric of SSA-form Compilers. 22:1-22:25 - Ghassan Shobaki, Vahl Scott Gordon, Paul McHugh, Theodore Dubois, Austin Kerbow:
Register-Pressure-Aware Instruction Scheduling Using Ant Colony Optimization. 23:1-23:23 - Qihan Wang, Zhen Peng, Bin Ren, Jie Chen, Robert G. Edwards:
MemHC: An Optimized GPU Memory Management Framework for Accelerating Many-body Correlation. 24:1-24:26 - Rakesh Kumar, Mehdi Alipour, David Black-Schaffer:
Dependence-aware Slice Execution to Boost MLP in Slice-out-of-order Cores. 25:1-25:28 - Nandita Vijaykumar, Ataberk Olgun, Konstantinos Kanellopoulos, F. Nisa Bostanci, Hasan Hassan, Mehrshad Lotfi, Phillip B. Gibbons, Onur Mutlu:
MetaSys: A Practical Open-source Metadata Management System to Implement and Evaluate Cross-layer Optimizations. 26:1-26:29 - Jing Chen, Madhavan Manivannan, Mustafa Abduljabbar, Miquel Pericàs:
ERASE: Energy Efficient Task Mapping and Resource Management for Work Stealing Runtimes. 27:1-27:29 - Chencheng Ye, Yuanchao Xu, Xipeng Shen, Hai Jin, Xiaofei Liao, Yan Solihin:
Preserving Addressability Upon GC-Triggered Data Movements on Non-Volatile Memory. 28:1-28:26 - George Michelogiannakis, Benjamin Klenk, Brandon Cook, Min Yee Teh, Madeleine Glick, Larry Dennison, Keren Bergman, John Shalf:
A Case For Intra-rack Resource Disaggregation in HPC. 29:1-29:26
Volume 19, Number 3, September 2022
- Ping Wang, Fei Wen, Paul V. Gratz, Alex Sprintson:
SIMD-Matcher: A SIMD-based Arbitrary Matching Framework. 30:1-30:20 - Marcel Mettler, Martin Rapp, Heba Khdr, Daniel Mueller-Gritschneder, Jörg Henkel, Ulf Schlichtmann:
An FPGA-based Approach to Evaluate Thermal and Resource Management Strategies of Many-core Processors. 31:1-31:24 - Paschalis Mpeis, Pavlos Petoumenos, Kim M. Hazelwood, Hugh Leather:
Object Intersection Captures on Interactive Apps to Drive a Crowd-sourced Replay-based Compiler Optimization. 32:1-32:25 - Cunlu Li, Dezun Dong, Xiangke Liao:
MUA-Router: Maximizing the Utility-of-Allocation for On-chip Pipelining Routers. 33:1-33:23 - Ziaul Choudhury, Shashwat Shrivastava, Lavanya Ramapantulu, Suresh Purini:
An FPGA Overlay for CNN Inference with Fine-grained Flexible Parallelism. 34:1-34:26 - Diksha Moolchandani, Anshul Kumar, Smruti R. Sarangi:
Performance and Power Prediction for Concurrent Execution on GPUs. 35:1-35:27 - Ali Jahanshahi, Nanpeng Yu, Daniel Wong:
PowerMorph: QoS-Aware Server Power Reshaping for Data Center Regulation Service. 36:1-36:27 - Peng Xu, Nannan Zhao, Jiguang Wan, Wei Liu, Shuning Chen, Yuanhui Zhou, Hadeel Albahar, Hanyang Liu, Liu Tang, Zhi-hu Tan:
Building a Fast and Efficient LSM-tree Store by Integrating Local Storage with Cloud Storage. 37:1-37:26 - Horng-Ruey Huang, Ding-Yong Hong, Jan-Jan Wu, Kung-Fu Chen, Pangfeng Liu, Wei-Chung Hsu:
Accelerating Video Captioning on Heterogeneous System Architectures. 38:1-38:25 - Shivam Kundan, Theodoros Marinakis, Iraklis Anagnostopoulos, Dimitri Kagaris:
A Pressure-Aware Policy for Contention Minimization on Multicore Systems. 40:1-40:26 - Johnathan Alsop, Weon Taek Na, Matthew D. Sinclair, Samuel Grayson, Sarita V. Adve:
A Case for Fine-grain Coherence Specialization in Heterogeneous Systems. 41:1-41:26 - Mohammadreza Soltaniyeh, Richard P. Martin, Santosh Nagarakatte:
An Accelerator for Sparse Convolutional Neural Networks Leveraging Systolic General Matrix-matrix Multiplication. 42:1-42:26 - Dharanidhar Dang, Bill Lin, Debashis Sahoo:
LiteCON: An All-photonic Neuromorphic Accelerator for Energy-efficient Deep Learning. 43:1-43:22 - Lokesh Siddhu, Rajesh Kedia, Shailja Pandey, Martin Rapp, Anuj Pathania, Jörg Henkel, Preeti Ranjan Panda:
CoMeT: An Integrated Interval Thermal Simulation Toolchain for 2D, 2.5D, and 3D Processor-Memory Systems. 44:1-44:25 - Matthew Benjamin Olson, Brandon Kammerdiener, Michael R. Jantz, Kshitij A. Doshi, Terry R. Jones:
Online Application Guidance for Heterogeneous Memory Systems. 45:1-45:27 - Bruno Chinelato Honorio, João P. L. de Carvalho, Catalina Munoz Morales, Alexandro Baldassin, Guido Araujo:
Using Barrier Elision to Improve Transactional Code Generation. 46:1-46:23
Volume 19, Number 4, December 2022
- Jiansong Li, Xueying Wang, Xiaobing Chen, Guangli Li, Xiao Dong, Peng Zhao, Xianzhi Yu, Yongxin Yang, Wei Cao, Lei Liu, Xiaobing Feng:
An Application-oblivious Memory Scheduling System for DNN Accelerators. 47:1-47:26 - Aditya Narayan, Yvain Thonnart, Pascal Vivet, Ayse K. Coskun, Ajay Joshi:
Architecting Optically Controlled Phase Change Memory. 48:1-48:26 - Chao Zhang, Maximilian H. Bremer, Cy P. Chan, John Shalf, Xiaochen Guo:
ASA: Accelerating Sparse Accumulation in Column-wise SpGEMM. 49:1-49:24 - Aart J. C. Bik, Penporn Koanantakool, Tatiana Shpeisman, Nicolas Vasilache, Bixia Zheng, Fredrik Kjolstad:
Compiler Support for Sparse Tensor Computations in MLIR. 50:1-50:25 - Pierre Michaud, Anis Peysieux:
HAIR: Halving the Area of the Integer Register File with Odd/Even Banking. 51:1-51:25 - Amirreza Yousefzadeh, Jan Stuijt, Martijn Hijdra, Hsiao-Hsuan Liu, Anteneh Gebregiorgis, Abhairaj Singh, Said Hamdioui, Francky Catthoor:
Energy-efficient In-Memory Address Calculation. 52:1-52:16 - Hwisoo So, Moslem Didehban, Yohan Ko, Aviral Shrivastava, Kyoungwoo Lee:
EXPERTISE: An Effective Software-level Redundant Multithreading Scheme against Hardware Faults. 53:1-53:26 - Tim Hartley, Foivos S. Zakkak, Andy Nisbet, Christos Kotselidis, Mikel Luján:
Just-In-Time Compilation on ARM - A Closer Look at Call-Site Code Consistency. 54:1-54:23 - Erling Rennemo Jellum, Milica Orlandic, Edmund Brekke, Tor Arne Johansen, Torleiv H. Bryne:
Solving Sparse Assignment Problems on FPGAs. 55:1-55:20 - Yuhao Li, Benjamin C. Lee:
Phronesis: Efficient Performance Modeling for High-dimensional Configuration Tuning. 56:1-56:26 - Chandrahas Tirumalasetty, Chih-Chieh Chou, A. L. Narasimha Reddy, Paul Gratz, Ayman Abouelwafa:
Reducing Minor Page Fault Overheads through Enhanced Page Walker. 57:1-57:26 - Lan Gao, Jing Wang, Weigong Zhang:
Adaptive Contention Management for Fine-Grained Synchronization on Commodity GPUs. 58:1-58:21 - Ruobing Han, Jaewon Lee, Jaewoong Sim, Hyesoon Kim:
COX : Exposing CUDA Warp-level Functions to CPUs. 59:1-59:25 - Yiding Liu, Xingyao Zhang, Donglin Zhuang, Xin Fu, Shuaiwen Song:
DynamAP: Architectural Support for Dynamic Graph Traversal on the Automata Processor. 60:1-60:26 - Changwei Zou, Yaoqing Gao, Jingling Xue:
Practical Software-Based Shadow Stacks on x86-64. 61:1-61:26
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.