Learning structured sparsity in deep neural networks
High demand for computation resources severely hinders deployment of large-scale Deep
Neural Networks (DNN) in resource constrained devices. In this work, we propose a …
Neural Networks (DNN) in resource constrained devices. In this work, we propose a …
Pipelayer: A pipelined reram-based accelerator for deep learning
Convolution neural networks (CNNs) are the heart of deep learning applications. Recent
works PRIME [1] and ISAAC [2] demonstrated the promise of using resistive random access …
works PRIME [1] and ISAAC [2] demonstrated the promise of using resistive random access …
Extractive summarization as text matching
This paper creates a paradigm shift with regard to the way we build neural extractive
summarization systems. Instead of following the commonly used framework of extracting sentences …
summarization systems. Instead of following the commonly used framework of extracting sentences …
A novel architecture of the 3D stacked MRAM L2 cache for CMPs
Magnetic random access memory (MRAM) is a promising memory technology, which has
fast read access, high density, and non-volatility. Using 3D heterogeneous integrations, it …
fast read access, high density, and non-volatility. Using 3D heterogeneous integrations, it …
A survey of accelerator architectures for deep neural networks
Recently, due to the availability of big data and the rapid growth of computing power, artificial
intelligence (AI) has regained tremendous attention and investment. Machine learning (ML…
intelligence (AI) has regained tremendous attention and investment. Machine learning (ML…
Hermes: an efficient federated learning framework for heterogeneous mobile clients
Federated learning (FL) has been a popular method to achieve distributed machine learning
among numerous devices without sharing their data to a cloud server. FL aims to learn a …
among numerous devices without sharing their data to a cloud server. FL aims to learn a …
Memristor crossbar-based neuromorphic computing system: A case study
By mimicking the highly parallel biological systems, neuromorphic hardware provides the
capability of information processing within a compact and energy-efficient platform. However, …
capability of information processing within a compact and energy-efficient platform. However, …
Terngrad: Ternary gradients to reduce communication in distributed deep learning
High network communication cost for synchronizing gradients and parameters is the well-known
bottleneck of distributed training. In this work, we propose TernGrad that uses ternary …
bottleneck of distributed training. In this work, we propose TernGrad that uses ternary …
A novel chaos-based image encryption algorithm using DNA sequence operations
An image encryption algorithm based on chaotic system and deoxyribonucleic acid (DNA)
sequence operations is proposed in this paper. First, the plain image is encoded into a DNA …
sequence operations is proposed in this paper. First, the plain image is encoded into a DNA …
Emerging non-volatile memories: Opportunities and challenges
In recent years, non-volatile memory (NVM) technologies have emerged as candidates for
future universal memory. NVMs generally have advantages such as low leakage power, high …
future universal memory. NVMs generally have advantages such as low leakage power, high …