Gebruikersprofielen voor Xuehai Qian
Xuehai QianTsinghua University Geverifieerd e-mailadres voor tsinghua.edu.cn Geciteerd door 7790 |
Capuchin: Tensor-based gpu memory management for deep learning
In recent years, deep learning has gained unprecedented success in various domains, the
key of the success is the larger and deeper deep neural networks (DNNs) that achieved very …
key of the success is the larger and deeper deep neural networks (DNNs) that achieved very …
Pipelayer: A pipelined reram-based accelerator for deep learning
Convolution neural networks (CNNs) are the heart of deep learning applications. Recent
works PRIME [1] and ISAAC [2] demonstrated the promise of using resistive random access …
works PRIME [1] and ISAAC [2] demonstrated the promise of using resistive random access …
GraphP: Reducing communication for PIM-based graph processing with efficient data partition
…, Y Wu, K Chen, C Kozyrakis, X Qian - … Symposium on High …, 2018 - ieeexplore.ieee.org
Processing-In-Memory (PIM) is an effective technique that reduces data movements by
integrating processing units within memory. The recent advance of “big data” and 3D stacking …
integrating processing units within memory. The recent advance of “big data” and 3D stacking …
GraphR: Accelerating graph processing using ReRAM
Graph processing recently received intensive interests in light of a wide range of needs to
understand relationships. It is well-known for the poor locality and high memory bandwidth …
understand relationships. It is well-known for the poor locality and high memory bandwidth …
Sc-dcnn: Highly-scalable deep convolutional neural network using stochastic computing
With the recent advance of wearable devices and Internet of Things (IoTs), it becomes
attractive to implement the Deep Convolutional Neural Networks (DCNNs) in embedded and …
attractive to implement the Deep Convolutional Neural Networks (DCNNs) in embedded and …
CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices
Large-scale deep neural networks (DNNs) are both compute and memory intensive. As the
size of DNNs continues to grow, it is critical to improve the energy efficiency and performance …
size of DNNs continues to grow, it is critical to improve the energy efficiency and performance …
Patdnn: Achieving real-time dnn execution on mobile devices with pattern-based weight pruning
With the emergence of a spectrum of high-end mobile devices, many applications that
formerly required desktop-level computation capability are being transferred to these devices. …
formerly required desktop-level computation capability are being transferred to these devices. …
DudeTM: Building durable transactions with decoupling for persistent memory
Emerging non-volatile memory (NVM) offers non-volatility, byte-addressability and fast
access at the same time. To make the best use of these properties, it has been shown by …
access at the same time. To make the best use of these properties, it has been shown by …
Admm-nn: An algorithm-hardware co-design framework of dnns using alternating direction methods of multipliers
Model compression is an important technique to facilitate efficient embedded and hardware
implementations of deep neural networks (DNNs), a number of prior works are dedicated to …
implementations of deep neural networks (DNNs), a number of prior works are dedicated to …
Graphq: Scalable pim-based graph processing
Processing-In-Memory (PIM) architectures based on recent technology advances (eg,
Hybrid Memory Cube) demonstrate great potential for graph processing. However, existing …
Hybrid Memory Cube) demonstrate great potential for graph processing. However, existing …