default search action
Tong Geng
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j13]Yuxing Wang, Junhan Zhao, Hongye Xu, Cheng Han, Zhiqiang Tao, Dawei Zhou, Tong Geng, Dongfang Liu, Zhicheng Ji:
A systematic evaluation of computational methods for cell segmentation. Briefings Bioinform. 25(5) (2024) - [j12]Chunshu Wu, Chen Yang, Sahan Bandara, Tong Geng, Anqi Guo, Pouya Haghi, Ang Li, Martin C. Herbordt:
FPGA-Accelerated Range-Limited Molecular Dynamics. IEEE Trans. Computers 73(6): 1544-1558 (2024) - [c71]Chunshu Wu, Ruibing Song, Chuan Liu, Yunan Yang, Ang Li, Michael C. Huang, Tong Geng:
Extending Power of Nature from Binary to Real-Valued Graph Learning in Real World. ICLR 2024 - [c70]Cheng Han, Yawen Lu, Guohao Sun, James Chenhao Liang, Zhiwen Cao, Qifan Wang, Qiang Guan, Sohail A. Dianat, Raghuveer Rao, Tong Geng, Zhiqiang Tao, Dongfang Liu:
Prototypical Transformer As Unified Motion Learners. ICML 2024 - [c69]Pouya Haghi, Cheng Tan, Anqi Guo, Chunshu Wu, Dongfang Liu, Ang Li, Anthony Skjellum, Tong Geng, Martin C. Herbordt:
SmartFuse: Reconfigurable Smart Switches to Accelerate Fused Collectives in HPC Applications. ICS 2024: 413-425 - [c68]Hanxiao Li, Yonghong Song, Tong Geng:
Semi-supervised Crowd Counting Based on Hard Pseudo-labels. IJCNN 2024: 1-8 - [c67]Ruibing Song, Chunshu Wu, Chuan Liu, Ang Li, Michael C. Huang, Tong Geng:
DS-GL: Advancing Graph Learning via Harnessing Nature's Power within Scalable Dynamical Systems. ISCA 2024: 45-57 - [c66]Pouya Haghi, Chunshu Wu, Zahra Azad, Yanfei Li, Andrew Gui, Yuchen Hao, Ang Li, Tony Tong Geng:
Bridging the Gap Between LLMs and LNS with Dynamic Data Format and Architecture Codesign. MICRO 2024: 1617-1631 - [c65]Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao:
Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression. SC 2024: 89 - [c64]Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin J. Barker, Ang Li:
Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs. ICPE (Companion) 2024: 14-20 - [i37]Yanfei Li, Juejing Liu, Xiaodong Zhao, Wenjun Liu, Tong Geng, Ang Li, Xin Zhang:
Accurate and Data-Efficient Micro-XRD Phase Identification Using Multi-Task Learning: Application to Hydrothermal Fluids. CoRR abs/2403.10042 (2024) - [i36]Cheng Han, Yawen Lu, Guohao Sun, James Chenhao Liang, Zhiwen Cao, Qifan Wang, Qiang Guan, Sohail A. Dianat, Raghuveer M. Rao, Tong Geng, Zhiqiang Tao, Dongfang Liu:
Prototypical Transformer as Unified Motion Learners. CoRR abs/2406.01559 (2024) - [i35]Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao:
Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression. CoRR abs/2407.04272 (2024) - [i34]Mingkai Chen, Taowen Wang, James Chenhao Liang, Chuan Liu, Chunshu Wu, Qifan Wang, Ying Nian Wu, Michael Huang, Chuang Ren, Ang Li, Tong Geng, Dongfang Liu:
Inertial Confinement Fusion Forecasting via LLMs. CoRR abs/2407.11098 (2024) - [i33]Chuan Liu, Chunshu Wu, Shihui Cao, Mingkai Chen, James Chenhao Liang, Ang Li, Michael Huang, Chuang Ren, Dongfang Liu, Ying Nian Wu, Tong Geng:
Diff-PIC: Revolutionizing Particle-In-Cell Simulation for Advancing Nuclear Fusion with Diffusion Models. CoRR abs/2408.02693 (2024) - [i32]Runjia Zeng, Cheng Han, Qifan Wang, Chunshu Wu, Tong Geng, Lifu Huang, Ying Nian Wu, Dongfang Liu:
Visual Fourier Prompt Tuning. CoRR abs/2411.01327 (2024) - 2023
- [j11]Wei Sun, Ang Li, Tong Geng, Sander Stuijk, Henk Corporaal:
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors. IEEE Trans. Parallel Distributed Syst. 34(1): 246-261 (2023) - [c63]Zhenyu Pan, Anshujit Sharma, Jerry Yao-Chieh Hu, Zhuo Liu, Ang Li, Han Liu, Michael C. Huang, Tong Geng:
Ising-Traffic: Using Ising Machine Learning to Predict Traffic Congestion under Uncertainty. AAAI 2023: 9354-9363 - [c62]Yawen Lu, Qifan Wang, Siqi Ma, Tong Geng, Yingjie Victor Chen, Huaijin G. Chen, Dongfang Liu:
TransFlow: Transformer as Flow Learner. CVPR 2023: 18063-18073 - [c61]Zhuo Liu, Yunan Yang, Zhenyu Pan, Anshujit Sharma, Amit Hasan, Caiwen Ding, Ang Li, Michael C. Huang, Tong Geng:
Ising-CF: A Pathbreaking Collaborative Filtering Method Through Efficient Ising Machine Learning. DAC 2023: 1-6 - [c60]Yixuan Luo, Cheng Tan, Nicolas Bohm Agostini, Ang Li, Antonino Tumeo, Nirav Dave, Tong Geng:
ML-CGRA: An Integrated Compilation Framework to Enable Efficient Machine Learning Acceleration on CGRAs. DAC 2023: 1-6 - [c59]Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding:
PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment. DAC 2023: 1-6 - [c58]Xi Xie, Hongwu Peng, Amit Hasan, Shaoyi Huang, Jiahui Zhao, Haowen Fang, Wei Zhang, Tong Geng, Omer Khan, Caiwen Ding:
Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks. ICCAD 2023: 1-9 - [c57]Anqi Guo, Yuchen Hao, Chunshu Wu, Pouya Haghi, Zhenyu Pan, Min Si, Dingwen Tao, Ang Li, Martin C. Herbordt, Tong Geng:
Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training. ICS 2023: 336-347 - [c56]Pouya Haghi, William Krska, Cheng Tan, Tong Geng, Po-Hao Chen, Connor Greenwood, Anqi Guo, Thomas M. Hines, Chunshu Wu, Ang Li, Anthony Skjellum, Martin C. Herbordt:
FLASH: FPGA-Accelerated Smart Switches with GCN Case Study. ICS 2023: 450-462 - [c55]Uday Kumar Reddy Vengalam, Yongchao Liu, Tong Geng, Hui Wu, Michael C. Huang:
Supporting Energy-based Learning with an Ising Machine substrate: a Case Study on RBM. MICRO 2023: 465-478 - [c54]James Liang, Yiming Cui, Qifan Wang, Tong Geng, Wenguan Wang, Dongfang Liu:
ClusterFomer: Clustering As A Universal Visual Learner. NeurIPS 2023 - [c53]Hongwu Peng, Ran Ran, Yukui Luo, Jiahui Zhao, Shaoyi Huang, Kiran Thorat, Tong Geng, Chenghong Wang, Xiaolin Xu, Wujie Wen, Caiwen Ding:
LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference. NeurIPS 2023 - [c52]Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Kevin J. Barker, Ang Li, Yufei Ding:
MGG: Accelerating Graph Neural Networks with Fine-Grained Intra-Kernel Communication-Computation Pipelining on Multi-GPU Platforms. OSDI 2023: 779-795 - [c51]Chunshu Wu, Tong Geng, Anqi Guo, Sahan Bandara, Pouya Haghi, Chuan Liu, Ang Li, Martin C. Herbordt:
FASDA: An FPGA-Aided, Scalable and Distributed Accelerator for Range-Limited Molecular Dynamics. SC 2023: 98:1-98:14 - [i31]Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Shaoyi Huang, Xi Xie, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding:
RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference. CoRR abs/2302.02292 (2023) - [i30]Xiaodong Zhao, YiXuan Luo, Juejing Liu, Wenjun Liu, Kevin M. Rosso, Xiaofeng Guo, Tong Geng, Ang Li, Xin Zhang:
Machine Learning Automated Approach for Enormous Synchrotron X-Ray Diffraction Data Interpretation. CoRR abs/2303.10881 (2023) - [i29]Uday Kumar Reddy Vengalam, Yongchao Liu, Tong Geng, Hui Wu, Michael C. Huang:
Supporting Energy-Based Learning With An Ising Machine Substrate: A Case Study on RBM. CoRR abs/2304.02525 (2023) - [i28]Yawen Lu, Qifan Wang, Siqi Ma, Tong Geng, Yingjie Victor Chen, Huaijin G. Chen, Dongfang Liu:
TransFlow: Transformer as Flow Learner. CoRR abs/2304.11523 (2023) - [i27]Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding:
PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment. CoRR abs/2306.15513 (2023) - [i26]Xi Xie, Hongwu Peng, Amit Hasan, Shaoyi Huang, Jiahui Zhao, Haowen Fang, Wei Zhang, Tong Geng, Omer Khan, Caiwen Ding:
Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks. CoRR abs/2308.11825 (2023) - [i25]James Chenhao Liang, Yiming Cui, Qifan Wang, Tong Geng, Wenguan Wang, Dongfang Liu:
ClusterFormer: Clustering As A Universal Visual Learner. CoRR abs/2309.13196 (2023) - [i24]Hongwu Peng, Ran Ran, Yukui Luo, Jiahui Zhao, Shaoyi Huang, Kiran Thorat, Tong Geng, Chenghong Wang, Xiaolin Xu, Wujie Wen, Caiwen Ding:
LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference. CoRR abs/2309.14331 (2023) - [i23]Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin J. Barker, Ang Li:
Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs. CoRR abs/2311.04417 (2023) - 2022
- [j10]Pouya Haghi, Anqi Guo, Qingqing Xiong, Chen Yang, Tong Geng, Justin T. Broaddus, Ryan J. Marshall, Derek Schafer, Anthony Skjellum, Martin C. Herbordt:
Reconfigurable switches for high performance and flexible MPI collectives. Concurr. Comput. Pract. Exp. 34(6) (2022) - [j9]Xiao Li, Shengkai Zhang, Tong Geng, Jiaxing Li, Benxin Zhu, Laixing Liu, Feng Xiao:
An improved algorithm for extracting crossovers of satellite ground tracks. Comput. Geosci. 166: 105179 (2022) - [j8]Shengkai Zhang, Tong Geng, Chaohui Zhu, Jiaxing Li, Xiao Li, Benxin Zhu, Laixing Liu, Feng Xiao:
Arctic Sea Ice Freeboard Estimation and Variations From Operation IceBridge. IEEE Trans. Geosci. Remote. Sens. 60: 1-10 (2022) - [c50]Hongwu Peng, Shaoyi Huang, Shiyang Chen, Bingbing Li, Tong Geng, Ang Li, Weiwen Jiang, Wujie Wen, Jinbo Bi, Hang Liu, Caiwen Ding:
A length adaptive algorithm-hardware co-design of transformer on FPGA through sparse attention and dynamic pipelining. DAC 2022: 1135-1140 - [c49]Anqi Guo, Tong Geng, Yongan Zhang, Pouya Haghi, Chunshu Wu, Cheng Tan, Yingyan Lin, Ang Li, Martin C. Herbordt:
FCsN: A FPGA-Centric SmartNIC Framework for Neural Networks. FCCM 2022: 1-2 - [c48]Anqi Guo, Tong Geng, Yongan Zhang, Pouya Haghi, Chunshu Wu, Cheng Tan, Yingyan Lin, Ang Li, Martin C. Herbordt:
A Framework for Neural Network Inference on FPGA-Centric SmartNICs. FPL 2022: 1-8 - [c47]Chunshu Wu, Sahan Bandara, Tong Geng, Anqi Guo, Pouya Haghi, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
Optimized Mappings for Symmetric Range-Limited Molecular Force Calculations on FPGAs. FPL 2022: 101-108 - [c46]Chengming Zhang, Tong Geng, Anqi Guo, Jiannan Tian, Martin C. Herbordt, Ang Li, Dingwen Tao:
H-GCN: A Graph Convolutional Network Accelerator on Versal ACAP Architecture. FPL 2022: 200-208 - [c45]Cheng Tan, Nicolas Bohm Agostini, Tong Geng, Chenhao Xie, Jiajia Li, Ang Li, Kevin J. Barker, Antonino Tumeo:
DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications on CGRAs. HPCA 2022: 304-316 - [c44]Haoran You, Tong Geng, Yongan Zhang, Ang Li, Yingyan Lin:
GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design. HPCA 2022: 460-474 - [c43]Po-Hao Chen, Pouya Haghi, Jae Yoon Chung, Tong Geng, Richard West, Anthony Skjellum, Martin C. Herbordt:
The Viability of Using Online Prediction to Perform Extra Work while Executing BSP Applications. HPEC 2022: 1-7 - [c42]Deniz Gurevin, Mohsin Shan, Tong Geng, Weiwen Jiang, Caiwen Ding, Omer Khan:
Towards Real-Time Temporal Graph Learning. ICCD 2022: 263-271 - [c41]Hongwu Peng, Deniz Gurevin, Shaoyi Huang, Tong Geng, Weiwen Jiang, Omer Khan, Caiwen Ding:
Towards Sparsification of Graph Neural Networks. ICCD 2022: 272-279 - [c40]Yixuan Luo, Payman Behnam, Kiran Thorat, Zhuo Liu, Hongwu Peng, Shaoyi Huang, Shu Zhou, Omer Khan, Alexey Tumanov, Caiwen Ding, Tong Geng:
CoDG-ReRAM: An Algorithm-Hardware Co-design to Accelerate Semi-Structured GNNs on ReRAM. ICCD 2022: 280-289 - [c39]Zhirui Hu, Jinyang Li, Zhenyu Pan, Shanglin Zhou, Lei Yang, Caiwen Ding, Omer Khan, Tong Geng, Weiwen Jiang:
On the Design of Quantum Graph Convolutional Neural Network in the NISQ-Era and Beyond. ICCD 2022: 290-297 - [c38]Cheng Tan, Thierry Tambe, Jeff Jun Zhang, Bo Fang, Tong Geng, Gu-Yeon Wei, David Brooks, Antonino Tumeo, Ganesh Gopalakrishnan, Ang Li:
ASAP: automatic synthesis of area-efficient and precision-aware CGRAs. ICS 2022: 4:1-4:13 - [c37]Chengming Zhang, Sian Jin, Tong Geng, Jiannan Tian, Ang Li, Dingwen Tao:
CEAZ: accelerating parallel I/O via hardware-algorithm co-designed adaptive lossy compression. ICS 2022: 12:1-12:13 - [i22]Tong Geng, Chunshu Wu, Yongan Zhang, Cheng Tan, Chenhao Xie, Haoran You, Martin C. Herbordt, Yingyan Lin, Ang Li:
I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization. CoRR abs/2203.03606 (2022) - [i21]Wei Sun, Ang Li, Tong Geng, Sander Stuijk, Henk Corporaal:
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numerical Behaviors. CoRR abs/2206.02874 (2022) - [i20]Yanfei Li, Tong Geng, Samuel Alexander Stein, Ang Li, Huimin Yu:
GAAF: Searching Activation Functions for Binary Neural Networks through Genetic Algorithm. CoRR abs/2206.03291 (2022) - [i19]Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Ang Li, Yufei Ding:
GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing. CoRR abs/2206.08482 (2022) - [i18]Chengming Zhang, Tong Geng, Anqi Guo, Jiannan Tian, Martin C. Herbordt, Ang Li, Dingwen Tao:
H-GCN: A Graph Convolutional Network Accelerator on Versal ACAP Architecture. CoRR abs/2206.13734 (2022) - [i17]Hongwu Peng, Shaoyi Huang, Shiyang Chen, Bingbing Li, Tong Geng, Ang Li, Weiwen Jiang, Wujie Wen, Jinbo Bi, Hang Liu, Caiwen Ding:
A Length Adaptive Algorithm-Hardware Co-design of Transformer on FPGA Through Sparse Attention and Dynamic Pipelining. CoRR abs/2208.03646 (2022) - [i16]Hongwu Peng, Deniz Gurevin, Shaoyi Huang, Tong Geng, Weiwen Jiang, Omer Khan, Caiwen Ding:
Towards Sparsification of Graph Neural Networks. CoRR abs/2209.04766 (2022) - [i15]Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Kevin J. Barker, Ang Li, Yufei Ding:
Empowering GNNs with Fine-grained Communication-Computation Pipelining on Multi-GPU Platforms. CoRR abs/2209.06800 (2022) - [i14]Deniz Gurevin, Mohsin Shan, Tong Geng, Weiwen Jiang, Caiwen Ding, Omer Khan:
Towards Real-Time Temporal Graph Learning. CoRR abs/2210.04114 (2022) - 2021
- [b1]Tong Geng:
FPGA-based high-performance neural network acceleration. Boston University, USA, 2021 - [j7]Yanfei Li, Tong Geng, Ang Li, Huimin Yu:
BCNN: Binary complex neural network. Microprocess. Microsystems 87: 104359 (2021) - [j6]Shengkai Zhang, Yue Xuan, Jiaxing Li, Tong Geng, Xiao Li, Feng Xiao:
Arctic Sea Ice Freeboard Retrieval from Envisat Altimetry Data. Remote. Sens. 13(8): 1414 (2021) - [j5]Tong Geng, Shengkai Zhang, Feng Xiao, Jiaxing Li, Yue Xuan, Xiao Li, Fei Li:
DEM Generation with ICESat-2 Altimetry Data for the Three Antarctic Ice Shelves: Ross, Filchner-Ronne and Amery. Remote. Sens. 13(24): 5137 (2021) - [j4]Tong Geng, Ang Li, Tianqi Wang, Chunshu Wu, Yanfei Li, Runbin Shi, Wei Wu, Martin C. Herbordt:
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference. IEEE Trans. Parallel Distributed Syst. 32(1): 199-213 (2021) - [j3]Cheng Tan, Chenhao Xie, Tong Geng, Andres Marquez, Antonino Tumeo, Kevin J. Barker, Ang Li:
ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing. IEEE Trans. Parallel Distributed Syst. 32(12): 2880-2892 (2021) - [c36]Tong Geng, Xiliang Lin, Harikesh S. Nair, Jun Hao, Bin Xiang, Shurui Fan:
Comparison Lift: Bandit-based Experimentation System for Online Advertising. AAAI 2021: 15117-15126 - [c35]Hongwu Peng, Shanglin Zhou, Scott Weitze, Jiaxin Li, Sahidul Islam, Tong Geng, Ang Li, Wei Zhang, Minghu Song, Mimi Xie, Hang Liu, Caiwen Ding:
Binary Complex Neural Network Acceleration on FPGA : (Invited Paper). ASAP 2021: 85-92 - [c34]Cheng Tan, Nicolas Bohm Agostini, Jeff Zhang, Marco Minutoli, Vito Giovanni Castellana, Chenhao Xie, Tong Geng, Ang Li, Kevin J. Barker, Antonino Tumeo:
OpenCGRA: Democratizing Coarse-Grained Reconfigurable Arrays. ASAP 2021: 149-155 - [c33]Chunshu Wu, Tong Geng, Sahan Bandara, Chen Yang, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
Upgrade of FPGA Range-Limited Molecular Dynamics to Handle Hundreds of Processors. FCCM 2021: 142-151 - [c32]Tong Geng, Chunshu Wu, Cheng Tan, Chenhao Xie, Anqi Guo, Pouya Haghi, Sarah Yuan He, Jiajia Li, Martin C. Herbordt, Ang Li:
A Survey: Handling Irregularities in Neural Network Acceleration with FPGAs. HPEC 2021: 1-8 - [c31]Pouya Haghi, Anqi Guo, Tong Geng, Anthony Skjellum, Martin C. Herbordt:
Workload Imbalance in HPC Applications: Effect on Performance of In-Network Processing. HPEC 2021: 1-8 - [c30]Chunshu Wu, Sahan Bandara, Tong Geng, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
System-Level Modeling of GPU/FPGA Clusters for Molecular Dynamics Simulations. HPEC 2021: 1-8 - [c29]Daniel Manu, Yi Sheng, Junhuan Yang, Jieren Deng, Tong Geng, Ang Li, Caiwen Ding, Weiwen Jiang, Lei Yang:
FL-DISCO: Federated Generative Adversarial Network for Graph-based Molecule Drug Discovery: Special Session Paper. ICCAD 2021: 1-7 - [c28]Hongwu Peng, Shiyang Chen, Zhepeng Wang, Junhuan Yang, Scott A. Weitze, Tong Geng, Ang Li, Jinbo Bi, Minghu Song, Weiwen Jiang, Hang Liu, Caiwen Ding:
Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search (Special Session Paper). ICCAD 2021: 1-7 - [c27]Yongan Zhang, Haoran You, Yonggan Fu, Tong Geng, Ang Li, Yingyan Lin:
G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency. ICCAD 2021: 1-9 - [c26]Cheng Tan, Tong Geng, Chenhao Xie, Nicolas Bohm Agostini, Jiajia Li, Ang Li, Kevin J. Barker, Antonino Tumeo:
DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications. ICCD 2021: 33-40 - [c25]Hongwu Peng, Shaoyi Huang, Tong Geng, Ang Li, Weiwen Jiang, Hang Liu, Shusen Wang, Caiwen Ding:
Accelerating Transformer-based Deep Learning Models on FPGAs using Column Balanced Block Pruning. ISQED 2021: 142-148 - [c24]Tong Geng, Chunshu Wu, Yongan Zhang, Cheng Tan, Chenhao Xie, Haoran You, Martin C. Herbordt, Yingyan Lin, Ang Li:
I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization. MICRO 2021: 1051-1063 - [c23]Boyuan Feng, Yuke Wang, Tong Geng, Ang Li, Yufei Ding:
APNN-TC: accelerating arbitrary precision neural networks on ampere GPU tensor cores. SC 2021: 37 - [i13]Yanfei Li, Tong Geng, Ang Li, Huimin Yu:
BCNN: Binary Complex Neural Network. CoRR abs/2104.10044 (2021) - [i12]Boyuan Feng, Yuke Wang, Tong Geng, Ang Li, Yufei Ding:
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores. CoRR abs/2106.12169 (2021) - [i11]Chengming Zhang, Sian Jin, Tong Geng, Jiannan Tian, Ang Li, Dingwen Tao:
CEAZ: Accelerating Parallel I/O via Hardware-Algorithm Co-Design of Efficient and Adaptive Lossy Compression. CoRR abs/2106.13306 (2021) - [i10]Hongwu Peng, Shanglin Zhou, Scott Weitze, Jiaxin Li, Sahidul Islam, Tong Geng, Ang Li, Wei Zhang, Minghu Song, Mimi Xie, Hang Liu, Caiwen Ding:
Binary Complex Neural Network Acceleration on FPGA. CoRR abs/2108.04811 (2021) - [i9]Hongwu Peng, Shiyang Chen, Zhepeng Wang, Junhuan Yang, Scott A. Weitze, Tong Geng, Ang Li, Jinbo Bi, Minghu Song, Weiwen Jiang, Hang Liu, Caiwen Ding:
Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search. CoRR abs/2109.06355 (2021) - [i8]Yongan Zhang, Haoran You, Yonggan Fu, Tong Geng, Ang Li, Yingyan Lin:
G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency. CoRR abs/2109.08983 (2021) - [i7]Haoran You, Tong Geng, Yongan Zhang, Ang Li, Yingyan Lin:
GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design. CoRR abs/2112.11594 (2021) - 2020
- [j2]Feng Xiao, Fei Li, Shengkai Zhang, Jiaxing Li, Tong Geng, Yue Xuan:
Estimating Arctic Sea Ice Thickness with CryoSat-2 Altimetry Data Using the Least Squares Adjustment Method. Sensors 20(24): 7011 (2020) - [j1]Tianqi Wang, Tong Geng, Ang Li, Xi Jin, Martin C. Herbordt:
FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters. IEEE Trans. Computers 69(8): 1143-1158 (2020) - [c22]Tong Geng, Xiliang Lin, Harikesh S. Nair:
Online Evaluation of Audiences for Targeted Advertising via Bandit Experiments. AAAI 2020: 13273-13279 - [c21]Pouya Haghi, Tong Geng, Anqi Guo, Tianqi Wang, Martin C. Herbordt:
FP-AMG: FPGA-Based Acceleration Framework for Algebraic Multigrid Solvers. FCCM 2020: 148-156 - [c20]Tong Geng, Chunshu Wu, Cheng Tan, Bo Fang, Ang Li, Martin C. Herbordt:
CQNN: a CGRA-based QNN Framework. HPEC 2020: 1-7 - [c19]Pouya Haghi, Anqi Guo, Qingqing Xiong, Rushi Patel, Chen Yang, Tong Geng, Justin T. Broaddus, Ryan J. Marshall, Anthony Skjellum, Martin C. Herbordt:
FPGAs in the Network and Novel Communicator Support Accelerate MPI Collectives. HPEC 2020: 1-10 - [c18]Chunshu Wu, Tong Geng, Chen Yang, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
A Communication-Efficient Multi-Chip Design for Range-Limited Molecular Dynamics. HPEC 2020: 1-8 - [c17]Pouya Haghi, Anqi Guo, Tong Geng, Justin T. Broaddus, Derek Schafer, Anthony Skjellum, Martin C. Herbordt:
A Reconfigurable Compute-in-the-Network FPGA Assistant for High-Level Collective Support with Distributed Matrix Multiply Case Study. FPT 2020: 159-164 - [c16]Runbin Shi, Peiyan Dong, Tong Geng, Yuhao Ding, Xiaolong Ma, Hayden Kwok-Hay So, Martin C. Herbordt, Ang Li, Yanzhi Wang:
CSB-RNN: a faster-than-realtime RNN acceleration framework with compressed structured blocks. ICS 2020: 24:1-24:12 - [c15]Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steven K. Reinhardt, Martin C. Herbordt:
AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing. MICRO 2020: 922-936 - [i6]Runbin Shi, Peiyan Dong, Tong Geng, Yuhao Ding, Xiaolong Ma, Hayden Kwok-Hay So, Martin C. Herbordt, Ang Li, Yanzhi Wang:
CSB-RNN: A Faster-than-Realtime RNN Acceleration Framework with Compressed Structured Blocks. CoRR abs/2005.05758 (2020) - [i5]Tong Geng, Xiliang Lin, Harikesh S. Nair, Jun Hao, Bin Xiang, Shurui Fan:
Comparison Lift: Bandit-based Experimentation System for Online Advertising. CoRR abs/2009.07899 (2020)
2010 – 2019
- 2019
- [c14]Tong Geng, Tianqi Wang, Chunshu Wu, Chen Yang, Shuaiwen Leon Song, Ang Li, Martin C. Herbordt:
LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism. ASAP 2019: 9-16 - [c13]Tianqi Wang, Tong Geng, Xi Jin, Martin C. Herbordt:
Accelerating AP3M-Based Computational Astrophysics Simulations with Reconfigurable Clusters. ASAP 2019: 181-184 - [c12]Chen Yang, Tong Geng, Tianqi Wang, Charles Lin, Jiayi Sheng, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
Molecular Dynamics Range-Limited Force Evaluation Optimized for FPGAs. ASAP 2019: 263-271 - [c11]Tianqi Wang, Tong Geng, Xi Jin, Martin C. Herbordt:
FP-AMR: A Reconfigurable Fabric Framework for Adaptive Mesh Refinement Applications. FCCM 2019: 245-253 - [c10]Qingqing Xiong, Rushi Patel, Chen Yang, Tong Geng, Anthony Skjellum, Martin C. Herbordt:
GhostSZ: A Transparent FPGA-Accelerated Lossy Compression Framework. FCCM 2019: 258-266 - [c9]Tong Geng, Tianqi Wang, Chunshu Wu, Chen Yang, Wei Wu, Ang Li, Martin C. Herbordt:
O3BNN: an out-of-order architecture for high-performance binarized neural network inference with fine-grained pruning. ICS 2019: 461-472 - [c8]Ang Li, Tong Geng, Tianqi Wang, Martin C. Herbordt, Shuaiwen Leon Song, Kevin J. Barker:
BSTC: a novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets. SC 2019: 38:1-38:30 - [c7]Chen Yang, Tong Geng, Tianqi Wang, Rushi Patel, Qingqing Xiong, Ahmed Sanaullah, Chunshu Wu, Jiayi Sheng, Charles Lin, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
Fully integrated FPGA molecular dynamics simulations. SC 2019: 67:1-67:31 - [i4]Tong Geng, Tianqi Wang, Ang Li, Xi Jin, Martin C. Herbordt:
A Scalable Framework for Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters with Weight and Workload Balancing. CoRR abs/1901.01007 (2019) - [i3]Chen Yang, Tong Geng, Tianqi Wang, Rushi Patel, Qingqing Xiong, Ahmed Sanaullah, Jiayi Sheng, Charles Lin, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
Fully Integrated On-FPGA Molecular Dynamics Simulations. CoRR abs/1905.05359 (2019) - [i2]Tong Geng, Xiliang Lin, Harikesh S. Nair:
Online Evaluation of Audiences for Targeted Advertising via Bandit Experiments. CoRR abs/1907.02178 (2019) - [i1]Tong Geng, Ang Li, Tianqi Wang, Chunshu Wu, Yanfei Li, Antonino Tumeo, Martin C. Herbordt:
UWB-GCN: Hardware Acceleration of Graph-Convolution-Network through Runtime Workload Rebalancing. CoRR abs/1908.10834 (2019) - 2018
- [c6]Tong Geng, Tianqi Wang, Ahmed Sanaullah, Chen Yang, Rui Xu, Rushi Patel, Martin C. Herbordt:
FPDeep: Acceleration and Load Balancing of CNN Training on FPGA Clusters. FCCM 2018: 81-84 - [c5]Tong Geng, Tianqi Wang, Ahmed Sanaullah, Chen Yang, Rushi Patel, Martin C. Herbordt:
A Framework for Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters with Work and Weight Load Balancing. FPL 2018: 394-398 - [c4]Tong Geng, Erkan Diken, Tianqi Wang, Lech Józwiak, Martin C. Herbordt:
An Access-Pattern-Aware On-Chip Vector Memory System with Automatic Loading for SIMD Architectures. HPEC 2018: 1-7 - [c3]Zikun Xiang, Tianqi Wang, Tong Geng, Tian Xiang, Xi Jin, Martin C. Herbordt:
Soft-Core. Multiple-Lane, FPGA-based ADCs for a Liquid Helium Environment. HPEC 2018: 1-6 - 2016
- [c2]Tong Geng, Luc Waeijen, Maurice Peemen, Henk Corporaal, Yifan He:
MacSim: A MAC-Enabled High-Performance Low-Power SIMD Architecture. DSD 2016: 160-167 - [c1]Yifan He, Maurice Peemen, Luc Waeijen, Erkan Diken, Mattia Fiumara, Gerard K. Rauwerda, Henk Corporaal, Tong Geng:
A configurable SIMD architecture with explicit datapath for intelligent learning. SAMOS 2016: 156-163
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-13 01:04 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint