default search action
Cong Guo 0003
Person information
- unicode name: 郭聪
- affiliation: Shanghai Jiao Tong University, Department of Computer Science and Engineering, China
Other persons with the same name
- Cong Guo — disambiguation page
- Cong Guo 0001 — Beijing Institute of Technology, School of Computer Science and Technology, Beijing Engineering Research Center of Massive Language Information Processing and Cloud Computing Application, China
- Cong Guo 0002
— University of Science and Technology of China, CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application Systems, Hefei, China
- Cong Guo 0004 — Peking University, School of EECS, MoE Key Laboratory of Machine Perception, China
- Cong Guo 0005 — University of Waterloo, Cheriton School of Computer Science, ON, Canada
- Cong Guo 0006 — Tsinghua University, Department of Electronic Engineering, Beijing, China
- Cong Guo 0007
— Chongqing University, State Key Laboratory of Mechanical Transmissions, School of Automotive Engineering, China
- Cong Guo 0008
— Southwest University, College of Artificial Intelligence, Chongqing, China
- Cong Guo 0009 — Guilin University of Electronic Technology, School of Mathematics and Computing Science, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
- [j4]Yangjie Zhou
, Zhihui Zhang
, Shuwen Lu, Cong Guo
, Jingwen Leng
, Feng Zhang, Yufei Ma, Yun Liang
, Minyi Guo
:
A Full-Stack Framework for GNN Acceleration via Partition-Compiler-Architecture Co-Design. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 45(5): 2348-2361 (2026) - [c24]Haoxuan Shan, Cong Guo, Chiyue Wei, Feng Cheng, Junyao Zhang, Hai Helen Li, Yiran Chen:
Platinum: Path-Adaptable LUT-Based Accelerator Tailored for Low-Bit Weight Matrix Multiplication. ASP-DAC 2026: 1449-1455 - [c23]Weiming Hu
, Zihan Zhang
, Haoyan Zhang
, Chen Zhang
, Cong Guo
, Yu Feng
, Tianchi Hu
, Guanglin Li
, Guipeng Hu
, Junsong Wang
, Jingwen Leng
:
M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization. ASPLOS (2) 2026: 1151-1167 - [c22]Yuzhe Fu, Changchun Zhou, Hancheng Ye, Bowen Duan
, Qiyu Huang, Chiyue Wei, Cong Guo, Hai Helen Li, Yiran Chen:
FractalCloud: A Fractal-Inspired Architecture for Efficient Large-Scale Point Cloud Processing. HPCA 2026: 1-15 - [c21]Chiyue Wei, Cong Guo, Junyao Zhang, Haoxuan Shan, Yifan Xu
, Ziyue Zhang
, Yudong Liu, Qinsi Wang, Changchun Zhou, Hai Helen Li, Yiran Chen:
Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models. HPCA 2026: 1-18 - [i33]Weiming Hu, Zihan Zhang, Haoyan Zhang, Chen Zhang, Cong Guo, Yu Feng, Tianchi Hu, Guanglin Li, Guipeng Hu, Junsong Wang, Jingwen Leng:
M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization. CoRR abs/2601.19213 (2026) - 2025
- [j3]Yiran Chen
, Cong Guo
, Yintao He
, Mingyuan Ma, Tergel Molom-Ochir
, Nicky Ramos, Haoxuan Shan, Chiyue Wei
, Hai Li
:
Circuits to Systems: Codesigning Efficient AI Hardware. IEEE Des. Test 42(6): 54-62 (2025) - [j2]Chen Zhang
, Yang Wang
, Zhiqiang Xie
, Cong Guo
, Yunxin Liu
, Jingwen Leng
, Zhigang Ji
, Yuan Xie
, Ru Huang:
DSTC: Dual-Side Sparse Tensor Core for DNNs Acceleration on Modern GPU Architectures. IEEE Trans. Computers 74(2): 341-355 (2025) - [c20]Chiyue Wei, Cong Guo, Feng Cheng, Shiyu Li, Hao (Frank) Yang, Hai Helen Li, Yiran Chen:
Prosperity: Accelerating Spiking Neural Networks via Product Sparsity. HPCA 2025: 806-820 - [c19]Weiming Hu, Haoyan Zhang
, Cong Guo, Yu Feng, Renyang Guan, Zhendong Hua, Zihan Liu, Yue Guan, Minyi Guo, Jingwen Leng:
M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type. HPCA 2025: 1112-1126 - [c18]Zihan Liu
, Xinhao Luo, Junxian Guo, Wentao Ni, Yangjie Zhou, Yue Guan, Cong Guo, Weihao Cui, Yu Feng, Minyi Guo, Yuhao Zhu, Minjia Zhang, Chen Jin, Jingwen Leng:
VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference. HPCA 2025: 1496-1509 - [c17]Feng Cheng
, Cong Guo
, Chiyue Wei
, Junyao Zhang
, Changchun Zhou
, Edward Hanson
, Jiaqi Zhang
, Xiaoxiao Liu
, Hai Li
, Yiran Chen
:
Ecco: Improving Memory Bandwidth and Capacity for LLMs via Entropy-Aware Cache Compression. ISCA 2025: 793-807 - [c16]Chiyue Wei
, Bowen Duan
, Cong Guo
, Jingyang Zhang
, Qingyue Song
, Hai Li
, Yiran Chen
:
Phi: Leveraging Pattern-based Hierarchical Sparsity for High-Efficiency Spiking Neural Networks. ISCA 2025: 930-943 - [c15]Cong Guo
, Chiyue Wei
, Jiaming Tang
, Bowen Duan
, Song Han
, Hai Li
, Yiran Chen
:
Transitive Array: An Efficient GEMM Accelerator with Result Reuse. ISCA 2025: 990-1004 - [c14]Yangjie Zhou
, Honglin Zhu
, Qian Qiu
, Weihao Cui
, Zihan Liu
, Peng Chen
, Mohamed Wahib
, Cong Guo
, Siyuan Feng
, Jintao Meng
, Haidong Lan
, Jingwen Leng
, Yun Lin
, Jin Song Dong
, Wenxi Zhu
, Minwen Deng
:
A Sample-Free Compilation Framework for Efficient Dynamic Tensor Computation. SC 2025: 167-184 - [i32]Mark Horton, Tergel Molom-Ochir, Peter Liu, Bhavna Gopal, Chiyue Wei, Cong Guo, Brady Taylor, Deliang Fan, Shan X. Wang, Hai Li, Yiran Chen:
Hamming Attention Distillation: Binarizing Keys and Queries for Efficient Long-Context Transformers. CoRR abs/2502.01770 (2025) - [i31]Weiming Hu, Haoyan Zhang, Cong Guo, Yu Feng, Renyang Guan, Zhendong Hua, Zihan Liu, Yue Guan, Minyi Guo, Jingwen Leng:
M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type. CoRR abs/2502.18755 (2025) - [i30]Zihan Liu, Xinhao Luo, Junxian Guo, Wentao Ni, Yangjie Zhou, Yue Guan, Cong Guo, Weihao Cui, Yu Feng, Minyi Guo, Yuhao Zhu, Minjia Zhang, Jingwen Leng, Chen Jin:
VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference. CoRR abs/2503.02236 (2025) - [i29]Chiyue Wei, Cong Guo, Feng Cheng, Shiyu Li, Hao (Frank) Yang, Hai Helen Li, Yiran Chen:
Prosperity: Accelerating Spiking Neural Networks via Product Sparsity. CoRR abs/2503.03379 (2025) - [i28]Cong Guo, Chiyue Wei, Jiaming Tang, Bowen Duan, Song Han, Hai Li, Yiran Chen:
Transitive Array: An Efficient GEMM Accelerator with Result Reuse. CoRR abs/2504.16339 (2025) - [i27]Feng Cheng, Cong Guo, Chiyue Wei, Junyao Zhang, Changchun Zhou, Edward Hanson, Jiaqi Zhang, Xiaoxiao Liu, Hai Helen Li, Yiran Chen:
Ecco: Improving Memory Bandwidth and Capacity for LLMs via Entropy-aware Cache Compression. CoRR abs/2505.06901 (2025) - [i26]Chiyue Wei, Bowen Duan, Cong Guo, Jingyang Zhang, Qingyue Song, Hai Helen Li, Yiran Chen:
Phi: Leveraging Pattern-based Hierarchical Sparsity for High-Efficiency Spiking Neural Networks. CoRR abs/2505.10909 (2025) - [i25]Jiale Xu, Rui Zhang, Yi Xiong, Cong Guo, Zihan Liu, Yangjie Zhou, Weiming Hu, Hao Wu, Changxu Shao, Ziqing Wang, Yongjie Yuan, Junping Zhao, Minyi Guo, Jingwen Leng:
eLLM: Elastic Memory Management Framework for Efficient LLM Serving. CoRR abs/2506.15155 (2025) - [i24]Linshen Liu, Boyan Su, Junyue Jiang, Guanlin Wu, Cong Guo, Ceyu Xu, Hao (Frank) Yang
:
Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge. CoRR abs/2507.04123 (2025) - [i23]Xinhua Chen, Sitao Huang, Cong Guo, Chiyue Wei, Yintao He, Jianyi Zhang, Hai Helen Li, Yiran Chen:
DPad: Efficient Diffusion Language Models with Suffix Dropout. CoRR abs/2508.14148 (2025) - [i22]Yuzhe Fu, Changchun Zhou, Hancheng Ye, Bowen Duan, Qiyu Huang, Chiyue Wei, Cong Guo, Hai Helen Li, Yiran Chen:
FractalCloud: A Fractal-Inspired Architecture for Efficient Large-Scale Point Cloud Processing. CoRR abs/2511.07665 (2025) - [i21]Tergel Molom-Ochir, Benjamin F. Morris III, Mark Horton, Chiyue Wei, Cong Guo, Brady Taylor, Peter Liu, Shan X. Wang, Deliang Fan, Hai Helen Li, Yiran Chen:
CAMformer: Associative Memory is All You Need. CoRR abs/2511.19740 (2025) - [i20]Haoxuan Shan, Cong Guo, Chiyue Wei, Feng Cheng, Junyao Zhang, Hai Li, Yiran Chen:
Platinum: Path-Adaptable LUT-Based Accelerator Tailored for Low-Bit Weight Matrix Multiplication. CoRR abs/2511.21910 (2025) - [i19]Chiyue Wei, Cong Guo, Junyao Zhang, Haoxuan Shan, Yifan Xu, Ziyue Zhang, Yudong Liu, Qinsi Wang, Changchun Zhou, Hai (Helen) Li, Yiran Chen:
Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models. CoRR abs/2512.14661 (2025) - 2024
- [j1]Cong Guo
, Fengchen Xue
, Jingwen Leng
, Yuxian Qiu
, Yue Guan
, Weihao Cui
, Quan Chen
, Minyi Guo
:
Accelerating Sparse DNNs Based on Tiled GEMM. IEEE Trans. Computers 73(5): 1275-1289 (2024) - [c13]Cong Guo
, Rui Zhang
, Jiale Xu
, Jingwen Leng
, Zihan Liu
, Ziyu Huang
, Minyi Guo
, Hao Wu
, Shouren Zhao
, Junping Zhao
, Ke Zhang
:
GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching. ASPLOS (2) 2024: 450-466 - [c12]Zihan Liu
, Wentao Ni
, Jingwen Leng
, Yu Feng
, Cong Guo
, Quan Chen
, Chao Li
, Minyi Guo
, Yuhao Zhu
:
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping. ASPLOS (2) 2024: 549-565 - [i18]Cong Guo
, Rui Zhang, Jiale Xu, Jingwen Leng, Zihan Liu, Ziyu Huang, Minyi Guo, Hao Wu, Shouren Zhao, Junping Zhao, Ke Zhang:
GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching. CoRR abs/2401.08156 (2024) - [i17]Cong Guo
, Fengchen Xue, Jingwen Leng, Yuxian Qiu, Yue Guan, Weihao Cui, Quan Chen, Minyi Guo:
Accelerating Sparse DNNs Based on Tiled GEMM. CoRR abs/2402.10876 (2024) - [i16]Jiale Xu, Rui Zhang, Cong Guo
, Weiming Hu, Zihan Liu, Feiyang Wu, Yu Feng, Shixuan Sun, Changxu Shao, Yuhong Guo, Junping Zhao, Ke Zhang, Minyi Guo, Jingwen Leng:
vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving. CoRR abs/2407.15309 (2024) - [i15]Yangjie Zhou, Honglin Zhu, Qian Qiu, Weihao Cui, Zihan Liu, Cong Guo
, Siyuan Feng, Jintao Meng, Haidong Lan, Jingwen Leng, Wenxi Zhu, Minwen Deng:
Vortex: Efficient Sample-Free Dynamic Tensor Program Optimization via Hardware-aware Strategy Space Hierarchization. CoRR abs/2409.01075 (2024) - [i14]Cong Guo
, Feng Cheng, Zhixu Du, James Kiessling, Jonathan Ku, Shiyu Li, Ziru Li, Mingyuan Ma, Tergel Molom-Ochir, Benjamin Morris, Haoxuan Shan, Jingwei Sun
, Yitu Wang, Chiyue Wei, Xueying Wu, Yuhao Wu, Hao (Frank) Yang, Jingyang Zhang, Junyao Zhang, Qilin Zheng, Guanglei Zhou, Hai Li, Yiran Chen:
A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models. CoRR abs/2410.07265 (2024) - 2023
- [c11]Yangjie Zhou
, Yaoxu Song
, Jingwen Leng
, Zihan Liu
, Weihao Cui
, Zhendong Zhang
, Cong Guo
, Quan Chen
, Li Li
, Minyi Guo
:
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs. CF 2023: 52-62 - [c10]Cong Guo
, Jiaming Tang
, Weiming Hu
, Jingwen Leng
, Chen Zhang
, Fan Yang
, Yunxin Liu
, Minyi Guo
, Yuhao Zhu
:
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization. ISCA 2023: 3:1-3:15 - [i13]Cong Guo
, Jiaming Tang, Weiming Hu, Jingwen Leng, Chen Zhang, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization. CoRR abs/2304.07493 (2023) - [i12]Yangjie Zhou, Yaoxu Song, Jingwen Leng, Zihan Liu, Weihao Cui, Zhendong Zhang, Cong Guo, Quan Chen, Li Li, Minyi Guo:
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs. CoRR abs/2305.17408 (2023) - [i11]Shuwen Lu, Zhihui Zhang, Cong Guo
, Jingwen Leng, Yangjie Zhou, Minyi Guo:
Accelerating Generic Graph Neural Networks via Architecture, Compiler, Partition Method Co-Design. CoRR abs/2308.08174 (2023) - [i10]Zihan Liu, Wentao Ni, Jingwen Leng, Yu Feng, Cong Guo
, Quan Chen, Chao Li, Minyi Guo, Yuhao Zhu:
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping. CoRR abs/2312.01712 (2023) - 2022
- [c9]Cong Guo
, Yuxian Qiu, Jingwen Leng, Chen Zhang
, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo:
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training. ICCD 2022: 738-745 - [c8]Cong Guo
, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu, Minyi Guo:
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation. ICLR 2022 - [c7]Cong Guo
, Chen Zhang
, Jingwen Leng, Zihan Liu
, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization. MICRO 2022: 1414-1433 - [c6]Mustafa Tarik Sanic, Cong Guo
, Jingwen Leng, Minyi Guo, Weiyin Ma:
Towards Reliable AI Applications via Algorithm-Based Fault Tolerance on NVDLA. MSN 2022: 736-743 - [i9]Cong Guo, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu, Minyi Guo:
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation. CoRR abs/2202.07471 (2022) - [i8]Zhengyi Li, Cong Guo
, Zhanda Zhu, Yangjie Zhou, Yuxian Qiu, Xiaotian Gao, Jingwen Leng, Minyi Guo:
Efficient Activation Quantization via Adaptive Rounding Border for Post-Training Quantization. CoRR abs/2208.11945 (2022) - [i7]Cong Guo
, Chen Zhang, Jingwen Leng, Zihan Liu, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization. CoRR abs/2208.14286 (2022) - [i6]Cong Guo
, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo:
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training. CoRR abs/2209.10778 (2022) - 2021
- [c5]Yangjie Zhou, Mengtian Yang, Cong Guo
, Jingwen Leng, Yun Liang, Quan Chen, Minyi Guo, Yuhao Zhu:
Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators. IISWC 2021: 214-225 - [c4]Yang Wang, Chen Zhang
, Zhiqiang Xie, Cong Guo
, Yunxin Liu, Jingwen Leng:
Dual-side Sparse Tensor Core. ISCA 2021: 1083-1095 - [i5]Yang Wang, Chen Zhang, Zhiqiang Xie, Cong Guo, Yunxin Liu, Jingwen Leng:
Dual-side Sparse Tensor Core. CoRR abs/2105.09564 (2021) - [i4]Yangjie Zhou, Mengtian Yang, Cong Guo, Jingwen Leng, Yun Liang, Quan Chen, Minyi Guo, Yuhao Zhu:
Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators. CoRR abs/2110.03901 (2021) - 2020
- [c3]Cong Guo, Yangjie Zhou, Jingwen Leng, Yuhao Zhu, Zidong Du, Quan Chen, Chao Li, Bin Yao, Minyi Guo:
Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration. DAC 2020: 1-6 - [c2]Cong Guo
, Bo Yang Hsueh, Jingwen Leng, Yuxian Qiu, Yue Guan, Zehuan Wang, Xiaoying Jia, Xipeng Li, Minyi Guo, Yuhao Zhu:
Accelerating sparse DNN models without hardware-support via tile-wise sparsity. SC 2020: 16 - [i3]Cong Guo, Yangjie Zhou, Jingwen Leng, Yuhao Zhu, Zidong Du, Quan Chen, Chao Li, Minyi Guo, Bin Yao:
Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration. CoRR abs/2002.08326 (2020) - [i2]Cong Guo, Bo Yang Hsueh, Jingwen Leng, Yuxian Qiu, Yue Guan, Zehuan Wang, Xiaoying Jia, Xipeng Li, Minyi Guo, Yuhao Zhu:
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity. CoRR abs/2008.13006 (2020)
2010 – 2019
- 2019
- [c1]Yuxian Qiu, Jingwen Leng, Cong Guo
, Quan Chen, Chao Li, Minyi Guo, Yuhao Zhu:
Adversarial Defense Through Network Profiling Based Path Extraction. CVPR 2019: 4777-4786 - [i1]Yuxian Qiu, Jingwen Leng, Cong Guo, Quan Chen, Chao Li, Minyi Guo, Yuhao Zhu:
Adversarial Defense Through Network Profiling Based Path Extraction. CoRR abs/1904.08089 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-05-08 23:41 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint