PatDNN: Achieving real-time DNN execution on mobile devices with pattern-based weight pruning W Niu, X Ma, S Lin, S Wang, X Qian, X Lin, Y Wang, B Ren Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020 | 316 | 2020 |
SPViT: Enabling faster vision transformers via latency-aware soft token pruning Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun, X Shen, G Yuan, B Ren, ... European Conference on Computer Vision, 620-640, 2022 | 277 | 2022 |
PCONV: The missing but desirable sparsity in DNN weight pruning for real-time execution on mobile devices X Ma, FM Guo, W Niu, X Lin, J Tang, K Ma, B Ren, Y Wang Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5117-5124, 2020 | 234 | 2020 |
DNNFusion: Accelerating deep neural networks execution with advanced operator fusion W Niu, J Guan, Y Wang, G Agrawal, B Ren Proceedings of the 42nd ACM SIGPLAN International Conference on Programming …, 2021 | 200 | 2021 |
YOLObile: Real-time object detection on mobile devices via compression-compilation co-design Y Cai, H Li, G Yuan, W Niu, Y Li, X Tang, B Ren, Y Wang Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 955-963, 2021 | 135 | 2021 |
SparCL: Sparse continual learning on the edge Z Wang, Z Zhan, Y Gong, G Yuan, W Niu, T Jian, B Ren, S Ioannidis, ... Advances in Neural Information Processing Systems 35, 20366-20380, 2022 | 87 | 2022 |
Achieving on-mobile real-time super-resolution with neural architecture and pruning search Z Zhan, Y Gong, P Zhao, G Yuan, W Niu, Y Wu, T Zhang, M Jayaweera, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 72 | 2021 |
MEST: Accurate and fast memory-economic sparse training framework on the edge G Yuan, X Ma, W Niu, Z Li, Z Kong, N Liu, Y Gong, Z Zhan, C He, Q Jin, ... Advances in Neural Information Processing Systems 34, 20838-20850, 2021 | 68 | 2021 |
MEST: Accurate and fast memory-economic sparse training framework on the edge G Yuan, X Ma, W Niu, Z Li, Z Kong, N Liu, Y Gong, Z Zhan, C He, Q Jin, et al. Advances in Neural Information Processing Systems 34, 20838-20850, 2021 | 67 | 2021 |
RTMobile: Beyond real-time mobile acceleration of RNNs for speech recognition P Dong, S Wang, W Niu, C Zhang, S Lin, Z Li, Y Gong, B Ren, X Lin, ... 2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020 | 66 | 2020 |
Towards artificial general intelligence (AGI) in the Internet of Things (IoT): Opportunities and challenges F Dou, J Ye, G Yuan, Q Lu, W Niu, H Sun, L Guan, G Lu, G Mai, N Liu, ... arXiv preprint arXiv:2309.07438, 2023 | 57 | 2023 |
NPAS: A compiler-aware framework of unified network pruning and architecture search for beyond real-time mobile acceleration Z Li, G Yuan, W Niu, P Zhao, Y Li, Y Cai, X Shen, Z Zhan, Z Kong, Q Jin, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 39 | 2021 |
Compiler-aware neural architecture search for on-mobile real-time super-resolution Y Wu, Y Gong, P Zhao, Y Li, Z Zhan, W Niu, H Tang, M Qin, B Ren, ... European Conference on Computer Vision, 92-111, 2022 | 38 | 2022 |
26ms inference time for ResNet-50: Towards real-time execution of all DNNs on smartphone W Niu, X Ma, Y Wang, B Ren arXiv preprint arXiv:1905.00571, 2019 | 37 | 2019 |
Pruning parameterization with bi-level optimization for efficient semantic segmentation on the edge C Yang, P Zhao, Y Li, W Niu, J Guan, H Tang, M Qin, B Ren, X Lin, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 32 | 2023 |
A privacy-preserving-oriented DNN pruning and mobile acceleration framework Y Gong, Z Zhan, Z Li, W Niu, X Ma, W Wang, B Ren, C Ding, X Lin, X Xu, ... Proceedings of the 2020 on Great Lakes Symposium on VLSI, 119-124, 2020 | 30 | 2020 |
An image enhancing pattern-based sparsity for real-time inference on mobile devices X Ma, W Niu, T Zhang, S Liu, S Lin, H Li, W Wen, X Chen, J Tang, K Ma, ... Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 30 | 2020 |
LazyDiT: Lazy learning for the acceleration of diffusion transformers X Shen, Z Song, Y Zhou, B Chen, Y Li, Y Gong, K Zhang, H Tan, J Kuen, ... Proceedings of the AAAI Conference on Artificial Intelligence 39 (19), 20409 …, 2025 | 29 | 2025 |
Survey: Exploiting data redundancy for optimization of deep learning JA Chen, W Niu, B Ren, Y Wang, X Shen ACM Computing Surveys 55 (10), 1-38, 2023 | 29 | 2023 |
Towards high-quality and efficient video super-resolution via spatial-temporal data overfitting G Li, J Ji, M Qin, W Niu, B Ren, F Afghah, L Guo, X Ma 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR …, 2023 | 28 | 2023 |