Energon: Toward efficient acceleration of transformers using dynamic sparse attention Z Zhou, J Liu, Z Gu, G Sun IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2022 | 28 | 2022 |
{PetS}: A unified framework for {Parameter-Efficient} transformers serving Z Zhou, X Wei, J Zhang, G Sun 2022 USENIX Annual Technical Conference (USENIX ATC 22), 489-504, 2022 | 28 | 2022 |
BlockGNN: Towards efficient GNN acceleration using block-circulant weight matrices Z Zhou, B Shi, Z Zhang, Y Guan, G Sun, G Luo 2021 58th ACM/IEEE Design Automation Conference (DAC), 1009-1014, 2021 | 27 | 2021 |
GNNear: Accelerating Full-Batch Training of Graph Neural Networks with Near-Memory Processing Z Zhou, C Li, X Wei, X Wang, G Sun Proceedings of the International Conference on Parallel Architectures and …, 2022 | 25* | 2022 |
DIMM-Link: Enabling Efficient Inter-DIMM Communication for Near-Memory Processing Z Zhou, C Li, F Yang, G Sun 2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023 | 10 | 2023 |
Hardware-assisted service live migration in resource-limited edge computing systems Z Zhou, X Li, X Wang, Z Liang, G Sun, G Luo 2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020 | 8 | 2020 |
Edge-stream: A stream processing approach for distributed applications on a hierarchical edge-computing system X Wang, Z Zhou, P Han, T Meng, G Sun, J Zhai 2020 IEEE/ACM Symposium on Edge Computing (SEC), 14-27, 2020 | 6 | 2020 |
Reconfigurable asic implementation of asynchronous recurrent neural networks S Nelson, SY Kim, J Di, Z Zhou, Z Yuan, G Sun 2021 27th IEEE International Symposium on Asynchronous Circuits and Systems …, 2021 | 5 | 2021 |
{SaFace}: Towards Scenario-aware Face Recognition via Edge Computing System Z Zhou, B Wu, Z Liang, G Sun, C Xu, G Luo 3rd USENIX Workshop on Hot Topics in Edge Computing (HotEdge 20), 2020 | 4 | 2020 |
FD-CNN: A Frequency-Domain FPGA Acceleration Scheme for CNN-Based Image-Processing Applications X Wang, Z Zhou, Z Yuan, J Zhu, Y Cao, Y Zhang, K Sun, G Sun ACM Transactions on Embedded Computing Systems 22 (6), 1-30, 2023 | 3 | 2023 |
Automated design of chiplets A Sangiovanni-Vincentelli, Z Liang, Z Zhou, J Zhang Proceedings of the 2023 International Symposium on Physical Design, 1-8, 2023 | 3 | 2023 |
NMExplorer: An Efficient Exploration Framework for DIMM-based Near-Memory Tensor Reduction C Li, Z Zhou, X Li, G Sun, D Niu 2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023 | 2 | 2023 |
METRO: A software-hardware co-design of interconnections for spatial DNN accelerators Z Wang, G Sun, J Zhu, Z Zhou, Y Guo, Z Yuan arXiv preprint arXiv:2108.10570, 2021 | 2 | 2021 |
Accelerate service live migration in resource-limited edge computing systems Z Zhou, X Li, G Sun Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, 354-355, 2019 | 2 | 2019 |
Rapid configuration of asynchronous recurrent neural networks for ASIC implementations S Nelson, W Khalil, SY Kim, J Di, Z Zhou, Z Yuan, G Sun 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2021 | 1 | 2021 |
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization C Li, Z Zhou, Y Wang, F Yang, T Cao, M Yang, Y Liang, G Sun Proceedings of the 29th ACM International Conference on Architectural …, 2024 | | 2024 |
SpecPIM: Accelerating Speculative Inference on PIM-Enabled System via Architecture-Dataflow Co-Exploration C Li, Z Zhou, S Zheng, J Zhang, Y Liang, G Sun Proceedings of the 29th ACM International Conference on Architectural …, 2024 | | 2024 |
Toward CXL-Native Memory Tiering via Device-Side Profiling Z Zhou, Y Chen, T Zhang, Y Wang, R Shu, S Xu, P Cheng, L Qu, Y Xiong, ... arXiv preprint arXiv:2403.18702, 2024 | | 2024 |
Polaris: Enhancing CXL-based Memory Expanders with Memory-side Prefetching Z Zhou, S Xu, Y Chen, T Zhang, R Shu, L Qu, P Cheng, Y Xiong, G Sun International Symposium on Advanced Parallel Processing Technologies, 19-39, 2023 | | 2023 |