Publications

  • Conferences

    1. Rubayet Rongon and Xuechen Zhang. “iAUG: Accelerating Augmentation with Importance Sampling in Deep Neural Network Training”. In Proceedings of the 31st International European Conference on Parallel and Distributed Computing (EURO-PAR’25), Dresden, Germany, August 2025. (Acceptance Rate: 29%)
    2. Weijian Chen, Shuibing He, Haoyang Ou, and Xuechen Zhang. “LeapGNN: Accelerating Distributed GNN Training Leveraging Feature-Centric Model Migration”.  In the 23rd USENIX Conference on File and Storage Technologies (FAST ’25), Santa Clara, CA, February 2025. (Acceptance Rate: 21.5%)
    3. Siling Yang, Shuibing He, Wenjiong Wang, Yanlong Yin, Tong Wu, Weijian Chen, Xuechen Zhang, Xina-He Sun, and Dan Feng, “GOPIM: GCN-Oriented Pipeline Optimization for PIM Accelerators”. In The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA-31), Las Vegas, NV, March 2025. (Acceptance Rate: 21%)
    4. Rubayet Rongon, Chen Cao, and Xuechen Zhang, “A Study of Data-Path Bugs in PyTorch with a Focus on Memory Management”. In Proceedings of the 2024 IEEE International Conference on Big Data (BigData’24), Washington DC, December 2024. (Acceptance Rate: 18.8%)
    5. Zhe Pan, Shuibing He, Xu Li, Xuechen Zhang, Yanlong Yin, Rui Wang, Lidan Shou, Mingli Song, Xian- He Sun, and Gang Chen. Enumeration of Billions of Maximal Bicliques in Bipartite Graphs without Using GPUs. In Proceedings of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC’24), Atlanta, GA, November 2024. (Acceptance Rate: 22.7%)

     

    • Journal

      1. Dang Zheng, Shuibing He, Xuechen Zhang, Peiyi Hong, Zhenxin Li, Xinyu Chen, Haozhe Song, Xian-He Sun, and Gang Chen, “PMAlloc: A Holistic Approach to Improving Persistent Memory Allocation”. ACM Transactions on Computer Systems (TOCS), 2024.
      2. Siling Yang, Shuibing He, Hexiao Duan, Weijian Chen, Xuechen Zhang, Tong Wu, and Yanlong Yin, “APQ: Automated DNN Pruning and Quantization for ReRAM-based Accelerators”, IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 34, no. 9, Sep. 2023.
      3. Ping Chen, Shuibing He, Xuechen Zhang, Shuaiben Chen, Peiyi Hong, Yanlong Yin, and Xian-He Sun, “Accelerating Tensor Swapping in GPUs with Self-Tuning Compression”, IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 33, issue 12, December 2022.
      4. Soklong Lim, Tyler Coy, Zaixin Lu, Bin Ren, and Xuechen Zhang, “NVGRAPH: Enforcing Crash Consistency of Evolving Network Analytics in NVMM Systems”, IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 31, no. 6, June 2020.
      5. Shuibing He, Yanlong Yin, Xian-He Sun, Xuechen Zhang, Zongpeng Li, “Optimizing Parallel I/O Accesses through Pattern-Directed and Layout-Aware Replication”. IEEE Transactions on Computers (TC),  vol. 69, no. 2, February 2020.
      ©