Selected Publications

  CodeLLM & AI4SE:

  • [ICSE-SEIP'26]  Peng Di, Faqiang Chen, Xiao Bai, Hongjun Yang, Qingfeng Li, Ganglin Wei, Jian Mou, Feng Shi, Keting Chen, Peng Tang, Zhitao Shen, Zheng Li, Wenhui Shi, Junwei Guo, Hang Yu,
    OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2026.
  • [AAAI'26]  Jian Wu, Hang Yu, Bingchang Liu, Yang Wenjie, Peng Di, Jianguo Li, Yue Zhang,
    LAMDAS: LLM as an Implicit Classifier for Domain-specific Data Selection.
    The AAAI Conference on Artificial Intelligence, 2026.
  • [NeurIPS'25]  Hongyuan Tao, Ying Zhang, Zhenhao Tang, Hongen Peng, Xukun Zhu, Bingchang Liu, Yingguang Yang, Ziyin Zhang, Zhaogui Xu, Haipeng Zhang, Linchao Zhu, Rui Wang, Hang Yu, Jianguo Li, Peng Di,
    Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks.
    The Annual Conference on Neural Information Processing Systems, 2025.
  • [ACL'25]  Ziyin Zhang, Hang Yu, Sage Lee, Peng Di, Jianguo Li, Rui Wang,
    GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding.
    The Annual Meeting of the Association for Computational Linguistics, 2025.
  • [arXiv]  CodeFuse & Ling Team,
    Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM.
    arXiv preprint arXiv:2503.17793, [cs.LG], 2025.
  • [ASE'24]  Cong Li, Zhaogui Xu, Peng Di, Dongxia Wang, Zheng Li, and Qian Zheng,
    Understanding Code Changes Practically with Small-Scale Language Models.
    The IEEE/ACM International Conference on Automated Software Engineering, 2024.
  • [ICSE-SEIP'24]  Peng Di, Jianguo Li, Hang Yu, Wei Jiang, et al.,
    CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2024.
  • [arXiv]  Xiaoxia Liu, Jingyi Wang, Jun Sun, Xiaohan Yuan, Guoliang Dong, Peng Di, Wenhai Wang, Dongxia Wang,
    Prompting Frameworks for Large Language Models: A Survey.
    arXiv preprint arXiv:2311.12785, [cs.SE], 2023.

  Program Analysis:

  • [ICSE-SEIP'26]  Xiaoheng Xie, Gang Fan, Xiaojun Lin, Ang Zhou, Shijie Li, Xunjin Zheng, Yinan Liang, Yu Zhang, Na Yu, Haokun Li, Xinyu Chen, Yingzhuang Chen, Yi Zhen, Dejun Dong, Xianjin Fu, Jinzhou Su, Fuxiong Pan, Pengshuai Luo, Youzheng Feng, Ruoxiang Hu, Hanyang Guo, Jing Fan, Xiao Xiao, Peng Di,
    Principles and Practices of Large-Scale Code Analysis at Ant Group: A Data- and Logic-Oriented Approach.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2026.
  • [CGO'26]  Puzhuo Liu, Peng Di, Jingling Xue, Yu Jiang,
    BIT: Empowering Binary Analysis Through the LLVM Toolchain.
    The IEEE/ACM International Symposium on Code Generation and Optimization, 2026.
  • [ICSE'26]  Dawei Chen, Wuxia Jin, Hui Guo, Guanlin Qiao, Peng Di, Ting Liu,
    Evolving Trends, Patterns, and Hidden Pitfalls: Unveiling JavaScript Feature Usage in the Wild.
    The IEEE/ACM International Conference on Software Engineering, 2026.
  • [TOSEM'25 & Journal-First of ASE'25]  Puzhuo Liu, Chengnian Sun, Yaowen Zheng, Xuan Feng, Chuan Qin, Yuncheng Wang, Zhenyang Xu, Zhi Li, Peng Di, Yu Jiang, Limin Sun,
    LLM-Powered Static Binary Taint Analysis.
    The ACM Transactions on Software Engineering and Methodology (TOSEM), and
    Journal-First Track of the IEEE/ACM International Conference on Automated Software Engineering, 2025.
  • [ICSE'25]  Qingkai Shi, Xiaoheng Xie, Xianjin Fu, Peng Di, Huawei Li, Ang Zhou, Gang Fan,
    Datalog-Based Language-Agnostic Change Impact Analysis for Microservices.
    The IEEE/ACM International Conference on Software Engineering, 2025.
  • [ICSE'25, Distinguished Paper Award]  Mingyuan Wu, Jiahong Xiang, Kunqiu Chen, Peng Di, Shin Hwei Tan, Heming Cui, Yuqun Zhang,
    Tumbling Down the Rabbit Hole: How do Assisting Exploration Strategies Facilitate Grey-box Fuzzing?
    The IEEE/ACM International Conference on Software Engineering, 2025.
  • [OOPSLA'24]  Zhenyu Yan, Xin Zhang, Peng Di,
    Scaling Abstraction Refinement for Program Analyses in Datalog Using Graph Neural Networks.
    The ACM SIGPLAN International Conference on Object-Oriented Programming Systems, Languages, and Applications, 2024.
  • [FSE'24]  Weigang He, Peng Di, Mengli Ming, Chengyu Zhang, Ting Su, Shijie Li, Yulei Sui,
    Finding and Understanding Defects in Static Analyzers by Constructing Automated Oracles.
    The ACM International Conference on the Foundations of Software Engineering, 2024.
  • [ICSE-SEIP'24]  Peng Di, Bingchang Liu, Yiyi Gao,
    MicroFuzz: An Efficient Fuzzing Framework for Microservices.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2024.
  • [TSE'24]  Haofeng Li, Tian Tan, Yue Li, Jie Lu, Haining Meng, Liqing Cao, Yongheng Huang, Lian Li, Lin Gao, Peng Di, Liang Lin, ChenXi Cui.
    Generic Sensitivity: Generics-Guided Context Sensitivity for Pointer Analysis.
    The IEEE Transactions on Software Engineering, 2024.
  • [ISSTA'23]  Jiangchao Liu, Jierui Liu, Peng Di, Diyu Wu, Hengjie Zheng, Alex X. Liu, Jingling Xue,
    Hybrid Inlining: A Framework for Compositional and Context-Sensitive Static Analysis.
    The ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023.
  • [ICSE-SEIP'23]  Zelin Zhao, Xizao Wang, Zhaogui Xu, Zhenhao Tang, Yongchao Li and Peng Di.
    Incremental Call Graph Construction in Industrial Practice.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023.
  • [ICSE-SEIP'23]  Zexin Zhong, Jiangchao Liu, Diyu Wu, Peng Di, Yulei Sui, Alex X. Liu and John C.S. Lui.
    Scalable Compositional Static Taint Analysis for Sensitive Data Tracing on Industrial Micro-Services.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023.
  • [ICSE-SEIP'22]  Jiangchao Liu, Jierui Liu, Peng Di, Alex X. Liu, Zexin Zhong.
    Record and Replay of Online Traffic for Microservices with Automatic Mocking Point Identification.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2022.
  • [ICSE-SEIP'22]  Zexin Zhong, Jiangchao Liu, Diyu Wu, Peng Di, Yulei Sui, Alex X. Liu.
    Field-based Static Taint Analysis for Industrial Microservice.
    The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2022.

  AI Infra & Parallel Programming:

  • [PPoPP'26]  Zhiqiang Liang, Hongyu Gao, Fang Liu, Jue Wang, Xingguo Shi, Juyu Gu, Peng Di, San Li, Lei Tang, Chunbao Zhou, Lian Zhao, Yangang Wang, Xuebin Chi,
    TAC: Cache-based System for Accelerating Billion-Scale GNN Training on Multi-GPU Platform.
    The ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026.
  • [TSE'25]  Xiaoxia Liu, Peng Di, Cong Li, Jun Sun, Jingyi Wang,
    Efficient Function Orchestration for Large Language Models.
    The IEEE Transactions on Software Engineering, 2025.
  • [TOCS'23]  Jie Zhao, Jinchen Xu, Peng Di, Wang Nie, Jiahui Hu, Yanzhi Yi, Sijia Yang, Zhen Geng, Renwei Zhang, Bojie Li, Zhiliang Gan, Xuefeng Jin.
    Modeling the Interplay between Loop Tiling and Fusion in Optimizing Compilers using Affine Relations.
    The ACM Transactions on Computer Systems, 2023.
  • [PLDI'21]  Jie Zhao, Bojie Li, Wang Nie, Zhen Geng, Renwei Zhang, Xiong Gao, Bin Cheng, Chen Wu, Yun Cheng, Zheng Li, Peng Di, Kun Zhang and Xuefeng Jin.
    AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations.
    The ACM SIGPLAN Conference on Programming Language Design and Implementation, 2021.
  • [MICRO'20, Best Paper Nomination]  Jie Zhao, Peng Di.
    Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data.
    The IEEE/ACM International Symposium on Microarchitecture, 2020.
  • [ICPP'16]  Feng Zhang, Peng Di, Hao Zhou and Jingling Xue.
    RegTT: Accelerating Tree Traversals on GPUs by Exploiting Regularities.
    The International Conference on Parallel Processing, 2016.
  • [EuroLLVM'16]  Yulei Sui, Peng Di, Ding Ye, Hua Yan and Jingling Xue.
    SVF: Static Value-Flow Analysis in LLVM.
    The 2016 European LLVM Conference, 2016.
  • [PMAM'16]  Peng Di, Yulei Sui.
    Accelerating Dynamic Data Race Detection Using Static Thread Interference Analysis.
    The International Workshop on Programming Models and Applications for Multicores and Manycores, co-located with PPoPP, 2016.
  • [CGO'16]  Yulei Sui, Peng Di, Jingling Xue.
    Sparse Flow-Sensitive Pointer Analysis for Multithreaded C Programs.
    The IEEE/ACM International Symposium on Code Generation and Optimization, 2016.
  • [CCPE'16]  Jilin Zhang, Junfeng Yuan, Jian Wan, Peng Di.
    Efficient Parallel Implementation of Incompressible Pipe Flow Algorithm Based on SIMPLE.
    Concurrency and Computation: Practice and Experience, 2016.
  • [ICPP'15]  Peng Di, Yulei Sui, Ding Ye, Jingling Xue.
    Region-Based May-Happen-in-Parallel Analysis for C Programs.
    The International Conference on Parallel Processing, 2015.
  • [ICPP'12]  Peng Di, Ding Ye, Yu Su, Yulei Sui, Jingling Xue.
    Automatic Parallelization of Tiled Loop Nests with Enhanced Fine-Grained Parallelism on GPUs.
    The International Conference on Parallel Processing, 2012.
  • [CCGrid'12]  Jilin Zhang, Liting Zhu, Jie Mao, Jian Wan, Peng Di.
    An Efficient Parallel Implementation for Three-dimensional Incompressible Pipe Flow based on SIMPLE.
    The IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, 2012.
  • [PC'12]  Peng Di, Hui Wu, Jingling Xue, Feng Wang, Canqun Yang.
    Parallelizing SOR for GPGPUs Using Alternate Loop Tiling.
    Parallel Computing, 2012.
  • [Euro-Par'11]  Peng Di, Jingling Xue.
    Model-Driven Tile Size Selection for DOACROSS Loops on GPUs.
    The International European Conference on Parallel and Distributed Computing, 2011.
  • [ICPP'10]  Peng Di, Qing Wan, Xuemeng Zhang, Hui Wu and Jingling Xue.
    Toward Harnessing DOACROSS Parallelism for Multi-GPGPUs.
    The International Conference on Parallel Processing, 2010.