About me
I am a Senior Staff Engineer at Ant Group, where I lead the Intelligent Platform Engineering (IPE) team. My work is dedicated to driving a paradigm shift in the DevOps landscape by deeply integrating Large Language Models (LLMs) and program analysis techniques to enhance developer productivity, software quality, and security.
In parallel with my industry role, I maintain a connection to academia as an Adjunct Associate Professor at The University of New South Wales and serve as an Industrial PhD Mentor at Zhejiang University.
Experience & Key Projects
Ant Group (2020 – Present)
My primary focus is on building enterprise-scale products that redefine software engineering for Ant Group’s internal infrastructure. While we are proud of our open-source contributions, they represent just the tip of the iceberg of our team’s overall innovation and impact.
- CodeFuse LLM: : Our foundational pretrained code LLM, designed to enhance coding effectiveness. It has been successfully implemented in critical software lifecycle across the group, covering AI coding and operations, code quality, security and relibility assurance.
[CodeFuse] [CodeFuse-CGM] [Ling-Coder-lite]
contribute to [WeaveFox] [TBox] - OpenDeRisk: an AI-native risk intelligence system. It empowers your application to act as a 24/7 risk-aware manager, delivering continuous and comprehensive protection.
[OpenDeRisk on GitHub] - OpAgent: our new web agent that achieves a state-of-the-art 71.6% success rate on the WebArena benchmark! Most web agents fail because they are trained on static data. OpAgent is different. It learns by interacting directly with live websites using Agentic RL (Reinforcement Learning), allowing it to adapt to the web's dynamic nature. Its modular framework (Planner, Grounder, Reflector) enables robust self-correction, mastering complex web tasks where others struggle.
[OpAgent on Github] [OpAgent on HuggingFace] [Technical report]
Huawei (2018 – 2020)
Before joining Ant, I worked at Huawei as a Technical Expert of AI compiler to build AI infra for Huawei AI ecosystem.
- MindSpore/AKG: I co-designed the MindSpore framework and founded its AKG compiler for the Ascend chip, driving its commercialization through performance optimization. I also played a key role in open-sourcing the platform, establishing its developer ecosystem.
[MindSpore][MindSpore/AKG on GitHub]
The University of New South Wales (2013 – 2018)
During my time in academia as a Lecturer and Associate Professor, my research had a strong focus on real-world application and commercial translation.
- SVF (Static Value-Flow Analysis): I cooperatively developed a program analyser SVF, a leading pointer analysis framework built on LLVM. Its core technology became a cornerstone for multiple commercial software security products, effectively shaping the ecosystem in this domain.
[SVF Project Page]
Research Interests
- Code Large Language Model, AI Software Engineering
- Programming Language, Program Analysis, Software Security
- AI Infra, Parallel Programming
News
My team is looking for self-motivated technical experts and internship students, who are interested in machine learning, program analysis, AI software engineering. Please no hesitate to contact me (dipeng dot dp at antgroup dot com).
- 2026-03: Our two papers on the Codefuse ToolChain were accepted to FSE 2026 (Industry Track). RepoFuse delivers fast (~150ms), accurate repository-level code completion by combining semantic dependencies with similar patterns, reducing token usage and boosting productivity. NES learns from historical editing patterns to predict where and how to edit next—without user instructions—providing sub-250ms, high-accuracy edits that improve developer flow. Together, these works form the core of Codefuse’s practical development tooling.
- 2026-02: We’re excited to open-source OpAgent, our new web agent that achieves a state-of-the-art 71.6% success rate on the WebArena benchmark! Please check repository [OpAgent on Github] [OpAgent on HuggingFace] [Technical report].
- 2026-01: “Prompting Frameworks for Large Language Models: A Survey” has been accepted by ACM Computing Surveys.
- 2025-12: Our two papers, introducing our open-sourced OpenDerisk and Codefuse-Query respectively, has been accepted by ICSE-SEIP 2026.
- 2025-11: Our binary analysis work, titled “BIT: Empowering Binary Analysis Through the LLVM Toolchain” has been accepted by CGO 2026. This paper describes the application of binary analysis technology in Ant Group’s real-world IoT scenarios (Alipay Tap to Pay), identifying program issues through the collaborative analysis of data flows using both source code and binary.
- 2025-09: Our paper, titled “Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks” has been accepted by NeurIPS 2025.
- 2025-08: We are proud to open-source OpenDeRisk, an AI-native risk intelligence system that transforms your application into a 24/7 risk-aware manager, offering continuous, comprehensive, and deep-layer protection. Available now on GitHub.
- 2025-06: Our paper, titled “Evolving Trends, Patterns, and Hidden Pitfalls: Unveiling JavaScript Feature Usage in the Wild” has been accepted by ICSE 2026.
- 2025-05: We are pleased to announce the updated version of the CGM-72B-V1.2. The model further achieves a remarkable 44.00% resolve rate on the SWE-Bench-Lite leaderboard.CGM
- 2025-05: Our paper, titled “GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding” has been accepted to the main conference of ACL 2025.
- 2025-04: Our paper, titled “Tumbling Down the Rabbit Hole: How do Assisting Exploration Strategies Facilitate Grey-box Fuzzing?” wins ACM SIGSOFT Distinguished Paper Award.
- 2025-03: We have open-sourced our latest MoE code LLM, Ling-Coder-Lite, available on GitHub and HuggingFace. This model is based on the Ant Ling-MoE architecture. Additionally, the technical report, SFT dataset and DPO dataset are also openly accessible.
- 2024-12: Our paper, titled “LLM-Powered Static Binary Taint Analysis.” has been accepted by ACM Transactions on Software Engineering and Methodology (TOSEM).
- 2024-11: I was honored to be invited to present Ant Group’s recent work on CodeLLM at CCF ChinaSoft 2024. Additionally, our team member, Cong Li, was awarded the Distinguished Dissertation Award by the CCF Systems Software.
- 2024-10: Our paper, titled “Datalog-Based Language-Agnostic Change Impact Analysis for Microservices” has been accepted by ICSE 2025. This paper introduces the application of our datalog-based program analysis system CodeFuse-Query on the complex change impact analysis of microservices.
- 2024-10: “MiniChecker: Detecting Data Privacy Risk of Abusive Permission Request Behavior in Mini-Programs” earned the distinguished paper award of ASE’24.
- 2024-08: Our paper, titled “Understanding Code Changes Practically with Small-Scale Language Models” has been accepted by ASE 2024. Congratulations to Cong Li, the first author of this work, who joined the Ant Postdoctoral Programme one year ago! Well done! And another paper from our team, titled “MiniChecker: Detecting Data Privacy Risk of Abusive Permission Request Behavior in Mini-Programs”, has been accepted by ASE 2024 too.
- 2024-07: Our paper, titled “Tumbling Down the Rabbit Hole: How do Assisting Exploration Strategies Facilitate Grey-box Fuzzing?” has been accepted by ICSE 2025.
- 2024-06: Our paper, titled “Scaling Abstraction Refinement for Program Analyses in Datalog Using Graph Neural Networks”, which introduced an approach to combining graph neural networks and constrait solving to improve program analyses, has been accepted by OOPSLA 2024.
- 2024-05: Our team won 2024 T-Star Award, Ant Group’s highest technological award. This award celebrates our innovative contributions to CodeFuse.
- 2024-04: Our paper, titled “Finding and Understanding Defects in Static Analyzers by Constructing Automated Oracles”, which was cooperated with East China Normal University, ETH Zurich, and University of New South Wales, has been accepted by FSE 2024.
- 2024-03: Our paper, titled “Generic Sensitivity: Generics-Guided Context Sensitivity for Pointer Analysis”, which was cooperated with the Chinese Academy of Sciences and Nanjing University, has been accepted by IEEE Transactions on Software Engineering (TSE).
- 2023-12: Our two papers “CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model” and “MicroFuzz: An Efficient Fuzzing Framework for Microservices” are accepted by ICSE-SEIP 2024.
- 2023-12: Good News: we opened our datalog-based static program analyser CodeFuse-Query.
- 2023-11: Our paper “Modeling the Interplay between Loop Tiling and Fusion in Optimizing Compilers using Affine Relations” was accepted by ACM Transactions on Computer Systems (TOCS). We are so fortunate to be the authors of one of the last round of TOCS publications since this journal is no longer accepting submissions and will cease publication after 2023.
- 2023-09: The pretrained code LLM, CodeFuse, which was trained by our team at AntGroup, has been open-sourced on GitHub and HuggingFace. The technical report has been published on link.
- 2023-04: Our paper “Hybrid Inlining: A Framework for Compositional and Context-Sensitive Static Analysis” is accepted by ISSTA 2023.
- 2022-12: Our two papers “Incremental Call Graph Construction in Industrial Practice” and “Scalable Compositional Static Taint Analysis for Sensitive Data Tracing on Industrial Micro-Services” are accepted by ICSE-SEIP 2023.
- 2022-12: Our SAST and SCA products officially enter the Chinese Product and Manufacturer List of Software Supply Chain Security.
- 2022-10: I was invited to serve SANER 2023 as an industrial track PC member.
- 2022-02: I was invited to join the Industry Advisory Board of Monash University.
- 2022-01: Our two papers “Record and Replay of Online Traffic for Microservices with Automatic Mocking Point Identification” and “Field-based Static Taint Analysis for Industrial Microservice” are accepted by ICSE-SEIP 2022.
- 2021-12: I was invited to introduce ANT’s AIOps on CCF ChinaSoft 2021. link
- 2021-10: I was invited to present our unified program analysis framework on CCF CNCC 2021. link
- 2021-03: Our paper “AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations” is accepted by PLDI 2021.
- 2020-10: Our paper “Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data” gains Best Paper Nomination.
- 2020-07: Our paper “Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data” is accepted by MICRO-53.
- 2020-06: Our AKG is open-sourced. (GitHub, Gitee).
