Seleted Publications
CodeLLM & AI4SE:
OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2026.
LAMDAS: LLM as an Implicit Classifier for Domain-specific Data Selection.
The AAAI Conference on Artificial Intelligence, 2026.
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks.
The Annual Conference on Neural Information Processing Systems, 2025.
GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding.
The Annual Meeting of the Association for Computational Linguistics, 2025.
Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM.
arXiv preprint arXiv:2311.12785, [cs.LG], 2025.
Understanding Code Changes Practically with Small-Scale Language Models.
The IEEE/ACM International Conference on Automated Software Engineering, 2024.
CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2024.
Prompting Frameworks for Large Language Models: A Survey. arXiv preprint arXiv:2311.12785, [cs.SE], 2023.
Program Analysis:
Principles and Practices of Large-Scale Code Analysis at Ant Group: A Data- and Logic-Oriented Approach.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2026.
BIT: Empowering Binary Analysis Through the LLVM Toolchain.
The IEEE/ACM International Symposium on Code Generation and Optimization, 2026.
Evolving Trends, Patterns, and Hidden Pitfalls: Unveiling JavaScript Feature Usage in the Wild.
The IEEE/ACM International Conference on Software Engineering, 2026.
LLM-Powered Static Binary Taint Analysis.
The ACM Transactions on Software Engineering and Methodology (TOSEM), and
Journal-First Track of the IEEE/ACM International Conference on Automated Software Engineering, 2025.
Datalog-Based Language-Agnostic Change Impact Analysis for Microservices.
The IEEE/ACM International Conference on Software Engineering, 2025.
Tumbling Down the Rabbit Hole: How do Assisting Exploration Strategies Facilitate Grey-box Fuzzing?
The IEEE/ACM International Conference on Software Engineering, 2025.
Scaling Abstraction Refinement for Program Analyses in Datalog Using Graph Neural Networks.
The ACM SIGPLAN International Conference on Object-Oriented Programming Systems, Languages, and Applications, 2024.
Finding and Understanding Defects in Static Analyzers by Constructing Automated Oracles.
The ACM International Conference on the Foundations of Software Engineering, 2024.
MicroFuzz: An Efficient Fuzzing Framework for Microservices.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2024.
Generic Sensitivity: Generics-Guided Context Sensitivity for Pointer Analysis.
The IEEE Transactions on Software Engineering, 2024.
Hybrid Inlining: A Framework for Compositional and Context-Sensitive Static Analysis.
The ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023.
Incremental Call Graph Construction in Industrial Practice.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023.
Scalable Compositional Static Taint Analysis for Sensitive Data Tracing on Industrial Micro-Services.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023.
Record and Replay of Online Traffic for Microservices with Automatic Mocking Point Identification.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2022.
Field-based Static Taint Analysis for Industrial Microservice.
The IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2022.
AI Infra & Parallel Programming:
TAC: Cache-based System for Accelerating Billion-Scale GNN Training on Multi-GPU Platform.
The ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026.
Efficient Function Orchestration for Large Language Models.
The IEEE Transactions on Software Engineering, 2025.
Modeling the Interplay between Loop Tiling and Fusion in Optimizing Compilers using Affine Relations.
The ACM Transactions on Computer Systems, 2023.
AKG: Automatic Kernel Generation for Neural Processing Units using Polyhedral Transformations.
The ACM SIGPLAN Conference on Programming Language Design and Implementation, 2021.
Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data.
The IEEE/ACM International Symposium on Microarchitecture, 2020.
RegTT: Accelerating Tree Traversals on GPUs by Exploiting Regularities.
The International Conference on Parallel Processing, 2016.
SVF: Static Value-Flow Analysis in LLVM.
The 2016 European LLVM Conference, 2016.
Accelerating Dynamic Data Race Detection Using Static Thread Interference Analysis.
The International Workshop on Programming Models and Applications for Multicores and Manycores co-located with the PPoPP, 2016.
Sparse Flow-Sensitive Pointer Analysis for Multithreaded C Programs.
The IEEE/ACM International Symposium on Code Generation and Optimization, 2016.
Efficient Parallel Implementation of Incompressible Pipe Flow Algorithm Based on SIMPLE.
Concurrency and Computation: Practice and Experience, 2016.
Region-Based May-Happen-in-Parallel Analysis for C Programs.
The International Conference on Parallel Processing, 2015.
Automatic Parallelization of Tiled Loop Nests with Enhanced Fine-Grained Parallelism on GPUs.
The International Conference on Parallel Processing, 2012.
An Efficient Parallel Implementation for Three-dimensional Incompressible Pipe Flow based on SIMPLE.
The IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, 2012.
Parallelizing SOR for GPGPUs Using Alternate Loop Tiling.
Parallel Computing, 2012.
Model-Driven Tile Size Selection for DOACROSS Loops on GPUs.
The International European Conference on Parallel and Distributed Computing, 2011.
Toward Harnessing DOACROSS Parallelism for Multi-GPGPUs.
The International Conference on Parallel Processing, 2010.
