Yue Yu (余跃)

I am currently a Professor at Pengcheng Laboratory (PCL). Prior to joining PCL, I was with the Trustie Group at the National University of Defense Technology (NUDT) from 2011 to 2024. My research interests lie at the intersection of Software Engineering, Distributed & Cloud Computing, and Artificial Intelligence. Currently, my work focuses on Distributed AI Systems and Computing Power Networks, with an emphasis on efficient training/inference of foundation models, resource scheduling for heterogeneous computing, and intelligent software infrastructure.

Prospective Students and Collaborators: I am actively recruiting PhD students, postdoctoral researchers, and young faculty members. If you are interested in building next-generation AI systems or intelligent computing infrastructure, please feel free to contact me.

Contact Information

Pengcheng Laboratory
Shenzhen, Guangdong, P.R.C.
Email: yuy <at> pcl <dot> ac <dot> cn

Education

DECAL Lab, Computer Science Department, University of California, Davis, USA.
2014.10--2015.10 Visiting Ph.D. student. Advisor: Prem Devanbu and Vladimir Filkov
National Laboratory for Parallel and Distributed Processing (PDL), National University of Defense Technology (NUDT), China.
2013.3--2016.6 Ph.D. in Software Engineering. Advisor: Huaimin Wang
Computer School, National University of Defense Technology, China.
2011.9--2013.3 M.S. in Computer Science. Advisor: Huaimin Wang
Computer School, Wuhan University (WHU), China. (Top 2, Postgraduate Recommendation)
2007.9--2011.6 B.E. in Information Security
School of Journalism and Communication, Wuhan University, China.
2009.3--2010.6 Minor in Journalism and Communication.

Community Service

Reviewer:

Journal: IEEE Transactions on Software Engineering (TSE), ACM Transactions on Software Engineering and Methodology (TOSEM), IEEE Transactions on Reliability (TR), ACM Transactions on Internet Technology (TOIT), Empirical Software Engineering (ESE), Information and Software Technology (IST), Journal of Systems and Software (JSS), Science China Information Sciences, Journal of Software
Program/Organizing Committee Member: FSE 2024, SANER 2023, FSE 2021, MSR 2017, ICA3PP 2017, ATC 2018

Projects

Principal Investigator:

OSS Service Environment for the New Generation of Artificial Intelligence, National Grand Research and Development Plan (2020AAA0103504), 2020.07-2023.06.
Research on Theories and Mechanisms of Crowd-based Development for Open Source Ecosystem, National Natural Science Foundation of China (61702534), 2018.01-2020.12.
Mining Social Coding Repository and Network, Postgraduate Research and Innovation Project of Hunan Province (CX2013B032), 2013.03-2015.03.

Collaborator:

Intelligent Software Development Environment for Crowd Collaboration, National Grand Research and Development Plan (2016YFB1000805), 2016.07-2019.06.
Research on Software Situation Analysis based on Knowledge Mapping of Open Source Ecosystem, National Natural Science Foundation of China (61502512), 2016.01-2018.12.
Research on Software Requirement Elicitation and Modeling based on Crowd Collaboration in Network Environment, National Natural Science Foundation of China (61432020), 2015.01-2019.12.
Research and Application of Software Information Network Mining for Crowd Production, National Natural Science Foundation of China (61472430), 2015.01-2018.12.

Publications

2026

Yapeng Jiang, Minghao Gan, Zicong Hong, Wuhui Chen, Junyuan Liang, Yue Yu (corresponding author), Meng Guo, Zibin Zheng. Kairox: Adaptive GPU-CPU Hybrid LLM Inference via Online Neuron Balancing. The 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2026), Seattle, WA, USA. (CCF-A) [pdf]
Bo Lv, Jingbo Sun, Jianwei Lv, Chen Tang, Shaojie Zhang, Nayu Liu, Guoxin Yu, Zihao Li, Qichao Zhang, Dongbin Zhao, Ping Luo, Yue Yu (corresponding author). Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching. The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), San Diego, California, USA. (CCF-A) [pdf]
Qi Wang, Hanyang Peng, Yue Yu (corresponding author). Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts. The 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026), Singapore. (CCF-A) [pdf]
Tanghaoran Zhang, Xinjun Mao, Shangwen Wang, Yuxin Zhao, Yao Lu, Zezhou Tang, Wenyu Xu, Longfei Sun, Changrong Xie, Kang Yang, Yue Yu (corresponding author). Coding in a Bubble? Evaluating LLMs in Resolving Context Adaptation Bugs During Code Adaptation. The ACM International Conference on the Foundations of Software Engineering (FSE 2026), Montreal, Canada. (CCF-A) [pdf]
Han Zhang, Ruibin Zheng, ZeXuan Yi, Zhuo Zhang, Hanyang Peng, Hui Wang, Jiayin Qi, Binxing Fang, Ruifeng Xu, Yue Yu (corresponding author). GEPO: Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning. The 14th International Conference on Learning Representations (ICLR 2026), Brazil. [pdf]
Yong Chu, Xun Zhou, Zenglin Xu, Hui Wang, Yue Yu. Map as a Prompt: Learning Multi-Modal Spatial-Signal Foundation Models for Cross-scenario Wireless Localization. The 14th International Conference on Learning Representations (ICLR 2026), Brazil. [pdf]
Dexia Chen, Qianjie Zhu, Weibing Li, Yue Yu, Tong Zhang, Ruixuan Wang. Preserve and Sculpt: Manifold-Aligned Fine-tuning of Vision-Language Models for Few-Shot Learning. The 14th International Conference on Learning Representations (ICLR 2026), Brazil. [pdf]
Jinglong Luo, Zhuo Zhang, Yehong Zhang, Shiyu Liu, Ye Dong, Hui Wang, Yue Yu, Xun Zhou, Zenglin Xu. SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC. The 14th International Conference on Learning Representations (ICLR 2026), Brazil. [pdf]
Chunyan Liu, Yan Lei, Huan Xie, Jinping Wang, Yue Yu, David Lo. Survey on Learning-based Dynamic Fault Localization: From Traditional Machine Learning to Large Language Models. ACM Computing Surveys (CSUR), 2026. (JCR-1) [pdf]
Zhihao Zhang, Lu Tang, Huiba Li, Yue Yu, Jiwu Shu, Yiming Zhang. ParaSync: Exploiting Fine-Grained Parallelism for Efficient File Synchronization. The 24th USENIX Conference on File and Storage Technologies (FAST 2026), Santa Clara, USA. (CCF-A) [pdf]
Zhiyuan Fang, Xingfan Yu, Yuegui Huang, Zicong Hong, Yufeng Lyu, Wuhui Chen, Yue Yu, Fan Yu. Fate: Fast Edge Inference of Mixture-of-Experts Models via Cross-Layer Gate. The ACM Web Conference (WWW 2026), Dubai, United Arab Emirates. (CCF-A) [pdf]
Cai Ke, Bin Liang, Xin Liu, Yue Yu, Hui Wang, Ruifeng Xu. Dynamic Memory Forest: Constructing and Tracing Conversational Trajectories for Long-Term Conversation. The 49th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2026), Melbourne, Australia. [pdf]

2025

Zhiyuan Fang, Yuegui Huang, Zicong Hong, Yufeng Lyu, Wuhui Chen, Yue Yu (corresponding author), Fan Yu, Zibin Zheng. Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline. The ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2025), Rotterdam, Netherlands. (CCF-A) [pdf]
Bo Lv, Nayu Liu, Chen Tang, Xin Liu, Yue Yu (corresponding author), Ping Luo. SpecEM: Training-Free LLM Ensembling via Iterative Drafting, Verification, and Online Feedback. The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025), San Diego, USA. (CCF-A) [pdf]
Tanghaoran Zhang, Yue Yu (corresponding author), Xinjun Mao, Shangwen Wang, Kang Yang, Yao Lu, Zhang Zhang and Yuxin Zhao. Instruct or Interact? Exploring and Eliciting LLMs’ Capability in Code Snippet Adaptation Through Prompt Engineering. The 47th International Conference on Software Engineering (ICSE 2025), Ottawa, Ontario, Canada. (CCF-A) [pdf]
Tanghaoran Zhang, Xinjun Mao, Shangwen Wang, Yuxin Zhao, Yao Lu, Jin Zhang, Zhang Zhang, Kang Yang and Yue Yu (corresponding author). AdaptEval: A Benchmark for Evaluating Large Language Models on Code Snippet Adaptation. The 40th IEEE/ACM International Conference on Automated Software Engineering (ASE 2025), Seoul, Korea. (CCF-A) [pdf]
Junming Qiu, Rongzhen Ye, Weilin Luo, Kunxun Qi, Hai Wan*, Yue Yu (corresponding author). OBDD-NET: End-to-End Learning of Ordered Binary Decision Diagrams. The 34th ACM International Conference on Information and Knowledge Management (CIKM 2025), Seoul, Korea. (CCF-B) [pdf]
Y. Huang, Y. Jiang, Z. Hong, W. Chen, B. Wang, W. Zhu, Yue Yu, Zibin Zheng. Obscura: Concealing Recomputation Overhead in Training of Large Language Models with Bubble-filling Pipeline Transformation. 2025 USENIX Annual Technical Conference (ATC 2025), Boston, USA. (CCF-A) [pdf]
Han Zhang, Zhuo Zhang, Yi Zhang, Yuanzhao Zhai, Hanyang Peng, Yu Lei, Yue Yu, Hui Wang, Bin Liang, Lin Gui, Ruifeng Xu. Correcting Large Language Model Behavior via Influence Function. The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025), Pennsylvania, USA. (CCF-A) [pdf]
Bin Liang, Shiwei Chen, Lin Gui, Hui Wang, Yue Yu, Ruifeng Xu, Kam-Fai Wong. Centrality-guided Pre-training for Graph. The 13th International Conference on Learning Representations (ICLR 2025), Singapore. [pdf]
Yuanzhao Zhai, Zhuo Zhang, Cheng Yang, Kele Xu, Yue Yu, Wei Li, Hui Wang, Zenglin Xu, Dawei Feng, Bo Ding, Huaimin Wang. Preference-Strength-Aware Self-Improving Alignment with Generative Preference Models. The 48th edition of the ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025), Padua, Italy. [pdf]
Han Zhang, Lin Gui, Yu Lei, Yuanzhao Zhai, Yehong Zhang, Zhuo Zhang, Yulan He, Hui Wang, Yue Yu, Kam-Fai Wong, Bin Liang, Ruifeng Xu. COPR: Continual Human Preference Learning via Optimal Policy Regularization. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria. (CCF-A) [pdf]
Bo Lv, Nayu Liu, Yang Shen, Xin Liu, Ping Luo, Yue Yu. Whether LLMs Know If They Know: Identifying Knowledge Boundaries via Debiased Historical In-Context Learning. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria. (CCF-A) [pdf]
Jinglong Luo, Guanzhong Chen, Yehong Zhang, Shiyu Liu, Hui Wang, Yue Yu, Xun Zhou, Yuan Qi, Zenglin Xu. CENTAUR: Bridging the Impossible Trinity of Privacy, Efficiency, and Performance in Privacy-Preserving Transformer Inference. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria. (CCF-A) [pdf]
Yang Shen, Tao Wang, Xunhui Zhang, Yang Zhang, Cheng Yang, Yue Yu, Huaimin Wang. Are External Contributions Important to Project Productivity in Open Source Software? A Deep Insight based on Issue Entropy. The 28th ACM SIGCHI Conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2025), Bergen, Norway. (CCF-A) [pdf]
Qiwen Ke, Yina Lv, Zhihao Zhang, Zhirong Shen, Yue Yu, Hailiang Chen, Zhenglong Song, Xinbiao Gan, Jiaxin Li, Dongsheng Li, Xin Yao, Meiling Wang, Yiming Zhang. CrossFS: Improving Cross-Domain File System Performance with CRDT-Based Metadata Synchronization. ACM Transactions on Storage (TOS), 2025. (CCF-A) [pdf]
Shuyue Zhou, Xinbing Hu, Ronglong Wu, Jiahua Lu, Zhirong Shen, Zikang Xu, Yue Yu, Yuze Jiang, Jiwu Shu, Kunling Yang, Feilong Lin, Yiming Zhang. Looking Back to Move Forward: Unveiling the Mysteries of HBM Errors to Predict Future Failures. ACM Transactions on Storage (TOS), 2025. (CCF-A) [pdf]
Xiaohan Bi, Binhang Qi, Hailong Sun, Xiang Gao, Yue Yu, Xiaojun Liang. NeMo: A Neuron-Level Modularizing-While-Training Approach for Decomposing DNN Models. ACM Transactions on Software Engineering and Methodology (TOSEM), 2025. (CCF-A) [pdf]
Hanbo Bi, Yingchao Feng, Boyuan Tong, Mengyu Wang, Haichen Yu, Yongqiang Mao, Hao Chang, Wenhui Diao, Peijin Wang, Yue Yu, Hanyang Peng, Yehong Zhang, Kun Fu, Xian Sun. RingMoE: Mixture-of-modality-experts multi-modal foundation models for universal remote sensing image interpretation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025. (CCF-A) [pdf]

2024

Yingwei Ma, Yue Liu, Yue Yu (corresponding author), Yuanliang Zhang, Yu Jiang, Changjian Wang, Shanshan Li. At Which Training Stage Does Code Data Help LLMs Reasoning?. The 12th International Conference on Learning Representations (ICLR 2024), Vienna, Austria. [pdf]
Xin Mu, Yu Wang, Zhengan Huang, Junzuo Lai, Yehong Zhang, Hui Wang, Yue Yu (corresponding author). EncryIP: A Practical Encryption-Based Framework for Model Intellectual Property Protection. The 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada. (CCF-A) [pdf]
Guoxin Yu, Lemao Liu, Mo Yu, Yue Yu (corresponding author), Xiang Ao. Rethinking the Evaluation of In-Context Learning for LLMs. The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), Miami, USA. (CCF-A) [pdf]
Zicong Hong, Jian Lin, Song Guo, Sifu Luo, Wuhui Chen, Roger Wattenhofer, Yue Yu. Optimus: Warming Serverless ML Inference via Inter-Function Model Transformation. 19th European Conference on Computer Systems (EuroSys 2024), Athens, Greece. (CCF-A) [pdf]
Haoran Liu, Zhouyang Jia, Shanshan Li, Yan Lei, Yue Yu, Yu Jiang, Xiaoguang Mao, Liao Xiangke. Cut to the Chase: An Error-Oriented Approach to Detect Error-Handling Bugs. The ACM International Conference on the Foundations of Software Engineering (FSE 2024), Porto de Galinhas, Brazil. (CCF-A) [pdf]
Tanghaoran Zhang, Yao Lu, Yue Yu, Xinjun Mao, Yang Zhang, and Yuxin Zhao. How do Developers Adapt Code Snippets to Their Contexts? An Empirical Study of Context-Based Code Snippet Adaptations. IEEE Transactions on Software Engineering (TSE), 2024. (SCI, CCF-A) [pdf]
Huan Xie, Yan Lei, Meng Yan, Shanshan Li, Xiaoguang Mao, Yue Yu, David Lo. Towards More Precise Coincidental Correctness Detection with Deep Semantic Learning. IEEE Transactions on Software Engineering (TSE), 2024. (SCI, CCF-A) [pdf]
Jiahang Zhou, Yanyu Chen, Zicong Hong, Wuhui Chen, Yue Yu, Tao Zhang, Hui Wang, Chuanfu Zhang, Zibin Zheng. Training and Serving System of Foundation Models: A Comprehensive Survey. IEEE Open Journal of the Computer Society, 2024. (SCI) [pdf]
Xueyang Tang, Song Guo, Jingcai Guo, Jie Zhang, Yue Yu. Causally Motivated Personalized Federated Invariant Learning with Shortcut-Averse Information-Theoretic Regularization. 41st International Conference on Machine Learning (ICML 2024), Vienna, Austria. (CCF-A) [pdf]
Jiewei Zhang, Song Guo, Peiran Dong, Jie Zhang, Ziming Liu, Yue Yu, Xiaoming Wu. Easing Concept Bleeding in Diffusion via Entity Localization and Anchoring. 41st International Conference on Machine Learning (ICML 2024), Vienna, Austria. (CCF-A) [pdf]
Bo Lv, Chen Tang, Yanan Zhang, Xin Liu, Ping Luo, Yue Yu. URG: A Unified Ranking and Generation Method for Ensembling Language Models. The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand. (CCF-A) [pdf]
Jinglong Luo, Yehong Zhang, Zhuo Zhang, Jiaqi Zhang, Xin Mu, Hui Wang, Yue Yu, Zenglin Xu. SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC. The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand. (CCF-A) [pdf]
Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang. GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning. The 18th European Conference on Computer Vision (ECCV 2024), Milan, Italy. (CCF-A) [pdf]

2023

Shujiong Tang, Yue Yu (corresponding author), Hui Wang, Guiliang Wang, Wuhui Chen, Zenglin Xu, Song Guo, Wen Gao. A Survey on Scheduling Techniques in Computing and Network Convergence. IEEE Communications Surveys and Tutorials (COMST), 2023. (SCI, JCR-Q1) [pdf]
Hanyang Peng, Shuang Qin, Yue Yu (corresponding author), Jin Wang, Hui Wang, Ge Li. Birder: Communication-Efficient 1-bit Adaptive Optimizer for Practical Distributed DNN Training. The 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, USA. (CCF-A) [pdf]
Zicong Hong, Xiaoyu Qiu, Jian Lin, Wuhui Chen, Yue Yu (corresponding author), Hui Wang, Song Guo, Wen Gao. Intelligence-Endogenous Management Platform for Computing and Network Convergence. IEEE Network, 2023. (SCI, JCR-Q1) [pdf]
Ying Wang, Peng Sun, Lin Pei, Yue Yu (corresponding author), Chang Xu, Shing-Chi Cheung, Hai Yu, Zhiliang Zhu. PLUMBER: Boosting the Propagation of Vulnerability Fixes in the npm Ecosystem. IEEE Transactions on Software Engineering (TSE), 2023. (SCI, CCF-A) [pdf][Plumber]
Shuzheng Gao, Cuiyun Gao, Chaozheng Wang, Jun Sun, David Lo, Yue Yu. Two Sides of the Same Coin: Exploiting the Impact of Identifiers in Neural Code Comprehension. The 45th International Conference on Software Engineering (ICSE 2023), Melbourne, Australia. (CCF-A) [pdf]
Teng Wang, Zhouyang Jia, Shanshan Li, Si Zheng, Yue Yu, Erci Xu, Shaoliang Peng, Xiangke Liao. Understanding and Detecting On-the-Fly Configuration Bugs. The 45th International Conference on Software Engineering (ICSE 2023), Melbourne, Australia. (CCF-A, Distinguished Paper Award) [pdf]
Yingwei Ma, Yue Yu (corresponding author), Shanshan Li, Zhouyang Jia, Jun Ma, Rulin Xu, Wei Dong, Xiangke Liao. MulCS: Towards a Unified Deep Representation for Multilingual Code Search. The 30th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER 2023), Macao, China. (CCF-B, Distinguished Paper Award) [pdf]
Bo Lv, Xin Liu, Shaojie Dai, Nayu Liu, Fan Yang, Ping Luo and Yue Yu (corresponding author). DSP: Discriminative Soft Prompts for Zero-Shot Entity and Relation Extraction. The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada. (CCF-A) [pdf]
Zhuo Zhang, Yuanhang Yang, Yong Dai, Qifan Wang, Yue Yu, Lizhen Qu and Zenglin Xu. FedPETuning: When Federated Learning Meets the Parameter-Efficient Tuning Methods of Pre-trained Language Models. The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada. (CCF-A) [pdf]
Yibo Wang, Ying Wang, Tingwei Zhang, Yue Yu, Shing-Chi Cheung, Hai Yu, Zhiliang Zhu. Can Machine Learning Pipelines Be Better Configured?. The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023), San Francisco, USA. (CCF-A, Distinguished Paper Award) [pdf]
Haochen He, Erci Xu, Shanshan Li, Zhouyang Jia, Si Zheng, Yue Yu, Jun Ma, Xiangke Liao. When Database Meets New Storage Devices: Understanding and Exposing Performance Mismatches via Configurations. The 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada. (CCF-A) [pdf]
Yupeng Yin, Xianglong Zhang, Huanle Zhang, Feng Li, Yue Yu, Xiuzhen Cheng, Pengfei Hu. Ginver: Generative Model Inversion Attacks Against Collaborative Inference. ACM Web Conference (WWW 2023), Austin, USA. (CCF-A) [pdf]
Jinglong Luo, Yehong Zhang, Jiaqi Zhang, Shuang Qin, Yue Yu, Hui Wang, Zenglin Xu. Practical Privacy-Preserving Gaussian Process Regression via Secret Sharing. The 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023), Pittsburgh, USA. (CCF-B) [pdf]
Xiaoqi Wang, Yingjie Cheng, Yaning Yang, Yue Yu, Fei Li, Shaoliang Peng. Multitask joint strategies of self-supervised representation learning on biomedical networks for drug discovery. Nature Machine Intelligence, 2023. [pdf][Supplementary Materials]
Hanyang Peng, Yue Yu, Shiqi Yu. Re-Thinking the Effectiveness of Batch Normalization and Beyond. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023. (SCI, CCF-A) [pdf]
Xiaojie Li, Jianlong Wu, Shaowei He, Kang Shuo, Yue Yu, Liqiang Nie, Min Zhang. Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning. The 31st ACM International Conference on Multimedia (ACM MM 2023), Ottawa, Canada. (CCF-A) [pdf]
Xiaojie Li, Shaowei He, Jianlong Wu, Yue Yu, Liqiang Nie, Min Zhang. Mask Again: Masked Knowledge Distillation for Masked Video Modeling. The 31st ACM International Conference on Multimedia (ACM MM 2023), Ottawa, Canada. (CCF-A) [pdf]
Jiaying Li, Yan Lei, Shanshan Li, Haifang Zhou, Yue Yu, Zhouyang Jia, Yingwei Ma, Teng Wang. A Two-Stage Framework for Ambiguous Classification in Software Engineering. The 34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023), Florence, Italy. (CCF-B) [pdf]

2022

Zhixing Li, Yue Yu (corresponding author), Tao Wang, Lei Yan, Ying Wang, and Huaimin Wang. To Follow or Not to Follow: Understanding Issue/Pull-Request Templates on GitHub. IEEE Transactions on Software Engineering (TSE), 2022. (SCI, CCF-A) [pdf]
Tanghaoran Zhang, Yue Yu (corresponding author), Xinjun Mao, Yao Lu, Zhixing Li, Huaimin Wang. FENSE: A Feature-Based Ensemble Modeling Approach to Cross-Project Just-in-Time Defect Prediction. Empirical Software Engineering (EMSE), 2022. (CCF-B, JCR-1) [pdf]
Chen Zeng, Yue Yu (corresponding author), Shanshan Li, Xin Xia, Zhiming Wang, Mingyang Geng, Linxiao Bai, Wei Dong, and Xiangke Liao. deGraphCS: Embedding Variable-based Flow Graph for Neural Code Search. ACM Transactions on Software Engineering and Methodology (TOSEM), 2022. (SCI, CCF-A) [pdf]
Xunhui Zhang, Yue Yu (corresponding author), Georgios Gousios, Ayushi Rastogi. Pull Request Decisions Explained: An Empirical Overview. IEEE Transactions on Software Engineering (TSE), 2022. (SCI, CCF-A) [pdf]
Xunhui Zhang, Yue Yu (corresponding author), Tao Wang, Ayushi Rastogi, Huaimin Wang. Pull Request Latency Explained: An Empirical Overview. Empirical Software Engineering (EMSE), 2022. (CCF-B, JCR-1) [pdf][ESEC/FSE 2022 Journal First]
Zhixing Li, Yue Yu (corresponding author), Tao Wang, Shanshan Li, Huaimin Wang. Opportunities and Challenges in Repeated Revisions to Pull-Requests: An Empirical Study. The 25th ACM Conference On Computer-Supported Cooperative Work And Social Computing (CSCW 2022), Taipei, China. (CCF-A) [pdf]
Xunhui Zhang, Tao Wang, Yue Yu (corresponding author), Qiubing Zeng, Zhixing Li, Huaimin Wang. Who, What, Why and How? Towards the Monetary Incentive in Crowd Collaboration: A Case Study of Github's Sponsor Mechanism. The ACM CHI Conference on Human Factors in Computing Systems (CHI 2022), New Orleans, USA. (CCF-A) [pdf]
Haochen He, Zhouyang Jia, Shanshan Li, Yue Yu (corresponding author), Chenglong Zhou, Qing Liao, Ji Wang, Xiangke Liao. Multi-Intention-Aware Configuration Selection for Performance Tuning. The 44th International Conference on Software Engineering (ICSE 2022), Pittsburgh, USA. (CCF-A) [pdf]
Deze Wang, Zhouyang Jia, Shanshan Li, Yue Yu, Yun Xiong, Wei Dong, Xiangke Liao. Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding. The 44th International Conference on Software Engineering (ICSE 2022), Pittsburgh, USA. (CCF-A) [pdf]
Haibin Zheng, Zhiqing Chen, Tianyu Du, Xuhong Zhang, Yao Cheng, Shouling Ji, Jingyi Wang, Yue Yu, Jinyin Chen. NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification. The 44th International Conference on Software Engineering (ICSE 2022), Pittsburgh, USA. (CCF-A) [pdf]
Huan Xie, Yan Lei, Meng Yan, Yue Yu, Xin Xia, Xiaoguang Mao. A Universal Data Augmentation Approach for Fault Localization. The 44th International Conference on Software Engineering (ICSE 2022), Pittsburgh, USA. (CCF-A) [pdf]
Zhuo Zhang, Yan Lei, Meng Yan, Yue Yu, Jiachi Chen, Shangwen Wang, Xiaoguang Mao. Reentrancy Vulnerability Detection and Localization: A Deep Learning Based Two-phase Approach. The 37th IEEE/ACM International Conference on Automated Software Engineering (ASE 2022), Michigan, USA. (CCF-A) [pdf]
Shuyao Jiang, Jiacheng Shen, Shengnan Wu, Yu Cai, Yue Yu, Yangfan Zhou. Towards Usable Neural Comment Generation via Code-comment Linkage Interpretation: Method and Empirical Study. IEEE Transactions on Software Engineering (TSE), 2022. (SCI, CCF-A) [pdf]
Zhuo Zhang, Yan Lei, Ting Su, Meng Yan, Xiaoguang Mao, Yue Yu. Influential Global and Local Contexts Guided Trace Representation for Fault Localization. ACM Transactions on Software Engineering and Methodology (TOSEM), 2022. (SCI, CCF-A) [pdf]

2021

Zhixing Li, Yue Yu (corresponding author), Tao Wang, Gang Yin, Shanshan Li, and Huaimin Wang. Are You Still Working on This? An Empirical Study on Pull Request Abandonment. IEEE Transactions on Software Engineering (TSE), 2021. (SCI, CCF-A) [pdf][data][ICSE 2022 Journal First]
Deze Wang, Yue Yu (corresponding author), Shanshan Li, Wei Dong, Ji Wang, Qing Liao. MulCode: A Multi-task Learning Approach for Source Code Understanding. The 28th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER 2021). (CCF-B) [pdf]
Haoran Liu, Yue Yu (corresponding author), Shanshan Li, Mingyang Geng, Xiaoguang Mao, Xiangke Liao. How to Cherry Pick the Bug Report for Better Summarization?. Empirical Software Engineering (EMSE), 2021. (CCF-B, JCR-1) [pdf]
Yuqing Ma, Shihao Bai, Wei Liu, Shuo Wang, Yue Yu, Xiao Bai, Xianglong Liu, Meng Wang. Transductive Relation-Propagation with Decoupling Training for Few-Shot Learning. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021. (CCF-B) [pdf]
Zuohui Chen, Renxuan Wang, Jingyang Xiang, Yue Yu, Xin Xia, Shouling Ji, Qi Xuan, Xiaoniu Yang. Detecting Adversarial Samples with Graph-Guided Testing. The 36th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2021. (CCF-A, Late Breaking Results Track) [pdf]
Yu Zhang, Yue Yu (corresponding author), Tao Wang, Zhixing Li and Xiaochuan Wang. Dual Channel Among Task and Contribution on OSS Communities: An Empirical Study. International Journal of Software Engineering and Knowledge Engineering (IJSEKE), 2021. (SCI, CCF-C) [pdf]
PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation. arXiv preprint arXiv:2104.12369 (2021). [pdf]

2020

Zhixing Li, Yue Yu (corresponding author), Minghui Zhou, Tao Wang, Gang Yin, Long Lan, and Huaimin Wang. Redundancy, Context, and Preference: An Empirical Study of Duplicate Pull Requests in OSS Projects. IEEE Transactions on Software Engineering (TSE), 2020. (SCI, CCF-A) [pdf][data]
Haoran Liu, Yue Yu (corresponding author), Shanshan Li, Yong Guo, Deze Wang, Xiaoguang Mao. BugSum: Deep Context Understanding for Bug Report Summarization. The 28th IEEE/ACM International Conference on Program Comprehension (ICPC 2020), Seoul, South Korea. (CCF-B) [pdf]
Xunhui Zhang, Ayushi Rastogi, Yue Yu (corresponding author). On the Shoulders of Giants: A New Dataset for Pull-based Development Research. The 17th International Conference on Mining Software Repositories (MSR 2020), Seoul, South Korea. (CCF-C, CORE-A) [pdf][data]
Jinyin Chen, Keke Hu, Yue Yu, Zhuangzhi Chen, Qi Xuan, Yi Liu, Vladimir Filkov. Software Visualization and Deep Transfer Learning for Effective Software Defect Prediction. The 42nd International Conference on Software Engineering (ICSE 2020), Seoul, South Korea. (CCF-A) [pdf]
Haochen He, Zhouyang Jia, Shanshan Li, Erci Xu, Tingting Yu, Yue Yu, Ji Wang, Xiangke Liao. CP-Detector: Using Configuration-related Performance Properties to Expose Performance Bugs. The 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020), Melbourne, Australia. (CCF-A) [pdf]
Weijiang Feng, Long Lan, Yong Luo, Yue Yu, Xiang Zhang, Zhigang Luo. Near-Online Multi-pedestrian Tracking via Combining Multiple Consistent Appearance Cues. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT). (SCI, CCF-B) [pdf]
Tao Wang, Gang Yin, Yue Yu, Yang Zhang, Huaimin Wang. Crowd-intelligence-based software development method and practices. Science China Information Sciences, Springer, 2020. (SCI, CCF-B, in Chinese) [pdf]

2019

Zhixing Li, Yue Yu (corresponding author), Tao Wang, Gang Yin, Xinjun Mao, Huaimin Wang. Detecting Duplicate Contributions in Pull-based Model Combining Textual and Change Similarities. Journal of Computer Science and Technology (JCST), Springer, 2019. (SCI, CCF-B) [pdf]
Qiang Fan, Yue Yu (corresponding author), Tao Wang, Gang Yin, Huaimin Wang. Why API Documentation is Insufficient for Developers: An Empirical Study. Science China Information Sciences, Springer, 2019. (SCI, CCF-B, Letter) [pdf]
Zhixing Li, Yue Yu (corresponding author), Tao Wang, Gang Yin, Xinjun Mao, Huaimin Wang. HAF: A Hybrid Annotation Framework Based on Expert Knowledge and Learning Technique. Science China Information Sciences, Springer, 2019. (SCI, CCF-B, Letter) [pdf]
Dongyang Hu, Yang Zhang, Junsheng Chang, Gang Yin, Yue Yu, Tao Wang. Multi-reviewing pull-requests: An exploratory study on GitHub OSS projects. Information and Software Technology (IST), Elsevier, 2019. (SCI, CCF-B) [pdf]
Haotian Wang, Wenjing Yang, Zhipeng Lin, Yue Yu. TMDA: Task-Specific Multi-Source Domain Adaptation via Clustering Embedded Adversarial Training. IEEE International Conference on Data Mining (ICDM 2019), Beijing, China, 2019. (CCF-B) [pdf]

2018

Yue Yu*, Zhixing Li*, Gang Yin, Tao Wang, Huaimin Wang. A Dataset of Duplicate Pull-requests in GitHub. The 15th International Conference on Mining Software Repositories (MSR 2018), Gothenburg, Sweden, 2018. (CCF-C, CORE-A) [pdf]
Yang Zhang, Yue Yu, Huaimin Wang, Bogdan Vasilescu and Vladimir Filkov. Within-Ecosystem Issue Linking: A Large-scale Study of Rails. The 7th International Workshop on Mining Software Repositories (SoftwareMining@ASE 2018), France. 2018. [pdf]
Shangwen Wang, Tao Wang, Xiaoguang Mao, Gang Yin, Yue Yu. A Hybrid Approach for Tag Hierarchy Construction. The 17th International Conference on Software Reuse (ICSR 2018), Madrid, Spain, 2018. [pdf]
Yue Yu, Yarong Zeng, Qiang Fan, Huaimin Wang. Transferring Well-Trained Models for Cross-Project Issue Classification: A Large-Scale Empirical Study. The 10th Asia-Pacific Symposium on Internetware (Internetware 2018), Beijing, China, 2018. [pdf]

2017

Qiang Fan, Yue Yu (corresponding author), Gang Yin, Tao Wang, Huaimin Wang. Where is the Road for Issue Reports Classification Based on Text Mining. Empirical Software Engineering and Measurement (ESEM 2017), Toronto, Canada, 2017. (CCF-B) [pdf]
Zhixing Li, Yue Yu (corresponding author), Gang Yin, Tao Wang, Qiang Fan, Huaimin Wang. Automatic Classification of Review Comments in Pull-based Development Model. The 29th International Conference on Software Engineering and Knowledge Engineering (SEKE), Pittsburgh, USA, 2017. (CCF-C) [pdf]
Zhixing Li, Yue Yu (corresponding author), Gang Yin, Tao Wang, Huaimin Wang. What are they talking about? Analyzing Code Reviews in Pull-based Development Model. Journal of Computer Science and Technology (JCST), Springer, 2017. (SCI, CCF-B) [pdf]
Xunhui Zhang, Tao Wang, Gang Yin, Cheng Yang, Yue Yu, Huaimin Wang. DevRec: A Developer Recommendation System for Open Source Repositories. International Conference on Software Reuse (ICSR), Springer, 2017. (CCF-C) [pdf]
Zhixing Li, Gang Yin, Yue Yu, Tao Wang, Huaimin Wang. Detecting duplicate pull-requests in GitHub. Asia-Pacific Symposium on Internetware, ACM, 2017. [pdf]

2016

Yue Yu, Huaimin Wang, Gang Yin, Tao Wang. Reviewer Recommendation for Pull-Requests in GitHub: What Can We Learn from Code Review and Bug Assignment?. Information and Software Technology (IST), Elsevier, 2016. (SCI, CCF-B) [pdf]
Yue Yu, Gang Yin, Tao Wang, Cheng Yang, Huaimin Wang. Determinants of pull-based development in the context of continuous integration. Science China Information Sciences, Springer, 2016. (SCI, CCF-B) [pdf]
Yang Zhang, Huaimin Wang, Gang Yin, Tao Wang, Yue Yu. Social media in GitHub: the role of @-mention in assisting software development. Science China Information Sciences, Springer, 2016. (SCI, CCF-B) [pdf]
Zhixing Li, Gang Yin, Yang Zhang, Yue Yu, and Huaimin Wang. Correlation-based software search by leveraging software term database. Frontiers of Computer Science (FCS), Springer, 2016. (SCI, CCF-C) [pdf]

2015

Bogdan Vasilescu*, Yue Yu*, Huaimin Wang, Prem Devanbu, Vladimir Filkov. Quality and Productivity Outcomes Relating to Continuous Integration in GitHub. The 10th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE 2015), Bergamo, Italy, 2015. (*both are first authors and contributed equally to the work, CCF-A) [pdf]
Yue Yu, Huaimin Wang, Vladimir Filkov, Prem Devanbu, Bogdan Vasilescu. Wait For It: Determinants of Pull Request Evaluation Latency on GitHub. The 12th International Conference on Mining Software Repositories (MSR 2015), Florence, Italy, 2015. (CCF-C, CORE-A) [pdf]
Yang Zhang, Huaimin Wang, Gang Yin, Tao Wang, Yue Yu. Exploring the Use of @-mention to Assist Software Development in GitHub. The Seventh Asia-Pacific Symposium on Internetware, Wuhan, China, 2015. [pdf]
Yang Zhang, Huaimin Wang, Gang Yin, Tao Wang, Yue Yu. Evaluating Bug Severity Using Crowd-based Knowledge: An Exploratory Study. The Seventh Asia-Pacific Symposium on Internetware, Wuhan, China, 2015. [pdf]

Awards and Honors

ACM Gordon Bell Prize Finalist for Climate Modelling: Kilometer-Scale AI-Powered and Performance-Portable Earth System Model (AP3ESM) to Achieve Year-Scale Simulation Speed on Heterogeneous Supercomputers. International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2025), St. Louis, USA. [pdf]
ACM Gordon Bell Prize Finalist for Climate Modelling: A Performance-Portable Kilometer-Scale Global Ocean Model on ORISE and New Sunway Heterogeneous Supercomputers. International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2024), Atlanta, USA. [pdf] [Finalist]
CCF NASAC Youth Software Innovation Award, 2024 (NASAC青年软件创新奖)
DAMO Academy Young Science Talent “Most Promising Award”, 2024 (达摩院青橙奖最具潜力奖)
Outstanding Ph.D. Thesis Award of Hunan Province, China, 2018 (湖南省优秀博士学位论文奖)
ACM Changsha Doctoral Dissertation Award, China, 2016 (ACM中国长沙分会优秀博士论文奖)
2013 OW2 International Programming Contest Special Prize (Only One)
2012 OW2 International Programming Contest First Prize (Top 1)
2011 National Information Security Competition Third Prize (17%)
2010 National Information Security Competition First Prize (2.3%)
Awarded First-class Scholarship in WHU 2010-2011 (Top 5%)
Awarded National Scholarship of China 2009-2010 (Top 2)
Honored as Outstanding Graduate of WHU 2007-2011 (20%)