Yue Yu (余跃)

Hi there! I am currently an Associate Professor of Computer Science at National University of Defense Technology. I joined Trustie Group at August 2011, advised by Prof. Huaimin Wang. Fortunately, I have got the great opportunity to visit UC Davis supported by CSC scholarship.

My main research interest is software engineering, spanning from mining software repositories, analyzing social coding networks and software crowdsourcing.


Contact Information

  • National Laboratory for Parallel and Distributed Processing
    College of Computer, National University of Defense Technology
    Changsha, Hunan Province, 410073, P.R.C
    Email: yuyue <at> nudt <dot> edu <dot> cn


Education

  • DECAL Lab, Computer Science Department, University of California, Davis, USA.
    2014.10--2015.10 Visiting Ph.D. student. Advisor: Prem Devanbu and Vladimir Filkov
  •  
  • National Laboratory for Parallel and Distributed Processing (PDL), National University of Defense Technology (NUDT), China.
    2013.3--2016.6 Ph.D. in Software Engineering. Advisor: Huaimin Wang
  •  
  • Computer School, National University of Defense Technology, China.
    2011.9--2013.3 M.S. in Computer Science. Advisor: Huaimin Wang
  •  
  • Computer School, Wuhan University (WHU), China. (Top 2, Postgraduate Recommendation)
    2007.9--2011.6 B.E. in Information Security
  •  
  • School of Journalism and Communication, Wuhan University, China
    2009.03-2010.06 Minor in Journalism and Communication.

Community Service

    Reviewer:
  • Conference: AAAI 2019-2021, IJCAI 2019, CHI 2018, CCCC 2016, ICPC 2015, APSEC 2015, Internetware 2015
  • Journal: IEEE Transactions on Software Engineering (TSE), ACM Transactions on Software Engineering and Methodology (TOSEM), IEEE Transactions on Reliability (TR), ACM Transactions on Internet Technology (TOIT), Empirical Software Engineering (ESE), Information and Software Technology (IST), Journal of Systems and Software (JSS), Science China Information Sciences, Journal of Software
  • Program/Organizing Committee Member: MSR 2017, ICA3PP 2017, ATC 2018


Projects

    Principal Investigator:

  • OSS Service Environment for the New Generation of Artificial Intelligence, National Grand Research and Development Plan (2020AAA0103504), 2020.07-2023.06.
  •  
  • Research on Theories and Mechanisms of Crowd-based Development for Open Source Ecosystem, National Natural Science Foundation of China (61702534), 2018.01-2020.12.
  •  
  • Mining Social Coding Repository and Network, Postgraduate Research and Innovation Project of Hunan Province (CX2013B032), 2013.03-2015.03.
  •  

    Collaborator:

  • Intelligent Software Development Environment for Crowd Collaboration, National Grand Research and Development Plan (2016YFB1000805), 2016.07-2019.06.
  •  
  • Research on Software Situation Analysis based on Knowlege Mapping of Open Source Ecosystem, National Natural Science Foundation of China (61502512), 2016.01-2018.12.
  •  
  • Research on Software Requirement Elicitation and Modeling based on Crowd Collaboration in Network Environment, National Natural Science Foundation of China (61432020), 2015.01-2019.12.
  •  
  • Research and Application of Software Information Network Mining for Crowd Production, National Natural Science Foundation of China (61472430), 2015.01-2018.12.
  •  

Publications

2021

  • Zhixing Li, Yue Yu (corresponding author), Tao Wang, Gang Yin, Shanshan Li, and Huaimin Wang. Are You Still Working on This? An Empirical Study on Pull Request Abandonment. IEEE Transactions on Software Engineering (TSE), 2021. (SCI, CCF-A) [pdf][data]
  • Deze Wang, Yue Yu (corresponding author), Shanshan Li, Wei Dong, Ji Wang, Qing liao. MulCode: A Multi-task Learning Approach for Source Code Understanding. The 28th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER 2021). (CCF-B) [pdf]
  • Haoran Liu, Yue Yu (corresponding author), Shanshan Li, Mingyang Gen, Xiaoguang Mao, Xiangke Liao. How to Cherry Pick the Bug Report for Better Summarization?. Empirical Software Engineering (EMSE), 2021. (CCF-B) [pdf]
  • Yuqing Ma, Shihao Bai, Wei Liu, Shuo Wang, Yue Yu, Xiao Bai, Xianglong Liu, Meng Wang. Transductive Relation-Propagation with Decoupling Training for Few-Shot Learning. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021. (CCF-B) [pdf]
  • Zuohui Chen, Renxuan Wang, Jingyang Xiang, Yue Yu, Xin Xia, Shouling Ji, Qi Xuan, Xiaoniu Yang. Detecting Adversarial Samples with Graph-Guided Testing. The 36th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2021. (CCF-A, Late Breaking Results Track)
  • Yu Zhang, Yue Yu (corresponding author), Tao Wang, Zhixing Li and Xiaochuan Wang. Dual Channel Among Task and Contribution on OSS Communities: An Empirical Study. International Journal of Software Engineering and Knowledge Engineering (IJSEKE), 2021. (SCI, CCF-C) [pdf]

2020

  • Zhixing Li, Yue Yu (corresponding author), Minghui Zhou, Tao Wang, Gang Yin, Long Lan, and Huaimin Wang. Redundancy, Context, and Preference: An Empirical Study of Duplicate Pull Requests in OSS Projects. IEEE Transactions on Software Engineering (TSE), 2020. (SCI, CCF-A) [pdf][data]
  • Haoran Liu, Yue Yu (corresponding author), Shanshan Li, Yong Guo, Deze Wang, Xiaoguang Mao. BugSum: Deep Context Understanding for Bug Report Summarization. The 28th IEEE/ACM International Conference on Program Comprehension (ICPC 2020), Seoul, South Korea. (CCF-B) [pdf]
  • Xunhui Zhang, Ayushi Rastogi, Yue Yu (corresponding author). On the Shoulders of Giants: A New Dataset for Pull-based Development Research. The 17th International Conference on Mining Software Repositories (MSR 2020), Seoul, South Korea. (CCF-C, CORE-A) [pdf][data]
  • Jinyin Chen, Keke Hu, Yue Yu, Zhuangzhi Chen, Qi Xuan, Yi Liu, Vladimir Filkov. Software Visualization and Deep Transfer Learning for Effective Software Defect Prediction. The 42th International Conference on Software Engineering (ICSE 2020), Seoul, South Korea. (CCF-A) [pdf]
  • Haochen He, Zhouyang Jia, Shanshan Li, Erci Xu, Tingting Yu, Yue Yu, Ji Wang, Xiangke Liao. CP-Detector: Using Configuration-related Performance Properties to Expose Performance Bugs. The 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020), Melbourne, Australia. (CCF-A) [pdf]
  • Weijiang Feng, Long Lan, Yong Luo, Yue Yu, Xiang Zhang, Zhigang Luo. Near-Online Multi-pedestrian Tracking via Combining Multiple Consistent Appearance Cues. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) (SCI, CCF-B) [pdf]
  • Tao Wang, Gang Yin, Yue Yu, Yang Zhang, Huaimin Wang. Crowd-intelligence-based software development method and practices. Science China Information Sciences, Springer, 2020. (SCI, CCF-B, in Chinese) [pdf]

2019

  • Zhixing Li, Yue Yu (corresponding author), Tao Wang, Gang Yin, Xinjun Mao, Huaimin Wang. Detecting Duplicate Contributions in Pull-based Model Combining Textual and Change Similarities. Journal of Computer Science and Technology (JCST), Springer, 2019. (SCI, CCF-B) [pdf]
  • Fan Qiang, Yue Yu (corresponding author), Tao Wang, Gang Yin, Huaimin Wang. Why API Documentation is Insufficient for Developers: an Empirical Study. Science China Information Sciences, Springer, 2019. (SCI, CCF-B, Letter) [pdf]
  • Zhixing Li, Yue Yu (corresponding author), Tao Wang, Gang Yin, Xinjun Mao, Huaimin Wang. HAF: A Hybrid Annotation Framework Based on Expert Knowledge and Learning Technique. Science China Information Sciences, Springer, 2019. (SCI, CCF-B, Letter) [pdf]
  • Dongyang Hu, Yang Zhang, Junsheng Chang, Gang Yin, Yue Yu, Tao Wang. Multi-reviewing pull-requests: An exploratory study on GitHub OSS projects. Information and Software Technology (IST), Elsevier, 2019. (SCI, CCF-B) [pdf]
  • Haotian Wang, Wenjing Yang, Zhipeng Lin, Yue Yu. TMDA: Task-Specific Multi-Source Domain Adaptation via Clustering Embedded Adversarial Training. IEEE International Conference on Data Mining (ICDM 2019), Beijing, China, 2019. (CCF-B) [pdf]

2018

  • Yue Yu*, Zhixing Li*, Gang Yin, Tao Wang, Huaimin Wang. A Dataset of Duplicate Pull-requests in GitHub. The 15th International Conference on Mining Software Repositories (MSR 2018), Gothenburg, Sweden, 2018. (CCF-C, CORE-A) [pdf]
  • Yang Zhang, Yue Yu, Huaimin Wang, Bogdan Vasilescu and Vladimir Filkov. Within-Ecosystem Issue Linking: A Large-scale Study of Rails. The 7th International Workshop on Mining Software Repositories (SoftwareMining@ASE 2018), France. 2018. [pdf]
  • Shangwen Wang, Tao Wang, Xiaoguang Mao, Gang Yin, Yue Yu. A Hybrid Approach for Tag Hierarchy Construction. The 17th International Conference on Software Reuse (ICSR 2018), Madrid, Spain, 2018. [pdf]
  • Yue Yu, Yarong Zeng, Qiang Fan, Huaimin Wang. Transferring Well-Trained Models for Cross-Project Issue Classification: A Large-Scale Empirical Study. The 10th Asia-Pacific Symposium on Internetware (Internetware 2018), Beijing, China, 2018. [pdf]

2017

  • Qiang Fan, Yue Yu (corresponding author), Gang Yin, Tao Wang, Huaimin Wang. Where is the Road for Issue Reports Classification Based on Text Mining. Empirical Software Engineering and Measurement (ESEM 2017), Toronto, Canada, 2017. (CCF-B) [pdf]
  • Zhixing Li, Yue Yu (corresponding author), Gang Yin, Tao Wang, Qiang Fan, Huaimin Wang. Automatic Classification of Review Comments in Pull-based Development Model. The 29th International Conference on Software Engineering and Knowledge Engineering (SEKE), Pittsburgh, USA, 2017. (CCF-C) [pdf]
  • Zhixing Li, Yue Yu (corresponding author), Gang Yin, Tao Wang, Huaimin Wang. What are they talking about? Analyzing Code Reviews in Pull-based Development Model. Journal of Computer Science and Technology (JCST), Springer, 2017. (SCI, CCF-B) [pdf]
  • Xunhui Zhang, Tao Wang, Gang Yin, Cheng Yang, Yue Yu, Huaimin Wang. DevRec: A Developer Recommendation System for Open Source Repositories. International Conference on Software Reuse (ICSR), Springer, 2017. (CCF-C) [pdf]
  • Zhixing Li, Gang Yin, Yue Yu, Tao Wang, Huaimin Wang. Detecting duplicate pull-requests in GitHub. Asia-Pacific Symposium on Internetware, ACM, 2017. [pdf]

2016

  • Yue Yu, Huaimin Wang, Gang Yin, Tao Wang. Reviewer Recommendation for Pull-Requests in GitHub: What Can We Learn from Code Review and Bug Assignment?. Information and Software Technology (IST), Elsevier, 2016. (SCI, CCF-B) [pdf]
  • Yue Yu, Gang Yin, Tao Wang, Cheng Yang, Huaimin Wang. Determinants of pull-based development in the context of continuous integration. Science China Information Sciences, Springer, 2016. (SCI, CCF-B) [pdf]
  • Yang Zhang, Huaimin Wang, Gang Yin, Tao Wang, Yue Yu. Social media in GitHub: the role of @-mention in assisting software development. Science China Information Sciences, Springer, 2016. (SCI, CCF-B) [pdf]
  • Zhixing Li, Gang Yin, Yang Zhang, Yue Yu, and Huaimin Wang. Correlation-based software search by leveraging software term database. Frontiers of Computer Science (FCS), Springer, 2016. (SCI, CCF-C) [pdf]

2015

  • Bogdan Vasilescu*, Yue Yu*, Huaimin Wang, Prem Devanbu, Vladimir Filkov. Quality and Productivity Outcomes Relating to Continuous Integration in GitHub. The 10th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE 2015), Bergamo, Italy, 2015. (*both are first authors and contributed equally to the work, CCF-A) [pdf]
  • Yue Yu, Huaimin Wang, Vladimir Filkov, Prem Devanbu, Bogdan Vasilescu. Wait For It: Determinants of Pull Request Evaluation Latency on GitHub. The 12th International Conference on Mining Software Repositories (MSR 2015), Florence, Italy, 2015. (CCF-C, CORE-A) [pdf]
  • Yang Zhang, Huaimin Wang, Gang Yin, Tao Wang, Yue Yu. Exploring the Use of @-mention to Assist Software Development in GitHub. The Seventh Asia-Pacific Symposium on Internetware, Wuhan, China, 2015. [pdf]
  • Yang Zhang, Huaimin Wang, Gang Yin, Tao Wang, Yue Yu. Evaluating Bug Severity Using Crowd-based Knowledge: An Exploratory Study. The Seventh Asia-Pacific Symposium on Internetware, Wuhan, China, 2015. [pdf]

2014

  • Yue Yu, Huaimin Wang, Gang Yin, Charles X. Ling. Reviewer Recommender of Pull-Requests in GitHub. The 30th IEEE International Conference on Software Maintenance and Evolution (ICSME 2014), Victoria, Canada, 2014. (CCF-B) [pdf][Demo]
  • Yue Yu, Huaimin Wang, Gang Yin, Charles X. Ling. Who Should Review This Pull-Request: Reviewer Recommendation to Expedite Crowd Collaboration. The 21th Asia-Pacific Software Engineering Conference (APSEC 2014), JEJU, KOREA, 2014. (Best paper, CCF-C) [pdf]
  • Yue Yu, Huaimin Wang, Gang Yin, Tao Wang. Exploring the Patterns of Social Behavior in GitHub. International Workshop on Crowd-based Software Development Methods and Technologies (CrowdSoft), Hong Kong, 2014. [pdf]
  • Yang Zhang, Gang Yin, Yue Yu, Huaimin Wang. Investigating social media in GitHub's pull-requests: a case study on Ruby on Rails. International Workshop on Crowd-based Software Development Methods and Technologies (CrowdSoft), Hong Kong, 2014.
  • Yang Zhang, Gang Yin, Yue Yu, Huaimin Wang. An Exploratory Study of @-mention in GitHub’s Pull-requests. The 21th Asia-Pacific Software Engineering Conference (APSEC 2014), JEJU, KOREA, 2014. (CCF-C)

2013

  • Yue Yu, Huaimin Wang, Gang Yin, Xiang Li. HESA: The Construction and Evaluation of Hierarchical Software Feature Repository. The 25th International Conference on Software Engineering and Knowledge Engineering (SEKE 2013), Boston, USA, 2013. (CCF-C) [pdf] [slide]
  • Yue Yu, Huaimin Wang, Gang Yin, Bo Liu. Mining and Recommending Software Features across Multiple Web Repositories. The Fifth Asia-Pacific Symposium on Internetware, Changsha, China, 2013. [pdf][slide]
  • Yue Yu, Huaimin Wang, Bo Liu, Gang Yin. A Trusted Remote Attestation Model based on Trusted Computing. The 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2013), Melbourne, Australia, 2013. (CCF-C) [pdf]

2012

  • Xiang Li, Huaimin Wang, Gang Yin, Tao Wang, Cheng Yang, Yue Yu, Dengqing Tang. Inducing Taxonomy from Tags: An Agglomerative Hierarchical Clustering Framework. Advanced Data Mining and Applications (ADMA 2012), Springer Berlin Heidelberg, 2012. [pdf]

2011

  • Fajiang Yu, Xianglei Tang, Yue Yu. Trusted Computing Dynamic Attestation by Using Static Analysis Based Behavior Model. Proceedings of 9th IEEE International Symposium on Parallel and Distributed Processing with Applications Workshops, Bussan, Korea, 2011. [pdf]
  • Fajiang Yu, Yuewei Xu, Yue Yu. Optimization of Program Behavior Model for Trusted Computing Dynamic Attestation. Journal of Computational Information Systems, 7(5): 1436-1445, 2011 [pdf]

2010

  • Fajiang Yu, Yue Yu. Static Analysis-based Behavior Model Building for Trusted Computing Dynamic Verification. Wuhan University Journal of Natural Sciences, 15(3): 195-200, Wuhan University and Springer-Verlag Berlin Heidelberg, 2010. [pdf]

Awards and Honors

  • Outstanding Ph.D. Thesis Award of Hunan Province, China, 2018; (湖南省优秀博士学位论文奖)
  • ACM Changsha Doctoral Dissertation Award, China, 2016; (ACM中国长沙分会优秀博士论文奖)
  • 2013 OW2 International Programming Contest         Special Prize (Only One)
  • 2012 OW2 International Programming Contest         First Prize (Top 1)
  • 2011 National Information Security Competition      Third Prize (17%)
  • 2010 National Information Security Competition      First Prize (2.3%)
  • Awarded First-class Scholarship in WHU             2010-2011 (Top 5%)
  • Awarded National Scholarship of China              2009-2010 (Top 2)
  • Honored as Outstanding Graduates of WHU            2007-2011 (20%)