Improving Transformer-based Program Repair Model through False Behavior Diagnosis.

EMNLP 2023（2023）

引用 0|浏览7

暂无评分

摘要

Research on automated program repairs using transformer-based models has recently gained considerable attention. The comprehension of the erroneous behavior of a model enables the identification of its inherent capacity and provides insights for improvement. However, the current landscape of research on program repair models lacks an investigation of their false behavior. Thus, we propose a methodology for diagnosing and treating the false behaviors of transformer-based program repair models. Specifically, we propose 1) a behavior vector that quantifies the behavior of the model when it generates an output, 2) a behavior discriminator (BeDisc) that identifies false behaviors, and 3) two methods for false behavior treatment. Through a large-scale experiment on 55,562 instances employing four datasets and three models, the BeDisc exhibited a balanced accuracy of 86.6\% for false behavior classification. The first treatment, namely, early abortion, successfully eliminated 60.4\% of false behavior while preserving 97.4\% repair accuracy. Furthermore, the second treatment, namely, masked bypassing, resulted in an average improvement of 40.5\% in the top-1 repair accuracy. These experimental results demonstrated the importance of investigating false behaviors in program repair models.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要