Effect of Typos Correction on the validation performance of Duplicate Bug Reports Detection

Authors: Behzad Soleimani Neysiani
Conference: 10th International Conference on Information and Knowledge Technology (IKT)
Conference date: 2019-12-31 - 2021-01-01
Conference location: 1 - Tehran
Presented on behalf of: Information and Communication Technology Research Institute
Presentation type: Oral presentation
Conference level: International

Abstract

Typos are common in human-written text such as bug reports in software triage systems. More than half of bug reports contain typos. Interconnected typos are a common type of typo in bug reports. There are heuristic and non-heuristic approaches for automatic typo correction. There are also four datasets, Android, Eclipse, Mozilla, and Open Office, whose typos have been identified and, for some of them, corrected. This study evaluates the effect of typo correction on duplicate bug report detection (DBRD). The experimental results on the Android dataset show that typo correction improves the validation performance of DBRD by at most 1% on average, which is negligible. Also, it is better not to remove the typos from bug reports for DBRD. Automatic typo correction is therefore only slightly useful in DBRD as a pre-processing operator, but it can be more helpful while users are writing bug reports, when it can correct their typos in real time.

Keywords: Typo; Correction; Duplicate; Bug Report; Text Mining; Information Retrieval