نویسندگان | Behzad Soleimani Neysiani |
---|---|
همایش | 10th International Conference on Information and Knowledge Technology (IKT) |
تاریخ برگزاری همایش | 2019-12-31 - 2021-01-01 |
محل برگزاری همایش | 1 - تهران |
ارائه به نام دانشگاه | پژوهشگاه ارتباطات و فناوری اطلاعات |
نوع ارائه | سخنرانی |
سطح همایش | بین المللی |
چکیده مقاله
Typos are usual in human typings like bug reports in software triage systems. More than half the percentages of bug reports have typos. Interconnected typos are a common type of typos in bug reports. There are some heuristic and non-heuristic approaches for automatic typo correction. Also, there are four datasets, including Android, Eclipse, Mozilla, and Open Office, which their typos are determined, and some of them are corrected. This study involves to evaluated the effect of typo correction on duplicate bug report detection (DBRD). The experimental results on the Android dataset show the typos correction can improve the validation performance of DBRD at most 1% averagely, which is negligible. Also, it is better to do not remove the typos from bug reports for DBRD. The automatic typo correction can be useful in DBRD a little as a pre-processing operator, but it can be more helpful when the users are writing the bug reports, which can correct their typos in realtime.
کلید واژه ها: Typo; Correction; Duplicate; Bug Report; Text Mining; Information Retrieval;