Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A data-driven approach for understanding invalid bug reports: An industrial case study
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0002-5964-5554
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0001-7266-5632
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0003-0639-4234
Lund University.
2023 (English)In: Information and Software Technology, ISSN 0950-5849, E-ISSN 1873-6025, Vol. 164, article id 107305Article in journal (Refereed) Published
Abstract [en]

Context: Bug reports created during software development and maintenance do not always describe deviations from a system's valid behavior. Such invalid bug reports may consume significant resources and adversely affect the prioritization and resolution of valid bug reports. There is a need to identify preventive actions to reduce the inflow of invalid bug reports. Existing research has shown that manually analyzing invalid bug report descriptions provides cues regarding preventive actions. However, such a manual approach is not cost-effective due to the time required to analyze a sufficiently large number of bug reports needed to identify useful patterns. Furthermore, the analysis needs to be repeated as the underlying causes of invalid bug reports change over time. Objective: In this study, we propose and evaluate the use of Latent Dirichlet Allocation (LDA), a topic modeling approach, to support practitioners in suggesting preventive actions to avoid the creation of similar invalid bug reports in the future. Method: In an industrial case study, we first manually analyzed descriptions of invalid bug reports to identify common patterns in their descriptions. We further investigated to what extent LDA can support this manual process. We used expert-based validation to evaluate the relevance of identified common patterns and their usefulness in suggesting preventive measures. Results: We found that invalid bug reports have common patterns that are perceived as relevant, and they can be used to devise preventive measures. Furthermore, the identification of common patterns can be supported with automation. Conclusion: Using LDA, practitioners can effectively identify representative groups of bug reports (i.e., relevant common patterns) from a large number of bug reports and analyze them further to devise preventive measures. © 2023 The Author(s)

Place, publisher, year, edition, pages
Elsevier, 2023. Vol. 164, article id 107305
Keywords [en]
Bug classification, Bug management, Invalid bug reports, LDA, Software analytics, Software maintenance, Topic modeling, Cost effectiveness, Software design, Statistics, Bug managements, Bug reports, Industrial case study, Invalid bug report, Latent Dirichlet allocation, Preventive action, Preventive measures, Software analytic, Computer software maintenance
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:bth-25295DOI: 10.1016/j.infsof.2023.107305ISI: 001053508400001Scopus ID: 2-s2.0-85166970380OAI: oai:DiVA.org:bth-25295DiVA, id: diva2:1789165
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsKnowledge Foundation, 20220235Available from: 2023-08-18 Created: 2023-08-18 Last updated: 2025-09-30Bibliographically approved
In thesis
1. Software Analytics for Supporting Practitioners in Bug Management
Open this publication in new window or tab >>Software Analytics for Supporting Practitioners in Bug Management
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Context: In large-scale software development, a large number of bug reports are submitted during software development and maintenance. Practitioners need the ability to analyze this abundant data to make data-driven decisions about bug management tasks.

Objective: This thesis aims to utilize software analytics (SA) to support practitioners in bug management. The objectives of this thesis are (1) to identify and structure the knowledge on the use of SA for software engineering (SE) tasks and (2) to investigate and evaluate the practical application of SA to support practitioners in managing invalid bug reports (IBRs).

Method: We conducted a tertiary review and systematic mapping study to achieve the first objective and comparative experiments and two industrial case studies to achieve the second objective. Throughout the thesis work, we relied on a technology transfer model to guide the research and facilitate the adoption of ML techniques for the early identification of IBRs at the case company.

Results: We provide a comprehensive map of various SA applications for SE tasks and a decision matrix that can assist in selecting the most appropriate ML technique for bug report classification for a given context. Our results indicate that an ML technique can identify IBRs with acceptable accuracy at an early stage in practice. Furthermore, the results of an SA-based approach indicate that it can support practitioners in devising preventive measures for IBRs.

Conclusion: Through industrial validations, this thesis provides evidence of the usefulness of SA in bug management, particularly in supporting practitioners in managing IBRs in large-scale software development.

Place, publisher, year, edition, pages
Karlskrona: Blekinge Tekniska Högskola, 2025. p. 231
Series
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 2025:02
Keywords
Issue management, Bug reports, Invalid bug reports, Software analytics, Machine learning, AutoML, Large language models
National Category
Software Engineering
Research subject
Software Engineering
Identifiers
urn:nbn:se:bth-27197 (URN)978-91-7295-494-6 (ISBN)
Public defence
2025-02-13, J1630, Campus Gräsvik, Karlskrona, 13:15 (English)
Opponent
Supervisors
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2024-12-04 Created: 2024-12-03 Last updated: 2025-09-30Bibliographically approved

Open Access in DiVA

fulltext(843 kB)269 downloads
File information
File name FULLTEXT01.pdfFile size 843 kBChecksum SHA-512
70489919f33a0c4f55b2c1ede237ba64293b3a0ba803c2e49969efd0638d40970288032d44a069c493937b9f8bade803a46cd2be60e696e5590cd47f7618d728
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Laiq, MuhammadAli, Nauman binBörstler, Jürgen
By organisation
Department of Software Engineering
In the same journal
Information and Software Technology
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 278 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 523 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf