Digitala Vetenskapliga Arkivet

Endre søk
ExporteraLink to record
Permanent link

Direct link
BETA

Prosjekt

Prosjekttyp/Bidragsform
Grant to research environment
Tittel [en]
SERT- Software Engineering ReThought
Abstract [en]
SERT – Software Engineering ReThought is a groundbreaking research project with the aim to take on the next generation challenges facing companies developing software intensive systems and products. We as an engineering lab are blazing the road introducing 3:rd generation empirical software engineering – denoting close co-production of pragmatic problem solving in close collaboration with our industrial partners as we perform engineering research into topics critical for engineering and business success. SERTs formulation of 3:rd generation empirical software engineering will utilize related knowledge areas as catalysts to solve challenges. Value-based engineering, Data-driven evidence based engineering, and Human-based development will complement software engineering competence in an integrated eco-system of competence focused on the challenges at hand.All areas in software engineering, ranging from inception, realization to evolution are part of the research venture – reflecting that companies need solutions covering their entire ecosystem.
Publikasjoner (10 av 170) Visa alla publikasjoner
Zabardast, E., Paudel, B. & Gonzalez-Huerta, J. (2026). Architecture Degradation at Scale: Challenges and Insights from Practice. In: Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S. (Ed.), Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings. Paper presented at 26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025 (pp. 451-460). Springer Science+Business Media B.V.
Åpne denne publikasjonen i ny fane eller vindu >>Architecture Degradation at Scale: Challenges and Insights from Practice
2026 (engelsk)Inngår i: Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings / [ed] Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S., Springer Science+Business Media B.V., 2026, s. 451-460Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Large-scale software systems often experience architectural degradation, affecting maintainability, scalability, and quality. To investigate this, we conducted focus groups with senior practitioners across three large organizations. Our analysis revealed four core challenge areas: managing dependencies, ownership and organizational barriers, balancing agility with stability and cost, as well as documentation drift. These findings offer practical insights for mitigating architectural degradation in complex environments. 

sted, utgiver, år, opplag, sider
Springer Science+Business Media B.V., 2026
Serie
Lecture Notes in Computer Science, ISSN 0302-9743 ; 16361
Emneord
Architecture Degradation, Architecture Erosion, Challenges, Insights, Technical Debt, Degradation, Software engineering, Challenge, Focus groups, Four-core, Insight, Large organizations, Large-scale software systems, Organizational barriers, Technical debts, Architecture
HSV kategori
Identifikatorer
urn:nbn:se:bth-28988 (URN)10.1007/978-3-032-12089-2_30 (DOI)2-s2.0-105023325552 (Scopus ID)9783032120885 (ISBN)
Konferanse
26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025
Forskningsfinansiär
Knowledge Foundation, 20180010
Tilgjengelig fra: 2025-12-12 Laget: 2025-12-12 Sist oppdatert: 2025-12-12bibliografisk kontrollert
Sundelin, A. (2026). Learning Observability Tracing Through Experiential Learning. In: Giuseppe Scanniello, Valentina Lenarduzzi, Simone Romano, Sira Vegas, Rita Francese (Ed.), Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings. Paper presented at 26th International Conference on Product-Focused Software Process Improvement (PROFES 2025), Salerno, Dec 1-3, 2025 (pp. 419-428). Springer Nature
Åpne denne publikasjonen i ny fane eller vindu >>Learning Observability Tracing Through Experiential Learning
2026 (engelsk)Inngår i: Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings / [ed] Giuseppe Scanniello, Valentina Lenarduzzi, Simone Romano, Sira Vegas, Rita Francese, Springer Nature, 2026, s. 419-428Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

In a large-scale software development product development organization, we found that most developers, although experienced, were lacking architectural knowledge of the specific developed product.

As a remedy, we evaluated whether we could stimulate learning the product architecture by conducting training in how to use the product’s distributed tracing platform, built on the OpenTelemetry standard and the open-source Jaeger Tracing visualization tool.

We planned and participated in a training event, where parts of the organization explored, using experiential learning, how to set up and use tracing to troubleshoot a realistic fault scenario we prepared. Respondents were asked to rate the tool according to the Technology Adoption Model (TAM), and responses were collected on Likert-type scales, analyzed, and summarized using a Bayesian workflow.

Even as tool usage post-training was low, respondents still had a positive attitude toward using the tool, valued the experiential training, and expressed a strong intent to use the tool for program comprehension.

sted, utgiver, år, opplag, sider
Springer Nature, 2026
Serie
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 16361
Emneord
Micro-services, Observability, Distributed tracing, Experiential learning
HSV kategori
Identifikatorer
urn:nbn:se:bth-28612 (URN)10.1007/978-3-032-12089-2_27 (DOI)2-s2.0-1050233006192-s2.0-105023300619 (Scopus ID)9783032120892 (ISBN)
Konferanse
26th International Conference on Product-Focused Software Process Improvement (PROFES 2025), Salerno, Dec 1-3, 2025
Forskningsfinansiär
Knowledge Foundation, 20180010
Tilgjengelig fra: 2025-09-12 Laget: 2025-09-12 Sist oppdatert: 2025-12-12bibliografisk kontrollert
Novikov, O., Fucci, D., Adamov, O. & Mendez, D. (2026). Policy-Driven Software Bill of Materials on GitHub: An Empirical Study. In: Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S. (Ed.), Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings. Paper presented at 26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025 (pp. 253-268).
Åpne denne publikasjonen i ny fane eller vindu >>Policy-Driven Software Bill of Materials on GitHub: An Empirical Study
2026 (engelsk)Inngår i: Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings / [ed] Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S., 2026, s. 253-268Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Background. The Software Bill of Materials (SBOM) is a machine-readable list of all the software dependencies included in a software. SBOM emerged as way to assist securing the software supply chain. However, despite mandates from governments to use SBOM, research on this artifact is still in its early stages.

Aims. We want to understand the current state of SBOM in open-source projects, focusing specifically on policy-driven SBOMs—i.e., SBOM created to achieve security goals, such as enhancing project transparency and ensuring compliance, rather than being used as fixtures for tools or artificially generated for benchmarking or academic research purposes.

Method. We performed a mining software repository study to collect and carefully select 620 SBOM files hosted on GitHub. We analyzed the information reported in policy-driven SBOMs and the vulnerabilities associated with the declared dependencies by means of descriptive statistics.

Results. We show that only 0.56% of popular GitHub repositories contain policy-driven SBOM. The declared dependencies contain 2,202 unique vulnerabilities, while 22% of them do not report licensing information.

Conclusion. Our findings provide insights for SBOM usage to support security assessment and licensing. 

Serie
Lecture Notes in Computer Science, ISSN 0302-9743 ; 16361
Emneord
dependencies, open-source, SBOM, software security, Supply chain attacks, vulnerabilities, Network security, Open systems, Supply chains, Bill of materials, Dependency, Empirical studies, Policy driven, Software bill of material, Software dependencies, Supply chain attack, Vulnerability, Open source software
HSV kategori
Identifikatorer
urn:nbn:se:bth-28990 (URN)10.1007/978-3-032-12089-2_16 (DOI)2-s2.0-105023309206 (Scopus ID)9783032120885 (ISBN)
Konferanse
26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025
Forskningsfinansiär
Knowledge Foundation, 20180010
Tilgjengelig fra: 2025-12-12 Laget: 2025-12-12 Sist oppdatert: 2025-12-12bibliografisk kontrollert
Paudel, B., Gonzalez-Huerta, J. & Zabardast, E. (2026). Temporal Evolution of Architectural Complexity and Technical Debt in Microservices: An Exploratory Case Study. In: Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S. (Ed.), Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings. Paper presented at 26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025 (pp. 285-302). Springer Science+Business Media B.V.
Åpne denne publikasjonen i ny fane eller vindu >>Temporal Evolution of Architectural Complexity and Technical Debt in Microservices: An Exploratory Case Study
2026 (engelsk)Inngår i: Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings / [ed] Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S., Springer Science+Business Media B.V., 2026, s. 285-302Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Over the last decade, software organizations have increasingly adopted microservices to effectively deal with evolving software systems, frequent demands for new features, and changing technologies. However, microservices are not a silver bullet; their success depends on the specific context and needs of each organization. Therefore, tracking the evolution of architectural complexity indicators is crucial for effective architectural governance and decision-making. In this paper, we explore the relationship between architectural complexity indicators and their evolution, specifically declared dependencies, API endpoints, inter-service communications, size, and technical debt. We used the static source code analysis methods along with SonarQube to measure architectural complexity, collecting data on all indicators over the past two and a half years. Our findings indicate that architectural complexity consistently grows, even within microservices. Most importantly, these indicators co-evolve, making the overall architecture more complicated than expected. Additionally, all complexity indicators grow rapidly when services are small and still evolving. The insights gained from this study can assist organizations in effectively managing their microservices, highlighting when they might be most prone to architectural degradation. 

sted, utgiver, år, opplag, sider
Springer Science+Business Media B.V., 2026
Serie
Lecture Notes in Computer Science, ISSN 0302-9743 ; 16361
Emneord
Architectural Complexity, Complexity Evolution, Industrial Case Study, Microservices Architecture, Technical Debt, Architecture, Complexity indicators, Exploratory case studies, Microservice architecture, Software organization, Software-systems, Technical debts, Temporal evolution, Computer software
HSV kategori
Identifikatorer
urn:nbn:se:bth-28987 (URN)10.1007/978-3-032-12089-2_18 (DOI)2-s2.0-105023329652 (Scopus ID)9783032120885 (ISBN)
Konferanse
26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025
Forskningsfinansiär
Knowledge Foundation, 20180010
Tilgjengelig fra: 2025-12-12 Laget: 2025-12-12 Sist oppdatert: 2025-12-12bibliografisk kontrollert
Gonzalez-Huerta, J. & Zabardast, E. (2026). Towards Understanding Team Congestion in Large-Scale Software Development. In: Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S. (Ed.), Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings. Paper presented at 26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025 (pp. 353-368). Springer Science+Business Media B.V.
Åpne denne publikasjonen i ny fane eller vindu >>Towards Understanding Team Congestion in Large-Scale Software Development
2026 (engelsk)Inngår i: Product-Focused Software Process Improvement: 26th International Conference, PROFES 2025, Salerno, Italy, December 1–3, 2025, Proceedings / [ed] Scanniello G., Romano S., Francese R., Lenarduzzi V., Vegas S., Springer Science+Business Media B.V., 2026, s. 353-368Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Background: Software Development organisations tend to organise the development of software-intensive products and services as a constellation of components meant to be developed and maintained by independent, autonomous teams. However, the maintenance and evolution of said products and services require team collaboration and coordination. This collaboration and coordination overhead piles on top of teams’ workload, often hindering teams’ throughput and lead time. Objectives: This paper aims to discuss how the use of pull request data can help identify congested teams when the arrival of new tasks exceeds the team’s ability to close them. To do so, we have conducted an empirical study in a software development organisation developing a large-scale product, to try to characterise congested teams and the characteristics of the code reviews they are involved in. Method: We have conducted a case study to start exploring how code review data can help us model team congestion, and understand whether the features of the code-review network, or the team type (platform vs product), can have a major impact on team congestion. Results: The results show that teams seem to experience varying levels of congestion based on pull request activity, with some indicating potential congestion. However, increased PR accumulation did not consistently lead to longer lead times, as seen in some teams where high PR backlogs did not significantly impact delivery cadence. Conclusions: Our findings suggest that while PR data can indicate potential congestion, its impact on lead time varies across teams. Both technical factors and unobserved contextual elements shape congestion. Deeper insights require combining repository metrics with qualitative inputs. 

sted, utgiver, år, opplag, sider
Springer Science+Business Media B.V., 2026
Serie
Lecture Notes in Computer Science, ISSN 0302-9743 ; 16361
Emneord
Case Study, Code-Review Data Analysis, Team Congestion, Team Coordination, Codes (symbols), Data reduction, Human resource management, Traffic congestion, Case-studies, Code review, Code-review data analyze, Large-scales, Leadtime, Product and services, Software development organizations, Team collaboration, Software design
HSV kategori
Identifikatorer
urn:nbn:se:bth-28991 (URN)10.1007/978-3-032-12089-2_22 (DOI)2-s2.0-105023305765 (Scopus ID)9783032120885 (ISBN)
Konferanse
26th International Conference on Product-Focused Software Process Improvement, PROFES 2025, Salerno, Dec 1-3, 2025
Forskningsfinansiär
Knowledge Foundation, 20180010
Tilgjengelig fra: 2025-12-12 Laget: 2025-12-12 Sist oppdatert: 2025-12-12bibliografisk kontrollert
Paudel, B., Gonzalez-Huerta, J., Mendez, D. & Klotins, E. (2025). A Data-Driven Approach to Optimize Internal Software Quality and Customer Value Delivery. In: Pfahl D., Anwar H., Gonzalez Huerta J., Klünder J. (Ed.), Product-Focused Software Process Improvement. Industry-, Workshop-, and Doctoral Symposium Papers: . Paper presented at 25th International Conference on Product-Focused Software Process Improvement, PROFES 2024, Tartu, Dec 2-4, 2024 (pp. 179-185). Springer Science+Business Media B.V., 15453
Åpne denne publikasjonen i ny fane eller vindu >>A Data-Driven Approach to Optimize Internal Software Quality and Customer Value Delivery
2025 (engelsk)Inngår i: Product-Focused Software Process Improvement. Industry-, Workshop-, and Doctoral Symposium Papers / [ed] Pfahl D., Anwar H., Gonzalez Huerta J., Klünder J., Springer Science+Business Media B.V., 2025, Vol. 15453, s. 179-185Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

The growing complexity, the ever-ending demands for new features, and the need to become faster to remain competitive force software development organizations to rethink their development and value delivery practices. While continuous delivery has become more popular, it still relies mainly on internal metrics, ad-hoc data, and expert opinions. As a result, software organizations stumble to find the balance between improving internal system quality and delivering external value. In fact, understanding and measuring customer value is on itself essential. In this PhD project, we aim for a better understanding of customer value and develop measurement instruments to be integrated with internal perspectives to drive proactive and continuous internal improvement while delivering relevant customer value. 

sted, utgiver, år, opplag, sider
Springer Science+Business Media B.V., 2025
Serie
Lecture Notes in Computer Science (LNCS), ISSN 0302-9743, E-ISSN 1611-3349 ; 15453
Emneord
Continuous Customer Value Delivery, Data-Driven Approach, Software Quality Improvement, Sales, Competitive forces, Customer values, Expert opinion, Quality value, Software development organizations, Software Quality, Software quality improvements, Value delivery
HSV kategori
Identifikatorer
urn:nbn:se:bth-27310 (URN)10.1007/978-3-031-78392-0_13 (DOI)001423667900013 ()2-s2.0-85211242536 (Scopus ID)9783031783913 (ISBN)
Konferanse
25th International Conference on Product-Focused Software Process Improvement, PROFES 2024, Tartu, Dec 2-4, 2024
Forskningsfinansiär
Knowledge Foundation, 20180010
Tilgjengelig fra: 2024-12-26 Laget: 2024-12-26 Sist oppdatert: 2025-09-30bibliografisk kontrollert
Tran, H. K., Ali, N. b., Unterkalmsteiner, M. & Börstler, J. (2025). A proposal and assessment of an improved heuristic for the Eager Test smell detection. Journal of Systems and Software, 226, Article ID 112438.
Åpne denne publikasjonen i ny fane eller vindu >>A proposal and assessment of an improved heuristic for the Eager Test smell detection
2025 (engelsk)Inngår i: Journal of Systems and Software, ISSN 0164-1212, E-ISSN 1873-1228, Vol. 226, artikkel-id 112438Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Context: The evidence for the prevalence of test smells at the unit testing level has relied on the accuracy of detection tools, which have seen intense research in the last two decades. The Eager Test smell, one of the most prevalent, is often identified using simplified detection rules that practitioners find inadequate.

Objective: We aim to improve the rules for detecting the Eager Test smell.

Method: We reviewed the literature on test smells to analyze the definitions and detection rules of the Eager Test smell. We proposed a novel, unambiguous definition of the test smell and a heuristic to address the limitations of the existing rules. We evaluated our heuristic against existing detection rules by manually applying it to 300 unit test cases in Java.

Results: Our review identified 56 relevant studies. We found that inadequate interpretations of original definitions of the Eager Test smell led to imprecise detection rules, resulting in a high level of disagreement in detection outcomes. Also, our heuristic detected patterns of eager and non-eager tests that existing rules missed.

Conclusion: Our heuristic captures the essence of the Eager Test smell more precisely; hence, it may address practitioners’ concerns regarding the adequacy of existing detection rules.

sted, utgiver, år, opplag, sider
Elsevier, 2025
Emneord
Software testing, Test case quality, Test suite quality, Quality assurance, Test smells, Unit testing, Eager test Java JUnit
HSV kategori
Forskningsprogram
Programvaruteknik
Identifikatorer
urn:nbn:se:bth-27675 (URN)10.1016/j.jss.2025.112438 (DOI)001464187400001 ()2-s2.0-105001808870 (Scopus ID)
Tilgjengelig fra: 2025-03-31 Laget: 2025-03-31 Sist oppdatert: 2025-09-30bibliografisk kontrollert
Frattini, J., Fucci, D., Torkar, R., Montgomery, L., Unterkalmsteiner, M., Fischbach, J. & Mendez, D. (2025). Applying bayesian data analysis for causal inference about requirements quality: a controlled experiment. Empirical Software Engineering, 30(1), Article ID 29.
Åpne denne publikasjonen i ny fane eller vindu >>Applying bayesian data analysis for causal inference about requirements quality: a controlled experiment
Vise andre…
2025 (engelsk)Inngår i: Empirical Software Engineering, ISSN 1382-3256, E-ISSN 1573-7616, Vol. 30, nr 1, artikkel-id 29Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

It is commonly accepted that the quality of requirements specifications impacts subsequent software engineering activities. However, we still lack empirical evidence to support organizations in deciding whether their requirements are good enough or impede subsequent activities. We aim to contribute empirical evidence to the effect that requirements quality defects have on a software engineering activity that depends on this requirement. We conduct a controlled experiment in which 25 participants from industry and university generate domain models from four natural language requirements containing different quality defects. We evaluate the resulting models using both frequentist and Bayesian data analysis. Contrary to our expectations, our results show that the use of passive voice only has a minor impact on the resulting domain models. The use of ambiguous pronouns, however, shows a strong effect on various properties of the resulting domain models. Most notably, ambiguous pronouns lead to incorrect associations in domain models. Despite being equally advised against by literature and frequentist methods, the Bayesian data analysis shows that the two investigated quality defects have vastly different impacts on software engineering activities and, hence, deserve different levels of attention. Our employed method can be further utilized by researchers to improve reliable, detailed empirical evidence on requirements quality. © The Author(s) 2024.

sted, utgiver, år, opplag, sider
Springer, 2025
Emneord
Bayesian data analysis, Experiment, Replication, Requirements engineering, Requirements quality, Data accuracy, Data assimilation, Data consistency, Spatio-temporal data, Causal inferences, Controlled experiment, Domain model, Engineering activities, Quality defects, Requirement engineering, Requirement quality, Requirements specifications, Software quality
HSV kategori
Identifikatorer
urn:nbn:se:bth-27175 (URN)10.1007/s10664-024-10582-1 (DOI)2-s2.0-85209711862 (Scopus ID)
Forskningsfinansiär
Knowledge Foundation, 20180010
Tilgjengelig fra: 2024-11-29 Laget: 2024-11-29 Sist oppdatert: 2025-09-30bibliografisk kontrollert
Fucci, D., Di Penta, M., Romano, S. & Scanniello, G. (2025). Augmenting Software Bills of Materials with Software Vulnerability Description: A Preliminary Study on GitHub. In: Proceedings of the ACM SIGSOFT Symposium on the Foundations of Software Engineering: . Paper presented at 33rd ACM International Conference on the Foundations of Software Engineering, FSE Companion 2025, Trondheim, June 23-27, 2025 (pp. 631-635). Association for Computing Machinery (ACM)
Åpne denne publikasjonen i ny fane eller vindu >>Augmenting Software Bills of Materials with Software Vulnerability Description: A Preliminary Study on GitHub
2025 (engelsk)Inngår i: Proceedings of the ACM SIGSOFT Symposium on the Foundations of Software Engineering, Association for Computing Machinery (ACM), 2025, s. 631-635Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Software Bills of Material (SBOMs) are becoming a consolidated-and often enforced by governmental regulations-way to describe software composition. However, based on recent studies, SBOMs suffer from limited support for their consumption and lack information beyond simple dependencies, especially regarding software vulnerabilities. This paper reports the results of a preliminary study in which we augmented SBOMs of 40 open-source projects with information about Common Vulnerabilities and Exposures (CVE) exposed by project dependencies. Our augmented SBOMs have been evaluated by submitting pull requests and by asking project owners to answer a survey. Although, in most cases, augmented SBOMs were not directly accepted because owners required a continuous SBOM update, the received feedback shows the usefulness of the suggested SBOM augmentation.

sted, utgiver, år, opplag, sider
Association for Computing Machinery (ACM), 2025
Emneord
SBOM, Software repositories, VEX, Vulnerabilities management
HSV kategori
Identifikatorer
urn:nbn:se:bth-28600 (URN)10.1145/3696630.3728513 (DOI)2-s2.0-105013970463 (Scopus ID)9798400712760 (ISBN)
Konferanse
33rd ACM International Conference on the Foundations of Software Engineering, FSE Companion 2025, Trondheim, June 23-27, 2025
Forskningsfinansiär
Knowledge Foundation, 20230087Knowledge Foundation, 20180010
Tilgjengelig fra: 2025-09-05 Laget: 2025-09-05 Sist oppdatert: 2025-09-30bibliografisk kontrollert
Dorner, M. (2025). Code Review as a Communication Network. (Doctoral dissertation). Karlskrona, Sweden: Blekinge Tekniska Högskola
Åpne denne publikasjonen i ny fane eller vindu >>Code Review as a Communication Network
2025 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Abstract [en]

Background: Modern software systems are often too large and complex for an individual developer to fully oversee, making it difficult to understand the implications of changes. Therefore, most collaborative software projects rely on code review as communication network to foster asynchronous discussions about changes before they are merged. Although prior qualitative studies have revealed that practitioners view code review as a communication network, no formal theory or empirical validation exists. Without formalization and confirmatory evidence, the theory remains uncertain, limiting its credibility, practical relevance, and future development.

Objective: In this thesis, our objective is to (1) formalize the theory of code review as a communication network, (2) empirically evaluate the theory across varied perspectives, contexts, and conditions by quantifying the capability of code review to diffuse information among its participants, (3) demonstrate its practical relevance by applying the theory to the domain of tax compliance in collaborative software engineering, and (4) examine how the role of code review as a communication network for collaborative software engineering may evolve in the future.

Methods: To formalize the theory of code review as a communication network, we developed and validated a simulation model that operationalizes its core propositions about information diffusion among participants. To empirically evaluate the theory, we employed two complementary research approaches. First, we used the simulation model to conduct in silico experiments with closed-source code review systems from Microsoft, Spotify, and Trivago, as well as open-source code review systems from Android, Visual Studio Code, and React, to estimate the upper bound of information diffusion in code review. Second, through an observational study, we quantified the diffusion of information in code review across social, organizational, and architectural boundaries at Spotify. To demonstrate the practical relevance of the theory, we analyzed the code review system of a multinational enterprise as a communication network to reveal the latent collaboration structure among developers across borders, which is taxable. To explore the future of code review as a communication network, we conducted a questionnaire survey with 92 practitioners to gather their expectations and discuss how these anticipated changes may reshape our understanding of code review.

Results: By formalizing the theory of code review as a communication network modelled as a time-varying hypergraph, we were able to empirically demonstrate that traditional time-agnostic models substantially overestimate information diffusion in code review. Throughout our empirical studies, we found substential evidence supporting the theory of code review as a communication network: We confirmed that code review is capable of diffusing information quickly and widely among participants, even at a large scale. We also observed extensive information diffusion across social, organizational, and architectural boundaries at Spotify corroborating our theory. However, we also found that information diffusion patterns in open-source code review systems differ significantly, suggesting that findings from open-source environments may not directly apply to closed-source contexts. Through applying the theory of code review as a communication network in the domain of tax compliance, we were able to uncover the significant and previously unrecognized tax risks associated with collaborative software engineering within multinational enterprises. While practitioners consider code review also in the future a core practice in collaborative software engineering, we identify a potential risk that generative AI may undermine code review’s role as a human communication network.

Conclusion: Our work on understanding code review as a communication network contributes not only to theory-driven, empirical software engineering research but also lays the groundwork for practical applications, particularly in the context of tax compliance. Future research is needed to explore the evolving role of code review as a communication network.

sted, utgiver, år, opplag, sider
Karlskrona, Sweden: Blekinge Tekniska Högskola, 2025. s. 188
Serie
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 2025:10
Emneord
code review, software engineering, tax compliance, collaborative software engineering, communication network
HSV kategori
Forskningsprogram
Programvaruteknik
Identifikatorer
urn:nbn:se:bth-28424 (URN)978-91-7295-508-0 (ISBN)
Disputas
2025-09-23, J1630, Valhallavägen 1, Karlskrona, 14:00 (engelsk)
Opponent
Veileder
Tilgjengelig fra: 2025-08-22 Laget: 2025-08-22 Sist oppdatert: 2025-09-30bibliografisk kontrollert
Principal InvestigatorGorschek, Tony
Koordinerande organisasjon
Blekinge Tekniska Högskola
Forskningsfinansiär
Tidsperiod
2018-09-01 - 2026-09-01
HSV kategori
Software Engineering
Identifikatorer
DiVA, id: project:2307Prosjekt id: 20180010

Søk i DiVA

Software Engineering

Søk utenfor DiVA

GoogleGoogle Scholar

Länk till extern prosjektsida

SERT Web