Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A systematic review of intermediate fusion in multimodal deep learning for biomedical applications
Research Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.
Research Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy; Department of Biomedical Sciences, Humanitas University, Milan, Italy.
Research Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.
Umeå University, Faculty of Medicine, Department of Diagnostics and Intervention.
Show others and affiliations
2025 (English)In: Image and Vision Computing, ISSN 0262-8856, E-ISSN 1872-8138, Vol. 158, article id 105509Article in journal (Refereed) Published
Abstract [en]

Deep learning has revolutionized biomedical research by providing sophisticated methods to handle complex, high-dimensional data. Multimodal deep learning (MDL) further enhances this capability by integrating diverse data types such as imaging, textual data, and genetic information, leading to more robust and accurate predictive models. In MDL, differently from early and late fusion methods, intermediate fusion stands out for its ability to effectively combine modality-specific features during the learning process. This systematic review comprehensively analyzes and formalizes current intermediate fusion methods in biomedical applications, highlighting their effectiveness in improving predictive performance and capturing complex inter-modal relationships. We investigate the techniques employed, the challenges faced, and potential future directions for advancing intermediate fusion methods. Additionally, we introduce a novel structured notation that standardizes intermediate fusion architectures, enhancing understanding and facilitating implementation across various domains. Our findings provide actionable insights and practical guidelines intended to support researchers, healthcare professionals, and the broader deep learning community in developing more sophisticated and insightful multimodal models. Through this review, we aim to provide a foundational framework for future research and practical applications in the dynamic field of MDL.

Place, publisher, year, edition, pages
Elsevier, 2025. Vol. 158, article id 105509
Keywords [en]
Biomedical data, Data fusion, Data integration, Fusion techniques, Healthcare, Joint fusion
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-237396DOI: 10.1016/j.imavis.2025.105509Scopus ID: 2-s2.0-105001226580OAI: oai:DiVA.org:umu-237396DiVA, id: diva2:1951322
Funder
The Kempe Foundations, JCSMK24-0094Available from: 2025-04-10 Created: 2025-04-10 Last updated: 2025-04-10Bibliographically approved

Open Access in DiVA

fulltext(3579 kB)27 downloads
File information
File name FULLTEXT01.pdfFile size 3579 kBChecksum SHA-512
aee5275261cf1bdb3733717a5f66a9e86071cb5e9798624086bd4dc1ac6b2989887833ee24316e953aa4d3419d155f18a7257a47e6c218aeee88a26db561f591
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Di Feola, FrancescoSoda, Paolo
By organisation
Department of Diagnostics and Intervention
In the same journal
Image and Vision Computing
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 27 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 149 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf