Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A machine learning tool for identifying metastatic colorectal cancer in primary care
Division of Family Medicine and Primary Care, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet;Academic Primary Health Care Centre, Region Stockholm;Regional Cancer Centre Stockholm-Gotland, Region Stockholm.ORCID iD: 0000-0002-6177-4136
Department of Community Medicine and Public Health, Sahlgrenska Academy, Institute of Medicine, University of Gothenburg.ORCID iD: 0000-0002-6849-0654
Division of Family Medicine and Primary Care, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet;Academic Primary Health Care Centre, Region Stockholm;Regional Cancer Centre Stockholm-Gotland, Region Stockholm.ORCID iD: 0000-0001-9589-4681
Division of Family Medicine and Primary Care, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet;Academic Primary Health Care Centre, Region Stockholm.ORCID iD: 0000-0001-9521-2345
Show others and affiliations
2025 (English)In: Scandinavian Journal of Primary Health Care, ISSN 0281-3432, E-ISSN 1502-7724Article in journal (Refereed) Epub ahead of print
Abstract [en]

Background: Detection of colorectal cancer (CRC) is mainly achieved by clinical assessment. As new treatments become available for metastatic CRC (MCRC), it is important to accurately identify these patients.

Aim: To develop a predictive model for identifying MCRC in primary health care patients using diagnostic data analysed with machine learning.

Design and setting: A case-control study utilising data on primary health care visits for 146 patients >18 years old diagnosed with MCRC in the Västra Götaland Region, Sweden during 2011, and 577 sex-, age, and primary health care centre-matched controls.

Method: Stochastic gradient boosting was used to construct a model for predicting the presence of MCRC based on diagnostic codes from primary health care consultations during the year before index (diagnosis) date and number of consultations. Variable importance was estimated using the normalised relative influence (NRI) score. Risks of having MCRC were calculated using odds ratios of marginal effects (ORME).

Results: The optimal model included 76 variables with non-zero influence, had an area under the curve of 76.5%, a sensitivity of 77.8%, and a specificity of 69.2%. The 10 most important variables had a combined NRI of 61.0%. Number of consultations during the year before index date had the highest NRI at 19.2%, with an ORME of 3.3.

Conclusion: A machine learning method based on primary health care consultation frequency and diagnoses may be used to identify important variables for predicting presence of MCRC. Both primary health care consultations and associated diagnostic codes need to be taken into consideration.

Place, publisher, year, edition, pages
Taylor & Francis, 2025.
Keywords [en]
artificial intelligence, cancer detection, family practice, gradient boosting, colorectal neoplasms
National Category
Cancer and Oncology
Identifiers
URN: urn:nbn:se:uu:diva-552410DOI: 10.1080/02813432.2025.2477155OAI: oai:DiVA.org:uu-552410DiVA, id: diva2:1944521
Funder
Region Stockholm, NsV-977323Stiftelsen Einar BelvénAvailable from: 2025-03-14 Created: 2025-03-14 Last updated: 2025-03-21Bibliographically approved

Open Access in DiVA

fulltext(1680 kB)33 downloads
File information
File name FULLTEXT01.pdfFile size 1680 kBChecksum SHA-512
02e397b6e9e2475997ca4992ed0845b7fd6677c67702be5bfd3e4889489ed180118b03fa86d476d20c90ef44edb3ac7faa5d27cb0d337da8a792d1d70b49db3e
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Abedi, EliyaEwing, MarcelaNemlander, ElinorHasselström, JanSjövall, AnnikaCarlsson, Axel C.Rosenblad, Andreas
By organisation
Clinical diabetology and metabolismDepartment of Statistics
In the same journal
Scandinavian Journal of Primary Health Care
Cancer and Oncology

Search outside of DiVA

GoogleGoogle Scholar
Total: 34 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 146 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf