Classification of tumor samples from expression data using decision trunks
2013 (English). In: Cancer Informatics, ISSN 1176-9351, Vol. 12, 53-66 p. Article in journal (Refereed), Published
We present a novel machine learning approach for the classification of cancer samples using expression data. We refer to the method as "decision trunks," since it is loosely based on decision trees, but contains several modifications designed to achieve an algorithm that: (1) produces smaller and more easily interpretable classifiers than decision trees; (2) is more robust in varying application scenarios; and (3) achieves higher classification accuracy. The decision trunk algorithm has been implemented and tested on 26 classification tasks, covering a wide range of cancer forms, experimental methods, and classification scenarios. This comprehensive evaluation indicates that the proposed algorithm performs at least as well as the current state-of-the-art algorithms in terms of accuracy, while producing classifiers that include on average only 2-3 markers. We suggest that the resulting decision trunks have clear advantages over other classifiers due to their transparency, their interpretability, and their correspondence with human decision-making and clinical testing practices. © the author(s), publisher and licensee Libertas Academica Ltd.
Place, publisher, year, edition, pages
Libertas Academica Ltd., 2013. Vol. 12, 53-66 p.
Keywords
Biomarkers, Classification, Gene expression, Machine learning, accuracy, article, classification algorithm, controlled study, decision making, decision tree, intermethod comparison, learning algorithm
Research subject
Natural sciences
Identifiers
URN: urn:nbn:se:his:diva-8394
DOI: 10.4137/CIN.S10356
PubMedID: 23467331
ScopusID: 2-s2.0-84874202131
OAI: oai:DiVA.org:his-8394
DiVA: diva2:639970