VOOZH about

URL: https://pubmed.ncbi.nlm.nih.gov/25750696/

⇱ Machine learning applications in cancer prognosis and prediction - PubMed


Clipboard, Search History, and several other advanced features are temporarily unavailable.
Skip to main page content
👁 Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

👁 Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Abstract

Cancer has been characterized as a heterogeneous disease consisting of many different subtypes. The early diagnosis and prognosis of a cancer type have become a necessity in cancer research, as it can facilitate the subsequent clinical management of patients. The importance of classifying cancer patients into high or low risk groups has led many research teams, from the biomedical and the bioinformatics field, to study the application of machine learning (ML) methods. Therefore, these techniques have been utilized as an aim to model the progression and treatment of cancerous conditions. In addition, the ability of ML tools to detect key features from complex datasets reveals their importance. A variety of these techniques, including Artificial Neural Networks (ANNs), Bayesian Networks (BNs), Support Vector Machines (SVMs) and Decision Trees (DTs) have been widely applied in cancer research for the development of predictive models, resulting in effective and accurate decision making. Even though it is evident that the use of ML methods can improve our understanding of cancer progression, an appropriate level of validation is needed in order for these methods to be considered in the everyday clinical practice. In this work, we present a review of recent ML approaches employed in the modeling of cancer progression. The predictive models discussed here are based on various supervised ML techniques as well as on different input features and data samples. Given the growing trend on the application of ML methods in cancer research, we present here the most recent publications that employ these techniques as an aim to model cancer risk or patient outcomes.

Keywords: ANN, Artificial Neural Network; AUC, Area Under Curve; BCRSVM, Breast Cancer Support Vector Machine; BN, Bayesian Network; CFS, Correlation based Feature Selection; Cancer recurrence; Cancer survival; Cancer susceptibility; DT, Decision Tree; ES, Early Stopping algorithm; GEO, Gene Expression Omnibus; HTT, High-throughput Technologies; LCS, Learning Classifying Systems; ML, Machine Learning; Machine learning; NCI caArray, National Cancer Institute Array Data Management System; NSCLC, Non-small Cell Lung Cancer; OSCC, Oral Squamous Cell Carcinoma; PPI, Protein–Protein Interaction; Predictive models; ROC, Receiver Operating Characteristic; SEER, Surveillance, Epidemiology and End results Database; SSL, Semi-supervised Learning; SVM, Support Vector Machine; TCGA, The Cancer Genome Atlas Research Network.

PubMed Disclaimer

Figures

👁 Fig. 1
Fig. 1
Classification task in supervised learning. Tumors are represented as X and classified as benign or malignant. The circled examples depict those tumors that have been misclassified.
👁 Fig. 2
Fig. 2
An indicative ROC curve of two classifiers: (a) Random Guess classifier (red curve) and (b) A classifier providing more robust predictions (blue dotted curve).
👁 Fig. 3
Fig. 3
An illustration of the ANN structure. The arrows connect the output of one node to the input of another.
👁 Fig. 4
Fig. 4
An illustration of a DT showing the tree structure. Each variable (X, Y, Z) is represented by a circle and the decision outcomes by squares (Class A, Class B). T(1–3) represents the thresholds (classification rules) in order to successfully classify each variable to a class label.
👁 Fig. 5
Fig. 5
A simplified illustration of a linear SVM classification of the input data. Figure was reproduced from the ML lectures of . Tumors are classified according to their size and the patient's age. The depicted arrows display the misclassified tumors.
👁 Fig. 6
Fig. 6
An illustration of a BN. Nodes (A–D) represent a set of random variables across with their conditional probabilities which are calculated in each table.
👁 Fig. 7
Fig. 7
Distribution of published studies, within the last 5 years, that employ ML techniques for cancer prediction.

References

    1. Hanahan D., Weinberg R.A. Hallmarks of cancer: the next generation. Cell. 2011;144:646–674. - PubMed
    1. Polley M.-Y.C., Freidlin B., Korn E.L., Conley B.A., Abrams J.S., McShane L.M. Statistical and practical considerations for clinical evaluation of predictive biomarkers. J Natl Cancer Inst. 2013;105:1677–1683. - PMC - PubMed
    1. Cruz J.A., Wishart D.S. Applications of machine learning in cancer prediction and prognosis. Cancer Informat. 2006;2:59. - PMC - PubMed
    1. Fortunato O., Boeri M., Verri C., Conte D., Mensah M., Suatoni P. Assessment of circulating microRNAs in plasma of lung cancer patients. Molecules. 2014;19:3038–3054. - PMC - PubMed
    1. Heneghan H.M., Miller N., Kerin M.J. MiRNAs as biomarkers and therapeutic targets in cancer. Curr Opin Pharmacol. 2010;10:543–550. - PubMed
Cite

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.