Predicting Breast Cancer from Risk Factors Using SVM and Extra-Trees-Based Feature Selection Method
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27350%2F22%3A10250524" target="_blank" >RIV/61989100:27350/22:10250524 - isvavai.cz</a>
Result on the web
<a href="https://www.mdpi.com/2073-431X/11/9/136" target="_blank" >https://www.mdpi.com/2073-431X/11/9/136</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.3390/computers11090136" target="_blank" >10.3390/computers11090136</a>
Alternative languages
Result language
angličtina
Original language name
Predicting Breast Cancer from Risk Factors Using SVM and Extra-Trees-Based Feature Selection Method
Original language description
Developing a prediction model from risk factors can provide an efficient method to recognize breast cancer. Machine learning (ML) algorithms have been applied to increase the efficiency of diagnosis at the early stage. This paper studies a support vector machine (SVM) combined with an extremely randomized trees classifier (extra-trees) to provide a diagnosis of breast cancer at the early stage based on risk factors. The extra-trees classifier was used to remove irrelevant features, while SVM was utilized to diagnose the breast cancer status. A breast cancer dataset consisting of 116 subjects was utilized by machine learning models to predict breast cancer, while the stratified 10-fold cross-validation was employed for the model evaluation. Our proposed combined SVM and extra-trees model reached the highest accuracy up to 80.23%, which was significantly better than the other ML model. The experimental results demonstrated that by applying extra-trees-based feature selection, the average ML prediction accuracy was improved by up to 7.29% as contrasted to ML without the feature selection method. Our proposed model is expected to increase the efficiency of breast cancer diagnosis based on risk factors. In addition, we presented the proposed prediction model that could be employed for web-based breast cancer prediction. The proposed model is expected to improve diagnostic decision-support systems by predicting breast cancer disease accurately.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10200 - Computer and information sciences
Result continuities
Project
—
Continuities
V - Vyzkumna aktivita podporovana z jinych verejnych zdroju
Others
Publication year
2022
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Computers
ISSN
2073-431X
e-ISSN
—
Volume of the periodical
11
Issue of the periodical within the volume
9
Country of publishing house
CH - SWITZERLAND
Number of pages
14
Pages from-to
nestrankovano
UT code for WoS article
000856323500001
EID of the result in the Scopus database
2-s2.0-85138679296