TY - JOUR
T1 - Support Vector Machine Ensembles for Breast Cancer Type Prediction from Mid-FTIR Micro-calcification Spectra
AU - Sattlecker, Martina
AU - Baker, Rebecca
AU - Stone, Nick
AU - Bessant, Conrad
PY - 2011/7
Y1 - 2011/7
N2 - Over the past years Fourier transform infrared (FTIR) spectroscopy has been demonstrated as a prospective tool for cancer diagnostics. In order to apply FTIR spectroscopy as a routine tool for biomedical diagnostics of tissue samples, strong and reliable classifiers are needed. Frequently, the number of available tissue samples is restricted and due to that data sets consist of a small number of samples, often less than 100. This can result in unstable classifiers, which perform poorly on unseen data. In this work we present a way to overcome this limitation by aggregating several support vector machines in to an ensemble. Different ensemble systems, including bagging, boosting and tree-based models, were investigated for a FTIR data set acquired from different types and stages of breast cancer. It was found that an ensemble system predicts 88.9% of the unseen multi-class test set correctly. In comparison a single classifier only achieved a predictive performance of 66.7%. As these results show, the application of SVM ensembles in biomedical diagnostics using FTIR spectroscopy can be highly beneficial.
AB - Over the past years Fourier transform infrared (FTIR) spectroscopy has been demonstrated as a prospective tool for cancer diagnostics. In order to apply FTIR spectroscopy as a routine tool for biomedical diagnostics of tissue samples, strong and reliable classifiers are needed. Frequently, the number of available tissue samples is restricted and due to that data sets consist of a small number of samples, often less than 100. This can result in unstable classifiers, which perform poorly on unseen data. In this work we present a way to overcome this limitation by aggregating several support vector machines in to an ensemble. Different ensemble systems, including bagging, boosting and tree-based models, were investigated for a FTIR data set acquired from different types and stages of breast cancer. It was found that an ensemble system predicts 88.9% of the unseen multi-class test set correctly. In comparison a single classifier only achieved a predictive performance of 66.7%. As these results show, the application of SVM ensembles in biomedical diagnostics using FTIR spectroscopy can be highly beneficial.
U2 - 10.1016/j.chemolab.2011.05.007
DO - 10.1016/j.chemolab.2011.05.007
M3 - Article
SN - 0169-7439
VL - 107
SP - 363
EP - 370
JO - CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
JF - CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
IS - 2
M1 - N/A
ER -