National Research Council of Canada. NRC Institute for Information Technology
computational biology; machine learning; data mining; knowledge discovery; bioinformatics; breast cancer prognosis; survival prediction; classification performance; sensitivity
Current breast cancer predictive signatures are not unique. Can we use this fact to our advantage to improve prediction? From the machine learning perspective, it is well known that combining multiple classifiers can improve classification performance. We propose an ensemble machine learning approach which consists of choosing feature subsets and learning predictive models from them. We then combine models based on certain model fusion criteria and we also introduce a tuning parameter to control sensitivity. Our method significantly improves classification performance with a particular emphasis on sensitivity which is critical to avoid misclassifying poor prognosis patients as good prognosis.
International Journal of Computational Biology and Drug Design1, no. 3: 275–294.