Searching for patterns in imbalanced data: methods and alternatives with case studies in life sciences

From National Research Council Canada

DOI	Resolve DOI: https://doi.org/10.1007/978-3-319-12568-8_20
Author	Search for: Famili, A. Fazel¹
Affiliation	National Research Council of Canada. Information and Communication Technologies
Format	Text, Book Chapter
Conference	19th Iberoamerican Congress, CIARP 2014, November 2-5, 2014, Puerto Vallarta, Mexico
Subject	knowledge discovery; imbalanced data; gene expression data
Abstract	The prime motivation for pattern discovery and machine learning research has been the collection and warehousing of large amounts of data, in many domains such as life sciences and industrial processes. Examples of unique problems arisen are situations where the data is imbalanced. The class imbalance problem corresponds to situations where majority of cases belong to one class and a small minority belongs to the other, which in many cases is equally or even more important. To deal with this problem a number of approaches have been studied in the past. In this talk we provide an overview of some existing methods and present novel applications that are based on identifying the inherent characteristics of one class vs the other. We present the results of a number of studies focusing on real data from life science applications.
Publication date	2014
Publisher	Springer
In	Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications (2014): 159–166.
Series	Lecture Notes in Computer Science 8827.
Language	English
Peer reviewed	Yes
NPARC number	21276087
Export citation	Export as RIS
Report a correction	Report a correction (opens in a new tab)
Record identifier	8d8af267-d0d1-4463-a8ea-02d4a83e970c
Record created	2015-09-22
Record modified	2020-06-18

Date modified:: 2025-05-09