Alternative title | Relevant attribute discovery in high dimensional data based on rough sets applications to Leukemia gene expressions |
---|
Download | - View accepted manuscript: Relevant attribute discovery in high dimensional data based on rough sets and unsupervised classification: application to Leukemia gene expressions (PDF, 305 KiB)
|
---|
DOI | Resolve DOI: https://doi.org/10.1007/11548706_38 |
---|
Author | Search for: Valdés, Julio J.1; Search for: Barton, Alan J.1 |
---|
Affiliation | - National Research Council of Canada. NRC Institute for Information Technology
|
---|
Format | Text, Book Chapter |
---|
Conference | The Tenth International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing (RSFDGrC 2005), August 31 - September 3, 2005, Regina, Saskatchewan, Canada |
---|
Subject | acute myeloid leukemia; acute lymphoblastic leukemia; high dimensional data; decision attribute; remote host |
---|
Abstract | A pipelined approach using two clustering algorithms in combination with Rough Sets is investigated for the purpose discovering important combination of attributes in high dimensional data. In many domains, the data objects are described in terms of a large number of features, like in gene expression experiments, or in samples characterized by spectral information. The Leader and several k-means algorithms are used as fast procedures for attribute set simplification of the information systems presented to the rough sets algorithms. The data submatrices described in terms of these features are then discretized w.r.t the decision attribute according to different rough set based schemes. From them, the reducts and their derived rules are extracted, which are applied to test data in order to evaluate the resulting classification accuracy. An exploration of this approach (using Leukemia gene expression data) was conducted in a series of experiments within a high-throughput distributed-computing environment. They led to subsets of genes with high discrimination power. Good results were obtained with no preprocessing applied to the data. |
---|
Publication date | 2005-09 |
---|
Publisher | Springer |
---|
In | |
---|
Series | |
---|
Language | English |
---|
Peer reviewed | Yes |
---|
NRC number | NRCC 48122 |
---|
NPARC number | 8913287 |
---|
Export citation | Export as RIS |
---|
Report a correction | Report a correction (opens in a new tab) |
---|
Record identifier | ed378d76-e02c-49d9-9c9a-4b18c393e1d6 |
---|
Record created | 2009-04-22 |
---|
Record modified | 2024-02-06 |
---|