Téléchargement | - Voir le manuscrit accepté : Undersampling with support vectors for multi-class imbalanced data classification (PDF, 380 Kio)
|
---|
DOI | Trouver le DOI : https://doi.org/10.1109/IJCNN52387.2021.9533379 |
---|
Auteur | Rechercher : Krawczyk, Bartosz; Rechercher : Bellinger, Colin1; Rechercher : Corizzo, Roberto; Rechercher : Japkowicz, Nathalie |
---|
Affiliation | - Conseil national de recherches du Canada. Technologies numériques
|
---|
Format | Texte, Article |
---|
Conférence | 2021 International Joint Conference on Neural Networks (IJCNN), July 18-22, 2021, Shenzhen, China [Virtual Event] |
---|
Description physique | 7 p. |
---|
Sujet | machine learning; imbalanced data classification; multi-class imbalance; undersampling |
---|
Résumé | Learning from imbalanced data poses significant challenges for the classifier. This becomes even more difficult, when dealing with multi-class problems. Here relationships among classes are no longer well-defined and it is easy to loose performance on one of the classes while gaining on other. In last years this topic has gained increased interest from the machine learning community - however, still there is a need for developing new and efficient algorithms to handle this challenge. In this paper we propose a new approach for balancing multi-class imbalanced problems. It is based on a two-step undersampling methodology. In the first step, a one-class classifier is being trained on each of the classes, achieving skew-insensitive data description. Support vectors for each class are extracted and used as new class representatives, thus achieving significant reduction in the terms of used instances. In the second step, an evolutionary undersampling approach is being used on these support vectors in order to further balance the training set. By applying this technique on a set of support vectors and not on a full dataset, we achieve a significant reduction of the computational time and increased accuracy. Finally, a standard multi-class classifier is being trained on the balanced data set. A thorough experimental study proves the usefulness of the proposed approach in comparison with state-of-the-art approaches for handling multi-class imbalanced data. |
---|
Date de publication | 2021-09-20 |
---|
Maison d’édition | IEEE |
---|
Dans | |
---|
Langue | anglais |
---|
Publications évaluées par des pairs | Oui |
---|
Exporter la notice | Exporter en format RIS |
---|
Signaler une correction | Signaler une correction (s'ouvre dans un nouvel onglet) |
---|
Identificateur de l’enregistrement | bd3b4916-7e94-4b0a-9901-3ce3ef0461eb |
---|
Enregistrement créé | 2021-11-10 |
---|
Enregistrement modifié | 2021-11-15 |
---|