Cough classification with deep derived features using Audio Spectrogram Transformer

Par Conseil national de recherches du Canada

DOI	Trouver le DOI : https://doi.org/10.1109/BigData55660.2022.10020878
Auteur	Rechercher : Valdes, Julio¹; Rechercher : Habashy, Karim¹; Rechercher : Xi, Pengcheng¹; Rechercher : Cohen-McFarlane, Madison; Rechercher : Wallace, Bruce; Rechercher : Goubran, Rafik; Rechercher : Knoefel, Frank
Affiliation	Conseil national de recherches du Canada. Technologies numériques
Bailleur de fonds	Rechercher : National Research Council of Canada
Format	Texte, Article
Conférence	2022 IEEE International Conference on Big Data (Big Data), December 17-20, 2022, Osaka, Japan
Sujet	neural network; transformer; big data; cough audio signals; spectrograms; feature learning; feature selection; feature generation; manifold extraction; AutoML; representation learning; adaptation models; computational modeling; audio recording; data models
Résumé	Cough diagnosis is important for the elderly population, since cough is a key symptom of many respiratory illnesses and conditions. This paper introduces a Transformer-based feature learning approach for the analysis of cough recordings. A Transformer network leveraging feature learning on a big data set is investigated from a feature engineering perspective, in order to find dedicated classification models that can improve overall performance. The latter was achieved through adopting AutoML post-processing techniques on different data sets, driven by the feature engineering process based on both feature selection and feature generation via nonlinear methods. It was found that this approach led to substantial improvements (in the order of 17% from 0.818 to 0.956 of accuracy) on practically all metrics of classification performance, with respect t o t hose obtained with standalone Transformers. Moreover, AutoML models using reduced number of features, either selected or generated, resulted in higher quality models. In particular, a model working only with 1.2 % of the features (nonlinearly generated from the 768 produced by the Transformer), outperformed the model using all of them. These results highlight that big data-derived machine learning models, when post-processed, can play an important role in adapting to small-data scenarios.
Date de publication	2022-12-17
Maison d’édition	IEEE
Dans	2022 IEEE International Conference on Big Data (Big Data), 10020878 (17 décembre 2022) : 1729–1739.
Langue	anglais
Publications évaluées par des pairs	Oui
Exporter la notice	Exporter en format RIS
Signaler une correction	Signaler une correction (s'ouvre dans un nouvel onglet)
Identificateur de l’enregistrement	4c39543a-1cd1-4ef9-a984-04b0c77551ef
Enregistrement créé	2023-01-31
Enregistrement modifié	2023-03-16

Date de modification :: 2024-06-30