Utilize este identificador para referenciar este registo: https://hdl.handle.net/10316/113751
Título: Automatic wheeze segmentation using harmonic-percussive source separation and empirical mode decomposition
Autor: Rocha, Bruno M. 
Pessoa, Diogo 
Marques, Alda
Carvalho, Paulo de 
Paiva, Rui Pedro 
Palavras-chave: Respiratory sound analysis; expert systems; harmonic-percussive source separation; empirical mode decomposition; sound event detection
Data: 23-Fev-2023
Editora: IEEE
Projeto: This work was supported in part by the FCT - Foundation for Science and Technology, in part by the I.P./MCTES through national funds (PIDDAC), within the scope of CISUC R&D Unit under Projects UIDB/00326/2020 and UIDP/00326/2020, in part by the Ph.D. scholarships under Projects SFRH/BD/135686/2018 and DFA/BD/4927/2020, in part by the WELMO, in part by the Horizon 2020 Framework Programme of the European Union under Grant 825572, in part by the FCT project Lung@ICU under Grant DSAIPA/AI/0113/2020, in part by the Fundo Europeu de Desenvolvimento Regional (FEDER), in part by the Programa Operacional Competitividade e Internacionalização (COMPETE), in part by the iBiMED under Project POCI- 01-0145-FEDER-007628, and in part by the FCT under Project UIDB/04501/2020. 
Título da revista, periódico, livro ou evento: IEEE Journal of Biomedical and Health Informatics
Volume: 27
Número: 4
Resumo: Wheezes are adventitious respiratory sounds commonly present in patients with respiratory conditions. The presence of wheezes and their time location are relevant for clinical reasons, such as understanding the degree of bronchial obstruction. Conventional auscultation is usually employed to analyze wheezes, but remote monitoring has become a pressing need during recent years. Automatic respiratory sound analysis is required to reliably perform remote auscultation. In this work we propose a method for wheeze segmentation. Our method starts by decomposing a given audio excerpt into intrinsic mode frequencies using empirical mode decomposition. Then, we apply harmonic-percussive source separation to the resulting audio tracks and get harmonic-enhanced spectrograms, which are processed to obtain harmonic masks. Subsequently, a series of empirically derived rules are applied to find wheeze candidates. Finally, the candidates stemming from the different audio tracks are merged and median filtered. In the evaluation stage, we compare our method to three baselines on the ICBHI 2017 Respiratory Sound Database, a challenging dataset containing various noise sources and background sounds. Using the full dataset, our method outperforms the baselines, achieving an F1 of 41.9%. Our method's performance is also better than the baselines across several stratified results focusing on five variables: recording equipment, age, sex, body-mass index, and diagnosis. We conclude that, contrary to what has been reported in the literature, wheeze segmentation has not been solved for real life scenario applications. Adaptation of existing systems to demographic characteristics might be a promising step in the direction of algorithm personalization, which would make automatic wheeze segmentation methods clinically viable.
URI: https://hdl.handle.net/10316/113751
DOI: 10.1109/JBHI.2023.3248265
Direitos: openAccess
Aparece nas coleções:I&D CISUC - Artigos em Revistas Internacionais

Mostrar registo em formato completo

Visualizações de página

27
Visto em 8/mai/2024

Downloads

7
Visto em 8/mai/2024

Google ScholarTM

Verificar

Altmetric

Altmetric


Este registo está protegido por Licença Creative Commons Creative Commons