Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features

Título

Autor

Shyan-Ming Yuan, Chia-Hung Liao, Shing-Yun Jung, Yu-Sheng Wu, Chuen-Tsai Sun

Descripción

Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This research aims to propose a feature engineering process that extracts the dedicated features for the depthwise separable convolution neural network (DS-CNN) to classify lung sounds accurately and efficiently. We extracted a total of three features for the shrunk DS-CNN model: the short-time Fourier-transformed (STFT) feature, the Mel-frequency cepstrum coefficient (MFCC) feature, and the fused features of these two. We observed that while DS-CNN models trained on either the STFT or the MFCC feature achieved an accuracy of 82.27% and 73.02%, respectively, fusing both features led to a higher accuracy of 85.74%. In addition, our method achieved 16 times higher inference speed on an edge device and only 0.45% less accuracy than RespireNet. This finding indicates that the fusion of the STFT and MFCC features and DS-CNN would be a model design for lightweight edge devices to achieve accurate AI-aided detection of lung diseases.

Fecha

2021

Materia

convolutional neural network, feature extraction, lung sounds, depthwise separable convolution, automatic auscultations

Identificador

10.3390/diagnostics11040732

Fuente

Epidemiology and Health

Editor

Korean Society of Epidemiology

Cobertura

Medicine (General)

Archivos

https://socictopen.socict.org/files/to_import/pdfs/0b41486a908421e2ff86c2d93c523ee4.pdf

Colección

Coronavirus

Citación

Shyan-Ming Yuan, Chia-Hung Liao, Shing-Yun Jung, Yu-Sheng Wu, Chuen-Tsai Sun, “Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features,” SOCICT Open, consulta 13 de junio de 2026, https://www.socictopen.socict.org/items/show/9534.

Formatos de Salida

Position: 14039 (24 views)