Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features

Título

Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features

Autor

Shyan-Ming Yuan, Chia-Hung Liao, Shing-Yun Jung, Yu-Sheng Wu, Chuen-Tsai Sun

Descripción

Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This research aims to propose a feature engineering process that extracts the dedicated features for the depthwise separable convolution neural network (DS-CNN) to classify lung sounds accurately and efficiently. We extracted a total of three features for the shrunk DS-CNN model: the short-time Fourier-transformed (STFT) feature, the Mel-frequency cepstrum coefficient (MFCC) feature, and the fused features of these two. We observed that while DS-CNN models trained on either the STFT or the MFCC feature achieved an accuracy of 82.27% and 73.02%, respectively, fusing both features led to a higher accuracy of 85.74%. In addition, our method achieved 16 times higher inference speed on an edge device and only 0.45% less accuracy than RespireNet. This finding indicates that the fusion of the STFT and MFCC features and DS-CNN would be a model design for lightweight edge devices to achieve accurate AI-aided detection of lung diseases.

Fecha

2021

Materia

convolutional neural network, feature extraction, lung sounds, depthwise separable convolution, automatic auscultations

Identificador

10.3390/diagnostics11040732

Fuente

Epidemiology and Health

Editor

Korean Society of Epidemiology

Cobertura

Medicine (General)

Archivos

https://socictopen.socict.org/files/to_import/pdfs/0b41486a908421e2ff86c2d93c523ee4.pdf

Colección

Citación

Shyan-Ming Yuan, Chia-Hung Liao, Shing-Yun Jung, Yu-Sheng Wu, Chuen-Tsai Sun, “Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features,” SOCICT Open, consulta 17 de abril de 2026, https://www.socictopen.socict.org/items/show/9534.

Formatos de Salida

Position: 13032 (22 views)