Lightweight deep learning model for assessment of substitution voicing and speech after laryngeal carcinoma surgery /

Rytis Maskeliūnas; Audrius Kulikajevas; Robertas Damaševičius; Kipras Pribuišis; Nora Ulozaitė - Stanienė; Virgilijus Uloza

doi:10.3390/cancers14102366

Title	Lightweight deep learning model for assessment of substitution voicing and speech after laryngeal carcinoma surgery /
Authors	Maskeliūnas, Rytis ; Kulikajevas, Audrius ; Damaševičius, Robertas ; Pribuišis, Kipras ; Ulozaitė - Stanienė, Nora ; Uloza, Virgilijus
DOI	10.3390/cancers14102366
Full Text
Is Part of	Cancers.. Basel : MDPI. 2022, vol. 14, iss. 10, art. no. 2366, p. 1-18.. ISSN 2072-6694
Keywords [eng]	convolutional neural networks ; deep learning ; laryngeal carcinoma ; substitution voicing ; voice analysis
Abstract [eng]	Laryngeal carcinoma is the most common malignant tumor of the upper respiratory tract. Total laryngectomy provides complete and permanent detachment of the upper and lower airways that causes the loss of voice, leading to a patient’s inability to verbally communicate in the postoperative period. This paper aims to exploit modern areas of deep learning research to objectively classify, extract and measure the substitution voicing after laryngeal oncosurgery from the audio signal. We propose using well-known convolutional neural networks (CNNs) applied for image classification for the analysis of voice audio signal. Our approach takes an input of Mel-frequency spectrogram (MFCC) as an input of deep neural network architecture. A database of digital speech recordings of 367 male subjects (279 normal speech samples and 88 pathological speech samples) was used. Our approach has shown the best true-positive rate of any of the compared state-of-the-art approaches, achieving an overall accuracy of 89.47%.
Published	Basel : MDPI
Type	Journal article
Language	English
Publication date	2022
CC license

„Lightweight deep learning model for assessment of substitution voicing and speech after laryngeal carcinoma surgery /“