| Title |
Mordukhovich subdifferential optimization framework for multi-criteria voice cloning of pathological speech |
| Authors |
Maskeliūnas, Rytis ; Damaševičius, Robertas ; Kulikajevas, Audrius ; Pribuišis, Kipras ; Ulozaitė-Stanienė, Nora ; Uloza, Virgilijus |
| DOI |
10.32604/cmes.2025.072790 |
| Is Part of |
Computer modeling in engineering & sciences. Henderson, NV : Tech science press. 2025, vol. 145, iss. 3, p. 4203-4223. ISSN 1526-1492. eISSN 1526-1506 |
| Keywords [eng] |
Mordukhovich subdifferential optimization ; multi-objective optimization ; alaryngeal voice reconstruction |
| Abstract [eng] |
This study introduces a novel voice cloning framework driven by Mordukhovich Subdifferential Optimization (MSO) to address the complex multi-objective challenges of pathological speech synthesis in Lithuanian, an under-resourced language with unique phonemes not present in most pre-trained models. Unlike existing voice synthesis models, which often optimize for a single objective or are restricted to major languages, our approach explicitly balances four competing criteria: speech naturalness, speaker similarity, computational efficiency, and adaptability to pathological voice patterns. We evaluate four model configurations combining Lithuanian and English encoders, synthesizers, and vocoders. The hybrid model (English encoder, Lithuanian synthesizer, English vocoder), optimized via MSO, achieved the highest Mean Opinion Score (MOS) of 4.3 and demonstrated superior intelligibility and speaker fidelity. The results confirm that MSO enables effective navigation of trade-offs in multilingual pathological voice cloning, offering a scalable path toward high-quality voice restoration in clinical speech applications. This work represents the first integration of Mordukhovich optimization into pathological TTS, setting a new benchmark for speech synthesis under clinical and linguistic constraints. |
| Published |
Henderson, NV : Tech science press |
| Type |
Journal article |
| Language |
English |
| Publication date |
2025 |
| CC license |
|