Logo

28. Jahrestagung der Deutschen Gesellschaft für Audiologie e. V.

Deutsche Gesellschaft für Audiologie e. V.
04.-06.03.2026
Oldenburg

Meeting Abstract

Automatic speech performance measures for assessing audiogenic speech disorders using speech processing technologies

Laura Tuschen - Fraunhofer IDMT, HSA, Oldenburg, Deutschland
Marlene Rabe - Carl von Ossietzky Universität Oldenburg, Communication Acoustics, Oldenburg, Deutschland
Bernd T. Meyer - Carl von Ossietzky Universität Oldenburg, Communication Acoustics, Oldenburg, Deutschland

Text

Question: Adults may experience speech disorders of functional origin or organic causes. Speech is then characterized by sounds not being pronounced clearly, the voice being hoarse or rough, or uncontrolled speech intensity, changed rhythmic-melodic, and speech rate [1], [2]. Such disorders can lead to limitations in conversations and even exclusion from society as speech production ability is reduced on a communication related side [1]. The aim of this project was to investigate whether existing algorithm-based approaches and models from speech and hearing processing can be used to analyse and evaluate disordered human speech production, particularly in adults with speech disorders caused by hearing loss, so-called audiogenic speech disorders.

Method: An audio dataset of a test and a reference group was compared. Each group consisted of n=15 adults reading sentences from a matrix test with simple grammatical structure. The adults in the test group had a pre- or peri-lingual hearing impairment and suffered from different degrees of audiogenic speech and voice disorders. The subjects in the reference group had no hearing impairment. For each group, a set of 446 sentences was compared using different acoustic algorithm-based approaches to measure speech intelligibility (ASR), listener effort (LEAP), and speech naturalness (NISQA) [3]. In addition, a listening experiment was conducted in which participants with normal hearing rated a subset of the audio recordings based on perceived listener effort and naturalness on rating scales and speech intelligibility as correctly recognized words.

Results: The acoustic analyses generally show differences between the reference and test groups. Speech intelligibility and naturalness are lower on average for the test group within the acoustic measurements. On average, listener effort is rated higher for the test group. There is greater variance in all analysis results for the test group than for the reference group.The results of the listening experiment show similar results: The naturalness of the test group is rated as less natural than the reference group on average. The perceived listener effort is also rated higher on average for the test group. In addition, the variance of the rating results is higher in the test group than in the reference group. No group differences can be shown in the intelligibility ratings.

Conclusion: The results suggest that changes in speech associated with audiogenic speech disorders can be automatically measured using algorithm-based approaches and speech and hearing processing models in a group comparison. A more extensive listening experiment is planned to improve comparability with speech processing models and test reliability. In the future, it is planned to transfer the measurements to spontaneous speech and other speech disorders such as dysarthria in Parkinson's disease or childhood pronunciation disorders.


Literatur

[1] Lehner K, Ziegler W. Indicators of Communication Limitation in Dysarthria and Their Relation to Auditory-PerceptualSpeech Symptoms: Construct Validity of the KommPaS Web App. Journal of Speech, Language, and Hearing Research. 2022;65:22–42.
[2] Baumgärtner A, Staiger A. Neurogene Störungen der Sprache und des Sprechens. Neurologie Up2date. 2020;03:155–173.
[3] Mittag G, Möller S. Deep Learning Based Assessment of Synthetic Speech Naturalness. In: Proceedings Interspeech 2020.