Friday , March 29 2024

Multilingual Text-to-Speech Software Component for
Dynamic Language Identification and Voice Switching

Paul FOGARASSY-NESZLY1, Costin PRIBEANU2*

1 BAUM Engineering,
8, Str. Traian Moşoiu, Arad 310175, Romania
pf@baum.ro

* Corresponding author

2 I C I Bucharest
(National Institute for R & D in Informatics)

8-10 Averescu Blvd.
011455 Bucharest 1, Romania
pribeanu@ici.ro

Abstract: Text-to-speech synthesis is a critical feature of the applications developed for people with visual or reading disabilities. In the last years there has been an increasing interest in multilingual text-to-speech synthesis, which requires multilingual text analysis and language specific speech synthesis. In this case, the dynamic switching of the synthetic voice is needed in order to enhance the usability and user experience. This paper aims at presenting a software component for multilingual text-to-speech synthesis. The software has been developed and tested in four steps: alpha version (proof-of-concept), functional version (beta), commercial version, and implementation. The beta testing results showed a high accuracy of the language detection algorithms, which perform properly on texts having a variable degree of fragmentation. The commercial version has been then successfully implemented in two applications for visually impaired people: an automatic reading machine and a personal organizer for the blind and visually impaired users. Both implementations have been tested with users for usability and acceptance. The evaluation results showed that a device with this component is easier to use by visually impaired people.

Keywords: multilingual text-to-speech, dynamic language identification, voice switching, accessibility, assistive technologies, visually impaired users, usability.

>>Full text
CITE THIS PAPER AS:
Paul FOGARASSY-NESZLY, Costin PRIBEANU*,
Multilingual Text-to-Speech Software Component for Dynamic Language Identification and Voice Switching, Studies in Informatics and Control, ISSN 1220-1766, vol. 25(3), pp. 335-342, 2016. https://doi.org/10.24846/v25i3y201607