Multilingual Text-to-Speech Software Component for Dynamic Language Identification and Voice Switching

Studies in Informatics and Control

Vol. 25, No. 3, 2016

Multilingual Text-to-Speech Software Component for Dynamic Language Identification and Voice Switching

Paul FOGARASSY-NESZLY, Costin PRIBEANU

Abstract

Text-to-speech synthesis is a critical feature of the applications developed for people with visual or reading disabilities. In the last years there has been an increasing interest in multilingual text-to-speech synthesis, which requires multilingual text analysis and language specific speech synthesis. In this case, the dynamic switching of the synthetic voice is needed in order to enhance the usability and user experience. This paper aims at presenting a software component for multilingual text-to-speech synthesis. The software has been developed and tested in four steps: alpha version (proofof-concept), functional version (beta), commercial version, and implementation. The beta testing results showed a high accuracy of the language detection algorithms, which perform properly on texts having a variable degree of fragmentation. The commercial version has been then successfully implemented in two applications for visually impaired people: an automatic reading machine and a personal organizer for the blind and visually impaired users. Both implementations have been tested with users for usability and acceptance. The evaluation results showed that a device with this component is easier to use by visually impaired people.

Keywords

multilingual text-to-speech, dynamic language identification, voice switching, accessibility, assistive technologies, visually impaired users, usability.

View full article

CITE THIS PAPER AS:

Paul FOGARASSY-NESZLY, Costin PRIBEANU, "Multilingual Text-to-Speech Software Component for Dynamic Language Identification and Voice Switching", Studies in Informatics and Control, ISSN 1220-1766, vol. 25(3), pp. 335-342, 2016. https://doi.org/10.24846/v25i3y201607

Past Issues