Emotional Audio-Visual Arabic Text to Speech
2006 (English)In: Proceedings of the XIV European Signal Processing Conference (EUSIPCO), Florence, Italy, 2006Conference paper, Published paper (Refereed)
Abstract [en]
The goal of this paper is to present an emotional audio-visual. Text to speech system for the Arabic Language. The system is based on two entities: un emotional audio text to speech system which generates speech depending on the input text and the desired emotion type, and un emotional Visual model which generates the talking heads, by forming the corresponding visemes. The phonemes to visemes mapping, and the emotion shaping use a 3-paramertic face model, based on the Abstract Muscle Model. We have thirteen viseme models and five emotions as parameters to the face model. The TTS produces the phonemes corresponding to the input text, the speech with the suitable prosody to include the prescribed emotion. In parallel the system generates the visemes and sends the controls to the facial model to get the animation of the talking head in real time.
Place, publisher, year, edition, pages
Florence, Italy, 2006.
Series
European Signal Processing Conference, ISSN 2219-5491
National Category
Computer Sciences Natural Language Processing
Identifiers
URN: urn:nbn:se:kth:diva-52075OAI: oai:DiVA.org:kth-52075DiVA, id: diva2:465369
Conference
the XIV European Signal Processing Conference (EUSIPCO)
Note
tmh_import_11_12_14. QC 20111215
2011-12-142011-12-142025-02-01Bibliographically approved