Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Evaluating expressive speech synthesis from audiobooks in conversational phrases
Show others and affiliations
2012 (English)Conference paper, Published paper (Refereed)
Resource type
Text
Abstract [en]

Audiobooks are a rich resource of large quantities of natural sounding, highly expressive speech. In our previous research we have shown that it is possible to detect different expressive voice styles represented in a particular audiobook, using unsupervised clustering to group the speech corpus of the audiobook into smaller subsets representing the detected voice styles. These subsets of corpora of different voice styles reflect the various ways a speaker uses their voice to express involvement and affect, or imitate characters. This study is an evaluation of the detection of voice styles in an audiobook in the application of expressive speech synthesis. A further aim of this study is to investigate the usability of audiobooks as a language resource for expressive speech synthesis of utterances of conversational speech. Two evaluations have been carried out to assess the effect of the genre transfer: transmitting expressive speech from read aloud literature to conversational phrases with the application of speech synthesis. The first evaluation revealed that listeners have different voice style preferences for a particular conversational phrase. The second evaluation showed that it is possible for users of speech synthesis systems to learn the characteristics of a certain voice style well enough to make reliable predictions about what a certain utterance will sound like when synthesised using that voice style. 

Place, publisher, year, edition, pages
2012. 3335-3339 p.
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:kth:diva-185523ISI: 000323927703066ISBN: 978-2-9517408-7-7 (print)OAI: oai:DiVA.org:kth-185523DiVA: diva2:922781
Conference
International Conference on Language Resources and Evaluation. MAY 21-27, 2012.
Note

QC 20160426

Available from: 2016-04-25 Created: 2016-04-21 Last updated: 2016-04-26Bibliographically approved

Open Access in DiVA

fulltext(507 kB)38 downloads
File information
File name FULLTEXT01.pdfFile size 507 kBChecksum SHA-512
71f38c28bf33392a759c4f15f96c7c8df884ed0f7aadc9fb926fc50ffbbde5fb73c452ed3c47cf4b9da2e7a8210f995354d9e1691dd53e043bcc6da356551022
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Székely, Éva
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 38 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 165 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf