Change search
ReferencesLink to record
Permanent link

Direct link
Automatic Turn Segmentation for Movie & TV Subtitles
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2016 (English)In: 2016 IEEE Workshop on Spoken Language Technology, IEEE conference proceedings, 2016Conference paper (Refereed)
Abstract [en]

Movie and TV subtitles contain large amounts of conversational material, but lack an explicit turn structure. This paper present a data-driven approach to the segmentation of subtitles into dialogue turns. Training data is first extracted by aligning subtitles with transcripts in order to obtain speaker labels. This data is then used to build a classifier whose task is to determine whether two consecutive sentences are part of the same dialogue turn. The approach relies on linguistic, visual and timing features extracted from the subtitles themselves and does not require access to the audiovisual material -- although speaker diarization can be exploited when audio data is available. The approach also exploits alignments with related subtitles in other languages to further improve the classification performance. The classifier achieves an accuracy of 78% on a held-out test set. A follow-up annotation experiment demonstrates that this task is also difficult for human annotators.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2016.
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-193938OAI: diva2:1034694
2016 IEEE Workshop on Spoken Language Technology

QC 20161014

Available from: 2016-10-12 Created: 2016-10-12 Last updated: 2016-10-14Bibliographically approved

Open Access in DiVA

fulltext(363 kB)13 downloads
File information
File name FULLTEXT01.pdfFile size 363 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Meena, Raveesh
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 13 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

ReferencesLink to record
Permanent link

Direct link