Change search
ReferencesLink to record
Permanent link

Direct link
Scale transform in rhythmic similarity of music
Technological Education Institute, Greece.ORCID iD: 0000-0003-1679-6018
2011 (English)In: IEEE Transactions on Audio, Speech, and Language Processing, ISSN 1558-7916, E-ISSN 1558-7924, Vol. 19, no 1, 176-185 p.Article in journal (Refereed) Published
Abstract [en]

As a special case of the Mellin transform, the scale transform has been applied in various signal processing areas, in order to get a signal description that is invariant to scale changes. In this paper, the scale transform is applied to autocorrelation sequences derived from music signals. It is shown that two such sequences, when derived from similar rhythms with different tempo, differ mainly by a scaling factor. By using the scale transform, the proposed descriptors are robust to tempo changes, and are specially suited for the comparison of pieces with different tempi but similar rhythm. As music with such characteristics is widely encountered in traditional forms of music, the performance of the descriptors in a classification task of Greek traditional dances and Turkish traditional songs is evaluated. On these datasets accuracies compared to non-tempo robust approaches are improved by more than 20%, while on a dataset of Western music the achieved accuracy improves compared to previously presented results.

Place, publisher, year, edition, pages
IEEE Press, 2011. Vol. 19, no 1, 176-185 p.
Keyword [en]
Computational ethnomusicology; music information retrieval (MIR); rhythmic similarity; scale transform
National Category
Media Engineering
Research subject
Computer Science; Information and Communication Technology; Media Technology; Speech and Music Communication
URN: urn:nbn:se:kth:diva-193758DOI: 10.1109/TASL.2010.2045782ISI: 000291661300015ScopusID: 2-s2.0-77957735497OAI: diva2:1040417

QC 20161031

Available from: 2016-10-27 Created: 2016-10-10 Last updated: 2016-11-11Bibliographically approved

Open Access in DiVA

fulltext(515 kB)10 downloads
File information
File name FULLTEXT01.pdfFile size 515 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Holzapfel, André
In the same journal
IEEE Transactions on Audio, Speech, and Language Processing
Media Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 10 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 8 hits
ReferencesLink to record
Permanent link

Direct link