Scale transform in rhythmic similarity of music
2011 (English) In: IEEE Transactions on Audio, Speech, and Language Processing, ISSN 1558-7916, E-ISSN 1558-7924, Vol. 19, no 1, 176-185 p.
Article in journal (Refereed) Published
As a special case of the Mellin transform, the scale transform has been applied in various signal processing areas to obtain a signal description that is invariant to scale changes. In this paper, the scale transform is applied to autocorrelation sequences derived from music signals. It is shown that two such sequences, when derived from similar rhythms with different tempi, differ mainly by a scaling factor. By using the scale transform, the proposed descriptors are robust to tempo changes and are especially suited for comparing pieces with different tempi but similar rhythm. As music with such characteristics is widely encountered in traditional forms of music, the performance of the descriptors is evaluated in a classification task on Greek traditional dances and Turkish traditional songs. On these datasets, accuracies improve by more than 20% compared to approaches that are not robust to tempo, while on a dataset of Western music the achieved accuracy improves on previously presented results.
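The tempo invariance described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the names `scale_magnitude`, `toy_sequence` and `cosine_similarity`, the synthetic lag-domain signal, and all grid sizes are assumptions chosen for illustration. The sketch resamples a sequence onto a logarithmic lag axis, where the scale transform reduces to a Fourier transform, and compares two copies of the same pattern at different tempi.

```python
import numpy as np

def scale_magnitude(x, t, c_values, n_log=4096):
    """Unit-norm scale-transform magnitude of x sampled at positive lags t."""
    # Resample onto a grid that is uniform in u = ln(t).
    u = np.linspace(np.log(t[0]), np.log(t[-1]), n_log)
    x_log = np.interp(np.exp(u), t, x)
    # With t = exp(u), the scale transform becomes a Fourier transform
    # of x(exp(u)) * exp(u / 2) over u.
    weighted = x_log * np.exp(u / 2.0)
    du = u[1] - u[0]
    kernel = np.exp(-1j * np.outer(c_values, u))
    d = (kernel @ weighted) * du / np.sqrt(2.0 * np.pi)
    mag = np.abs(d)
    # Time scaling only changes the magnitude by a constant gain,
    # which the unit-norm normalization removes.
    return mag / np.linalg.norm(mag)

def toy_sequence(t, tempo):
    """Hypothetical stand-in for an autocorrelation sequence (not real data)."""
    s = tempo * t
    return s * np.exp(-s) * (1.0 + np.cos(4.0 * np.pi * s))

def cosine_similarity(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Assumed grids: lags in seconds and scale-domain variable c.
t = np.arange(1, 40001) * 1e-3
c = np.linspace(0.0, 60.0, 241)

slow = toy_sequence(t, tempo=1.0)
fast = toy_sequence(t, tempo=1.3)  # same pattern, 30% faster

raw_similarity = cosine_similarity(slow, fast)
scale_similarity = cosine_similarity(
    scale_magnitude(slow, t, c), scale_magnitude(fast, t, c)
)
print(f"raw sequences: {raw_similarity:.3f}")
print(f"scale-transform magnitudes: {scale_similarity:.3f}")
```

In this toy setting the raw sequences drift apart as the tempo changes, while the normalized scale-transform magnitudes stay nearly identical, which is the property the descriptors rely on.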
Place, publisher, year, edition, pages
IEEE Press, 2011. Vol. 19, no 1, 176-185 p.
Computational ethnomusicology; music information retrieval (MIR); rhythmic similarity; scale transform
Research subject Computer Science; Information and Communication Technology; Media Technology; Speech and Music Communication
Identifiers
URN: urn:nbn:se:kth:diva-193758; DOI: 10.1109/TASL.2010.2045782; ISI: 000291661300015; ScopusID: 2-s2.0-77957735497; OAI: oai:DiVA.org:kth-193758; DiVA: diva2:1040417
QC 20161031
2016-10-27; 2016-10-10; 2016-11-11; Bibliographically approved