Three dimensions of pitched instrument onset detection
2010 (English)In: IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, no 6, 1517-1527 p.Article in journal (Refereed) Published
In this paper, we suggest a novel group delay based method for the onset detection of pitched instruments. It is proposed to approach the problem of onset detection by examining three dimensions separately: phase (i.e., group delay), magnitude and pitch. The evaluation of the suggested onset detectors for phase, pitch and magnitude is performed using a new publicly available and fully onset annotated database of monophonic recordings which is balanced in terms of included instruments and onset samples per instrument, while it contains different performance styles. Results show that the accuracy of onset detection depends on the type of instruments as well as on the style of performance. Combining the information contained in the three dimensions by means of a fusion at decision level leads to an improvement of onset detection by about 8% in terms of F-measure, compared to the best single dimension.
Place, publisher, year, edition, pages
IEEE Press, 2010. Vol. 18, no 6, 1517-1527 p.
Automatic music transcription; group delay; music information retrieval; onset detection
Research subject Speech and Music Communication; Computer Science; Information and Communication Technology
IdentifiersURN: urn:nbn:se:kth:diva-193767DOI: 10.1109/TASL.2009.2036298ISI: 000288375700038ScopusID: 2-s2.0-77955730228OAI: oai:DiVA.org:kth-193767DiVA: diva2:1040347
QC 201610312016-10-272016-10-102016-11-11Bibliographically approved