Change search
ReferencesLink to record
Permanent link

Direct link
Musical genre classification using Nonnegative Matrix Factorization based features
Institute of Computer Science.ORCID iD: 0000-0003-1679-6018
2008 (English)In: IEEE Transactions on Audio, Speech and Language Processing, Vol. 16, no 2, 424-434 p.Article in journal (Refereed) Published
Abstract [en]

Nonnegative matrix factorization (NMF) is used to derive a novel description for the timbre of musical sounds. Using NMF, a spectrogram is factorized providing a characteristic spectral basis. Assuming a set of spectrograms given a musical genre, the space spanned by the vectors of the obtained spectral bases is modeled statistically using mixtures of Gaussians, resulting in a description of the spectral base for this musical genre. This description is shown to improve classification results by up to 23.3% compared to MFCC-based models, while the compression performed by the factorization decreases training time significantly. Using a distance-based stability measure this compression is shown to reduce the noise present in the data set resulting in more stable classification models. In addition, we compare the mean squared errors of the approximation to a spectrogram using independent component analysis and nonnegative matrix factorization, showing the superiority of the latter approach.

Place, publisher, year, edition, pages
IEEE Press, 2008. Vol. 16, no 2, 424-434 p.
Keyword [en]
Audio classification; Audio feature extraction; Music information retrieval; Nonnegative matrix factorization
National Category
Media Engineering
Research subject
Computer Science; Media Technology; Speech and Music Communication
Identifiers
URN: urn:nbn:se:kth:diva-193764DOI: 10.1109/TASL.2007.909434ISI: 000252612100016ScopusID: 2-s2.0-39649092019OAI: oai:DiVA.org:kth-193764DiVA: diva2:1040351
Note

QC 20161031

Available from: 2016-10-27 Created: 2016-10-10 Last updated: 2016-11-11

Open Access in DiVA

fulltext(535 kB)7 downloads
File information
File name FULLTEXT01.pdfFile size 535 kBChecksum SHA-512
a6fd68402eb040ea7a901739be17948dda910ddbe6a2c467db053797d0dce22416fc90ac1605d398bfc551f87629fb81a1ed56f08788593fa110c69f4d8bf23f
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Holzapfel, André
Media Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 7 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 6 hits
ReferencesLink to record
Permanent link

Direct link