A statistical approach to musical genre classification using Non-negative Matrix Factorization
2007 (English)In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE Press, 2007, Vol. II, 693-696 p.Conference paper (Refereed)
This paper introduces a new feature set based on a Non-negtive Matrix Factorization approach for the classification of musical signals into genres, only using synchronous organization of music events (vertical dimension of music). This feature set generates a vector space to describe the spectrogram representation of a music Signal. The space is modeled statistically by a mixture of Gaussians (GMM). A new signal is classified by considering the likelihoods over all the estimated feature vectors given these statistical models, without constructing a model for the signal itself. Cross-validation tests on two commonly utilized datasets for this task show the superiority of the proposed features compared to the widely used MFCC type of representation based on classification accuracies (over 9% of improvement), as well as on a stability measure introduced in this paper for GMM.
Place, publisher, year, edition, pages
IEEE Press, 2007. Vol. II, 693-696 p.
music genre classification; Non-negative Matrix Factorization; Gaussian mixture model; MFCC
IdentifiersURN: urn:nbn:se:kth:diva-193765DOI: 10.1109/ICASSP.2007.366330ISI: 000248908100174ScopusID: 2-s2.0-34547514190OAI: oai:DiVA.org:kth-193765DiVA: diva2:1040352
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
QC 201610312016-10-272016-10-102016-11-11Bibliographically approved