Change search
ReferencesLink to record
Permanent link

Direct link
Idealized computational models for auditory receptive fields
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB.ORCID iD: 0000-0002-9081-2170
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics.ORCID iD: 0000-0003-2926-6518
2014 (English)Manuscript (preprint) (Other academic)
Abstract [en]

This paper presents a theory by which idealized models of auditory receptive fields can be derived in a principled axiomatic manner, from a set of structural properties to enable invariance of receptive field responses under natural sound transformations and ensure internal consistency between spectro-temporal receptive fields at different temporal and spectral scales.For defining a time-frequency transformation of a purely temporal sound signal, it is shown that the framework allows for a new way of deriving the Gabor and Gammatone filters as well as a novel family of generalized Gammatone filters, with additional degrees of freedom to obtain different trade-offs between the spectral selectivity and the temporal delay of time-causal temporal window functions.When applied to the definition of a second-layer of receptive fields from a spectrogram, it is shown that the framework leads to two canonical families of spectro-temporal receptive fields, in terms of spectro-temporal derivatives of either spectro-temporal Gaussian kernels for non-causal time or the combination of a time-causal generalized Gammatone filter over the temporal domain and a Gaussian filter over the logspectral domain. For each filter family, the spectro-temporal receptive fields can be either separable over the time-frequency domain or be adapted to local glissando transformations that represent variations in logarithmic frequencies over time. Within each domain of either non-causal or time-causal time, these receptive field families are derived by uniqueness from the assumptions.It is demonstrated how the presented framework allows for computation of basic auditory features for audio processing and that it leads to predictions about auditory receptive fields with good qualitative similarity to biological receptive fields measured in the inferior colliculus (ICC) and primary auditory cortex (A1) of mammals.

Place, publisher, year, edition, pages
National Category
Computer Science Language Technology (Computational Linguistics) Bioinformatics (Computational Biology) Neurosciences
URN: urn:nbn:se:kth:diva-158160OAI: diva2:774981
Swedish Research Council, 2010-4766,2012-4685

QC 20150203

Available from: 2014-12-30 Created: 2014-12-30 Last updated: 2015-02-06Bibliographically approved

Open Access in DiVA

fulltext(35962 kB)32 downloads
File information
File name FULLTEXT01.pdfFile size 35962 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Fulltext at arXiv

Search in DiVA

By author/editor
Lindeberg, TonyFriberg, Anders
By organisation
Computational Biology, CBMusic Acoustics
Computer ScienceLanguage Technology (Computational Linguistics)Bioinformatics (Computational Biology)Neurosciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 32 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 545 hits
ReferencesLink to record
Permanent link

Direct link