Slow and steady: auditory features for discriminating animal vocalizations

Ronald W Di Tullio; Linran Wei; Vijay Balasubramanian

doi:10.1101/2024.06.20.599962

Slow and steady: auditory features for discriminating animal vocalizations

bioRxiv [Preprint]. 2024 Jul 2:2024.06.20.599962. doi: 10.1101/2024.06.20.599962.

Authors

Ronald W Di Tullio^{1

2}, Linran Wei³, Vijay Balasubramanian^{1

2

4}

Affiliations

¹ David Rittenhouse Laboratory, Department of Physics and Astronomy, University of Pennsylvania, USA.
² Computational Neuroscience Initiative, University of Pennsylvania, USA.
³ David Rittenhouse Laboratory, Department of Physics and Astronomy, University of Pennsylkvania, USA.
⁴ Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA.

Abstract

We propose that listeners can use temporal regularities - spectro-temporal correlations that change smoothly over time - to discriminate animal vocalizations within and between species. To test this idea, we used Slow Feature Analysis (SFA) to find the most temporally regular components of vocalizations from birds (blue jay, house finch, American yellow warbler, and great blue heron), humans (English speakers), and rhesus macaques. We projected vocalizations into the learned feature space and tested intra-class (same speaker/species) and inter-class (different speakers/species) auditory discrimination by a trained classifier. We found that: 1) Vocalization discrimination was excellent (> 95%) in all cases; 2) Performance depended primarily on the ~10 most temporally regular features; 3) Most vocalizations are dominated by ~10 features with high temporal regularity; and 4) These regular features are highly correlated with the most predictable components of animal sounds.

Publication types

Preprint

Grants and funding

R01 EB026945/EB/NIBIB NIH HHS/United States