| Catégorie de document |
Contribution à un colloque ou à un congrès |
| Titre |
Saliency-based modeling of acoustic scenes using sparse non-negative matrix factorization |
| Sous-titre |
Soundscape, NMF, perception |
| Auteur principal |
Benjamin Cauchi |
| Co-auteurs |
Mathieu Lagrange, Nicolas Misdariis, Arshia Cont |
| Colloque / congrès |
WIAMIS - Workshop on Image and Audio Analysis for Multimedia Interactive Services. Paris : Juillet 2013 |
| Comité de lecture |
Oui |
| Année |
2013 |
| Statut éditorial |
Publié |
| Résumé |
The modelling of auditory scenes is a challenging task in Computational Auditory Scene Analysis. A method based on sparse Non-negative Matrix Factorization that can be used with no prior knowledge of the audio content to establish the similarity between scenes is proposed. The method is evaluated on a corpus of soundscapes of train stations issued from a perceptual study and results are compared with the human perception. The proposed method, by being able to focus on salient events within the scene, achieves better performances than a state-of-the-art Bag-of-Frames approach though not reaching the human performances. |
| Equipes |
Analyse et synthèse sonores, Perception et design sonores |
| Cote |
Cauchi13a |
| Adresse de la version en ligne |
http://architexte.ircam.fr/textes/Cauchi13a/index.pdf |
|