Dynamic Non-negative Models for Audio Source Separation

As seen so far, non-negative models can be quite powerful when it comes to resolving mixtures of sounds. However, in such models we often ignore temporal information, instead focusing on resolving each incoming spectrum independently. In this chaper we will present some methods that learn to incorporate the temporal aspects of sounds and use that information to perform improved separation. We will show three such models, a conlvolutive model that learns fixed temporal features, a hidden Markov model that learns state transitions and can incorporate language in- formation, and finally a continuous dynamical model that learns how sounds evolve over time and is able to resolve cases where static information is not enough.

Publications

Dynamic Non-negative Models for Audio Source Separation

Book Chapter in Audio Source Separation, Springer

Publication date: March 3, 2018

Paris Smaragdis, Gautham Mysore, Nasser Mohammadiha

Research Areas: AI & Machine Learning Audio