A Sparse Non-parametric Approach for Single Channel Separation of Known Sounds

In Neural Information Processing Systems. Vancouver, BC, Canada . December 2009

Published December 30, 2009

Paris Smaragdis, M. Shashanka, B. Raj

In this paper we present an algorithm for separating mixed sounds from a monophonic recording. Our approach makes use of training data which allows us to learn representations of the types of sounds that compose the mixture. In contrast to popular methods that attempt to extract com- pact generalizable models for each sound from training data, we employ the training data itself as a representation of the sources in the mixture. We show that mixtures of known sounds can be described as sparse com- binations of the training data itself, and in doing so produce significantly better separation results as compared to similar systems based on compact statistical models.

Learn More

Research Area:  Audio