TR2007-002
Convolutive Speech Bases and their Application to Supervised Speech Separation
-
- "Convolutive Speech Bases and their Application to Supervised Speech Separation", IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, No. 1, pp. 1-12, January 2007.BibTeX TR2007-002 PDF
- @article{Smaragdis2007jan2,
- author = {Smaragdis, P.},
- title = {Convolutive Speech Bases and their Application to Supervised Speech Separation},
- journal = {IEEE Transactions on Audio, Speech and Language Processing},
- year = 2007,
- volume = 15,
- number = 1,
- pages = {1--12},
- month = jan,
- issn = {1558-7916},
- url = {https://www.merl.com/publications/TR2007-002}
- }
,
- "Convolutive Speech Bases and their Application to Supervised Speech Separation", IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, No. 1, pp. 1-12, January 2007.
-
Research Areas:
Abstract:
In this paper we present a convolutive basis decomposition method and its application on simultaneous speakers separation from monophonic recordings. The model we propose is a convolutive version of the non-negative matrix factorization algorithm. Due to the non-negativity constraint this type of coding is very well suited for intuitively and efficiently representing magnitude spectra. We present results that reveal the nature of these basis functions and we introduce their utility in separating monophonic mixtures of known speakers.
Related News & Events
-
NEWS IEEE Transactions on Audio, Speech and Language Processing: 2 publications by Petros T. Boufounos and others Date: January 15, 2007
Where: IEEE Transactions on Audio, Speech and Language Processing
MERL Contact: Petros T. BoufounosBrief- The articles "Position and Trajectory Learning for Microphone Arrays" by Smaragdis, P. and Boufounos, P. and "Convolutive Speech Bases and their Application to Supervised Speech Separation" by Smaragdis, P. were published in IEEE Transactions on Audio, Speech and Language Processing.