TR2010-012
Subword Unit Approaches for Retrieval by Voice
-
- "Subword Unit Approaches for Retrieval by Voice", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2010.BibTeX TR2010-012 PDF
- @inproceedings{Gouvea2010mar,
- author = {Gouvea, E. and Ezzat, T. and Raj, B.},
- title = {Subword Unit Approaches for Retrieval by Voice},
- booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
- year = 2010,
- month = mar,
- url = {https://www.merl.com/publications/TR2010-012}
- }
,
- "Subword Unit Approaches for Retrieval by Voice", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2010.
-
Research Areas:
Abstract:
In this work, we describe a subword unit approach for information retrieval of items by voice. An algorithm based on the minimum description length (MDL) principle coverts an index written in terms of words with vocabulary size V into an index written in terms of phonetics subword units of size M much-less-than V. We demonstrate that, with this highly reduced vocabulary of subword units, improvement in ASR decode speed and memory footprint can be achieved, at the expense of a small drop in recall performance. Results on a music lyrics retrieval task are demonstrated.
Related News & Events
-
NEWS ICASSP 2010: 9 publications by Anthony Vetro, Shantanu D. Rane and Petros T. Boufounos Date: March 14, 2010
Where: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
MERL Contacts: Anthony Vetro; Petros T. BoufounosBrief- The papers "Privacy and Security of Features Extracted from Minutiae Aggregates" by Nagar, A., Rane, S.D. and Vetro, A., "Hiding Information Inside Structured Shapes" by Das, S., Rane, S.D. and Vetro, A., "Ultrasonic Sensing for Robust Speech Recognition" by Srinivasan, S., Raj, B. and Ezzat, T., "Reconstruction of Sparse Signals from Distorted Randomized Measurements" by Boufounos, P.T., "Disparity Search Range Estimation: Enforcing Temporal Consistency" by Min, D., Yea, S., Arican, Z. and Vetro, A., "Synthesizing Speech from Doppler Signals" by Toth, A.R., Raj, B., Kalgaonkar, K. and Ezzat, T., "Spectrogram Dimensionality Reduction with Independence Constraints" by Wilson, K.W. and Raj, B., "Robust Regression using Sparse Learning for High Dimensional Parameter Estimation Problems" by Mitra, K., Veeraraghavan, A.N. and Chellappa, R. and "Subword Unit Approaches for Retrieval by Voice" by Gouvea, E., Ezzat, T. and Raj, B. were presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).