TR2012-022
Latent Dirichlet Reallocation for Term Swapping
-
- "Latent Dirichlet Reallocation for Term Swapping", International Workshop on Statistical Machine Learning for Speech Processing (IWSML), March 2012.BibTeX TR2012-022 PDF
- @inproceedings{Heaukulani2012mar,
- author = {Heaukulani, C. and {Le Roux}, J. and Hershey, J.R.},
- title = {Latent Dirichlet Reallocation for Term Swapping},
- booktitle = {International Workshop on Statistical Machine Learning for Speech Processing (IWSML)},
- year = 2012,
- month = mar,
- url = {https://www.merl.com/publications/TR2012-022}
- }
,
- "Latent Dirichlet Reallocation for Term Swapping", International Workshop on Statistical Machine Learning for Speech Processing (IWSML), March 2012.
-
MERL Contact:
-
Research Areas:
Abstract:
This paper is an extended abstract of a work in progress, which proposes latent Dirichlet reallocation (LDR), a probabilistic model for text data from different dialects over a shared vocabulary. LDR first uses a topic model to allocate word probabilities to vocabulary terms; it then uses a subtopic model to allow for a possible reallocation of probability between a few potentially swappable terms between dialects. An MCMC inference procedure is derived, combining Gibbs sampling with Hamiltonian Monte-Carlo. Finally, we demonstrate the ability of LDR to correctly switch the probabilities for swappable terms under the subtopics using a toy example.
Related News & Events
-
NEWS IWSML 2012: publication by Jonathan Le Roux, John R. Hershey and others Date: March 31, 2012
Where: International Workshop on Statistical Machine Learning for Speech Processing (IWSML)
MERL Contact: Jonathan Le Roux
Research Area: Speech & AudioBrief- The paper "Latent Dirichlet Reallocation for Term Swapping" by Heaukulani, C., Le Roux, J. and Hershey, J.R. was presented at the International Workshop on Statistical Machine Learning for Speech Processing (IWSML).