TR2005-138

Learning Source Trajectories Using Wrapped-Phase Hidden Markov Models


Abstract:

In this paper we examine the problem of identifying trajectories of sound sources as captured from microphone arrays. Instead of employing traditional localization techniques we attach this problem with a statistical modeling approach of phase measurements. As in many signal processing applications that require the use of phase there is the issue of phase-wrapping. Even though there exists a significant amount of work on unwrapping wrapped phase estimates, when it comes to stochastic modeling this can introduce an additional level of undesirable complication. We address this issue by defining an appropriate statistical model to fit wrapped phase data, and employ it as a state model of an HMM in order to recognize sound trajectories. Using both synthetic and real data we highlight the accuracy of this model as opposed to generic HMM modeling.

 

  • Related News & Events

    •  NEWS    WASPAA 2005: 3 publications by Petros T. Boufounos, Ajay Divakaran and Paris Smaragdis
      Date: October 16, 2005
      Where: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
      MERL Contact: Petros T. Boufounos
      Brief
      • The papers "Latent Variable Decomposition of Spectrograms for Single Channel Speaker Separation" by Raj, B. and Smaragdis, P., "Learning Source Trajectories Using Wrapped-Phase Hidden Markov Models" by Smaragdis, P. and Boufounos, P. and "Audio Analysis for Surveillance Applications" by Radhakrishnan, R., Divakaran, A. and Smaragdis, P. were presented at the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
    •