TR2015-136

The third 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines


    •  Barker, J., Marxer, R., Vincent, E., Watanabe, S., "The Third 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines", IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), DOI: 10.1109/​ASRU.2015.75404837, December 2015, pp. 504-511.
      BibTeX TR2015-136 PDF
      • @inproceedings{Barker2015dec,
      • author = {Barker, J. and Marxer, R. and Vincent, E. and Watanabe, S.},
      • title = {The Third 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines},
      • booktitle = {IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)},
      • year = 2015,
      • pages = {504--511},
      • month = dec,
      • publisher = {IEEE},
      • doi = {10.1109/ASRU.2015.75404837},
      • url = {https://www.merl.com/publications/TR2015-136}
      • }
  • Research Areas:

    Artificial Intelligence, Speech & Audio

Abstract:

The CHiME challenge series aims to advance far field speech recognition technology by promoting research at the interface of signal processing and automatic speech recognition. This paper presents the design and outcomes of the 3rd CHiME Challenge, which targets the performance of automatic speech recognition in a real-world, commercially-motivated scenario: a person talking to a tablet device that has been fitted with a six-channel microphone array. The paper describes the data collection, the task definition and the baseline systems for data simulation, enhancement and recognition. The paper then presents an overview of the 26 systems that were submitted to the challenge focusing on the strategies that proved to be most successful relative to the MVDR array processing and DNN acoustic modeling reference system. Challenge findings related to the role of simulated data in system training and evaluation are discussed.