NEWS    MERL Speech & Audio Researchers Presenting 7 Papers and a Tutorial at Interspeech 2019

Date released: September 17, 2019


  • Date:

    September 15, 2019 - September 19, 2019

  • Where:

    Graz, Austria

  • Description:

    Researchers from MERL's Speech & Audio team will present 7 papers at INTERSPEECH 2019, the 20th Annual Conference of the International Speech Communication Association, held in Graz, Austria, from September 15-19, 2019. Topics include recent advances in end-to-end speech recognition, speech separation, and audio-visual scene-aware dialog. Takaaki Hori will also co-present a tutorial on end-to-end speech processing.

    Interspeech is the world's largest and most comprehensive conference on the science and technology of spoken language processing. It gathers around 2000 participants from all over the world.

  • External Link:

    https://interspeech2019.org/

  • Research Areas:

    Artificial Intelligence, Machine Learning, Speech & Audio

    •  Karafiat, M., Baskar, M.K., Watanabe, S., Hori, T., Wiesner, M., Cernocky, J.H., "Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems", Interspeech, DOI: 10.21437/Interspeech.2019-2355, September 2019, pp. 2019-2355.
      BibTeX TR2019-103 PDF
      @inproceedings{Karafiat2019sep,
        author = {Karafiat, Martin and Baskar, Murali Karthick and Watanabe, Shinji and Hori, Takaaki and Wiesner, Matthew and Cernocky, Jan Honza},
        title = {Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems},
        booktitle = {Interspeech},
        year = 2019,
        pages = {2019--2355},
        month = sep,
        doi = {10.21437/Interspeech.2019-2355},
        url = {https://www.merl.com/publications/TR2019-103}
      }
    •  Seki, H., Hori, T., Watanabe, S., Moritz, N., Le Roux, J., "Vectorized Beam Search for CTC-Attention-based Speech Recognition", Interspeech, DOI: 10.21437/Interspeech.2019-2860, September 2019, pp. 3825-3829.
      BibTeX TR2019-102 PDF
      @inproceedings{Seki2019sep2,
        author = {Seki, Hiroshi and Hori, Takaaki and Watanabe, Shinji and Moritz, Niko and Le Roux, Jonathan},
        title = {Vectorized Beam Search for CTC-Attention-based Speech Recognition},
        booktitle = {Interspeech},
        year = 2019,
        pages = {3825--3829},
        month = sep,
        doi = {10.21437/Interspeech.2019-2860},
        url = {https://www.merl.com/publications/TR2019-102}
      }
    •  Seki, H., Hori, T., Watanabe, S., Le Roux, J., Hershey, J., "End-to-End Multilingual Multi-Speaker Speech Recognition", Interspeech, DOI: 10.21437/Interspeech.2019-3038, September 2019, pp. 3755-3759.
      BibTeX TR2019-101 PDF
      @inproceedings{Seki2019sep,
        author = {Seki, Hiroshi and Hori, Takaaki and Watanabe, Shinji and Le Roux, Jonathan and Hershey, John},
        title = {End-to-End Multilingual Multi-Speaker Speech Recognition},
        booktitle = {Interspeech},
        year = 2019,
        pages = {3755--3759},
        month = sep,
        doi = {10.21437/Interspeech.2019-3038},
        url = {https://www.merl.com/publications/TR2019-101}
      }
    •  Baskar, M.K., Watanabe, S., Astudillo, R., Hori, T., Burget, L., Cernocky, J.H., "Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text", Interspeech, DOI: 10.21437/Interspeech.2019-3167, September 2019, pp. 3790-3794.
      BibTeX TR2019-100 PDF
      @inproceedings{Baskar2019sep,
        author = {Baskar, Murali Karthick and Watanabe, Shinji and Astudillo, Ramon and Hori, Takaaki and Burget, Lukas and Cernocky, Jan Honza},
        title = {Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text},
        booktitle = {Interspeech},
        year = 2019,
        pages = {3790--3794},
        month = sep,
        doi = {10.21437/Interspeech.2019-3167},
        issn = {1990-9772},
        url = {https://www.merl.com/publications/TR2019-100}
      }
    •  Wichern, G., McQuinn, E., Antognini, J., Flynn, M., Zhu, R., Crow, D., Manilow, E., Le Roux, J., "WHAM!: Extending Speech Separation to Noisy Environments", Interspeech, DOI: 10.21437/Interspeech.2019-2821, September 2019, pp. 1368-1372.
      BibTeX TR2019-099 PDF
      @inproceedings{Wichern2019sep,
        author = {Wichern, Gordon and McQuinn, Emmett and Antognini, Joe and Flynn, Michael and Zhu, Richard and Crow, Dwight and Manilow, Ethan and Le Roux, Jonathan},
        title = {WHAM!: Extending Speech Separation to Noisy Environments},
        booktitle = {Interspeech},
        year = 2019,
        pages = {1368--1372},
        month = sep,
        doi = {10.21437/Interspeech.2019-2821},
        url = {https://www.merl.com/publications/TR2019-099}
      }
    •  Moritz, N., Hori, T., Le Roux, J., "Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition", Interspeech, DOI: 10.21437/Interspeech.2019-2837, September 2019, pp. 76-80.
      BibTeX TR2019-098 PDF
      @inproceedings{Moritz2019sep,
        author = {Moritz, Niko and Hori, Takaaki and Le Roux, Jonathan},
        title = {Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition},
        booktitle = {Interspeech},
        year = 2019,
        pages = {76--80},
        month = sep,
        doi = {10.21437/Interspeech.2019-2837},
        url = {https://www.merl.com/publications/TR2019-098}
      }
    •  Hori, C., Cherian, A., Marks, T.K., Hori, T., "Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog", Interspeech, September 2019, pp. 1886-1890.
      BibTeX TR2019-097 PDF
      @inproceedings{Hori2019sep,
        author = {Hori, Chiori and Cherian, Anoop and Marks, Tim K. and Hori, Takaaki},
        title = {Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog},
        booktitle = {Interspeech},
        year = 2019,
        pages = {1886--1890},
        month = sep,
        publisher = {ISCA},
        url = {https://www.merl.com/publications/TR2019-097}
      }