EVENT SANE 2025 - Speech and Audio in the Northeast

Date released: December 16, 2025

Date:

Friday, November 7, 2025
Location:

Google, New York, NY
Description:

SANE 2025, a one-day event gathering researchers and students in speech and audio from the Northeast of the American continent, was held on Friday, November 7, 2025, at Google in New York, NY.

It was the 12th edition in the SANE series of workshops, which started in 2012 and is typically held annually, alternating between Boston and New York. Since the first edition, the audience has grown to about 200 participants and 50 posters each year, and SANE has established itself as a vibrant, must-attend event for the speech and audio community across the Northeast and beyond.

SANE 2025 featured invited talks by six leading researchers from the Northeast as well as from the wider community: Dan Ellis (Google DeepMind), Leibny Paola Garcia Perera (Johns Hopkins University), Yuki Mitsufuji (Sony AI), Julia Hirschberg (Columbia University), Yoshiki Masuyama (MERL), and Robin Scheibler (Google DeepMind). It also featured a lively poster session with 50 posters.

MERL Speech and Audio Team's Yoshiki Masuyama presented a well-received overview of the team's recent work on "Neural Fields for Spatial Audio Modeling". His talk highlighted how neural fields are reshaping spatial audio research by enabling flexible, data-driven interpolation of head-related transfer functions and room impulse responses. He also discussed the integration of sound-propagation physics into neural field models through physics-informed neural networks, showcasing MERL’s advances at the intersection of acoustics and deep learning.

SANE 2025 was co-organized by Jonathan Le Roux (MERL), Quan Wang (Google DeepMind), and John R. Hershey (Google DeepMind). SANE remained a free event thanks to generous sponsorship by Google, MERL, Apple, Bose, and Carnegie Mellon University.

Slides and videos of the talks are available from the SANE workshop website and via a YouTube playlist.
MERL Contacts:

Jonathan Le Roux; Yoshiki Masuyama
External Link:

https://www.saneworkshop.org/sane2025/
Research Areas:

Artificial Intelligence, Machine Learning, Speech & Audio
- Related Publications
  Masuyama, Y., Germain, F.G., Wichern, G., Ick, C., Le Roux, J., "Physics-Informed Direction-Aware Neural Acoustic Fields", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), DOI: 10.1109/WASPAA66052.2025.11230918, October 2025.
  BibTeX TR2025-142 PDF
  @inproceedings{Masuyama2025oct,
  author = {Masuyama, Yoshiki and Germain, François G and Wichern, Gordon and Ick, Christopher and {Le Roux}, Jonathan},
  title = {{Physics-Informed Direction-Aware Neural Acoustic Fields}},
  booktitle = {IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
  year = 2025,
  month = oct,
  doi = {10.1109/WASPAA66052.2025.11230918},
  url = {https://www.merl.com/publications/TR2025-142}
  }
  Ick, C., Wichern, G., Masuyama, Y., Germain, F.G., Le Roux, J., "Direction-Aware Neural Acoustic Fields for Few-Shot Interpolation of Ambisonic Impulse Responses", Interspeech, DOI: 10.21437/Interspeech.2025-1912, August 2025, pp. 933-937.
  BibTeX TR2025-120 PDF
  @inproceedings{Ick2025aug,
  author = {Ick, Christopher and Wichern, Gordon and Masuyama, Yoshiki and Germain, François G and {Le Roux}, Jonathan},
  title = {{Direction-Aware Neural Acoustic Fields for Few-Shot Interpolation of Ambisonic Impulse Responses}},
  booktitle = {Interspeech},
  year = 2025,
  pages = {933--937},
  month = aug,
  doi = {10.21437/Interspeech.2025-1912},
  url = {https://www.merl.com/publications/TR2025-120}
  }
  Ick, C., Wichern, G., Masuyama, Y., Germain, F.G., Le Roux, J., "Data Augmentation Using Neural Acoustic Fields With Retrieval-Augmented Pre-training", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Satellite Workshop on Generative Data Augmentation for Real-World Signal Processing Applications (GenDA), April 2025.
  BibTeX TR2025-045 PDF
  @inproceedings{Ick2025apr,
  author = {Ick, Christopher and Wichern, Gordon and Masuyama, Yoshiki and Germain, François G and {Le Roux}, Jonathan},
  title = {{Data Augmentation Using Neural Acoustic Fields With Retrieval-Augmented Pre-training}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Satellite Workshop on Generative Data Augmentation for Real-World Signal Processing Applications (GenDA)},
  year = 2025,
  month = apr,
  url = {https://www.merl.com/publications/TR2025-045}
  }
  Masuyama, Y., Wichern, G., Germain, F.G., Ick, C., Le Roux, J., "Retrieval-Augmented Neural Field for HRTF Upsampling and Personalization", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP49660.2025.10889481, April 2025.
  BibTeX TR2025-029 PDF Software
  @inproceedings{Masuyama2025mar,
  author = {Masuyama, Yoshiki and Wichern, Gordon and Germain, François G and Ick, Christopher and {Le Roux}, Jonathan},
  title = {{Retrieval-Augmented Neural Field for HRTF Upsampling and Personalization}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = apr,
  doi = {10.1109/ICASSP49660.2025.10889481},
  url = {https://www.merl.com/publications/TR2025-029}
  }
  Masuyama, Y., Wichern, G., Germain, F.G., Pan, Z., Khurana, S., Hori, C., Le Roux, J., "NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP48485.2024.10448477, March 2024, pp. 1016-1020.
  BibTeX TR2024-026 PDF Software
  @inproceedings{Masuyama2024mar,
  author = {Masuyama, Yoshiki and Wichern, Gordon and Germain, François G and Pan, Zexu and Khurana, Sameer and Hori, Chiori and {Le Roux}, Jonathan},
  title = {{NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2024,
  pages = {1016--1020},
  month = mar,
  doi = {10.1109/ICASSP48485.2024.10448477},
  url = {https://www.merl.com/publications/TR2024-026}
  }
