Software & Data Downloads — SEBBs

Sound Event Bounding Boxes for prediction of sound event bounding boxes (SEBBs).

Python implementation for the prediction of sound event bounding boxes (SEBBs). SEBBs are one-dimensional bounding boxes defined by event onset time, event offset time, sound class and a confidence. They represent sound event candidates with a scalar confidence score assigned to it. We call it (1d) bounding boxes to highlight the similarity to the (2d) bounding boxes typically used for object detection in computer vision.

With SEBBs the sensitivity of a system can be controlled without an impact on the detection of an events' on- and offset times, which the previous frame-level thresholding approaches suffer from.

  •  Ebbers, J., Germain, F.G., Wichern, G., Le Roux, J., "Sound Event Bounding Boxes", Interspeech, September 2024.
    BibTeX TR2024-118 PDF Software
    • @inproceedings{Ebbers2024sep,
    • author = {Ebbers, Janek and Germain, François G and Wichern, Gordon and Le Roux, Jonathan}},
    • title = {Sound Event Bounding Boxes},
    • booktitle = {Interspeech},
    • year = 2024,
    • month = sep,
    • url = {https://www.merl.com/publications/TR2024-118}
    • }

Access software at https://github.com/merlresearch/sebbs.