TR2024-002
Pixel-Grounded Prototypical Part Networks
- "Pixel-Grounded Prototypical Part Networks", IEEE Winter Conference on Applications of Computer Vision (WACV), DOI: 10.1109/WACV57701.2024.00470, January 2024.
@inproceedings{Carmichael2024jan,
  author    = {Carmichael, Zachariah and Lohit, Suhas and Cherian, Anoop and Jones, Michael J. and Scheirer, Walter},
  title     = {Pixel-Grounded Prototypical Part Networks},
  booktitle = {IEEE Winter Conference on Applications of Computer Vision (WACV)},
  year      = 2024,
  month     = jan,
  doi       = {10.1109/WACV57701.2024.00470},
  url       = {https://www.merl.com/publications/TR2024-002}
}
Abstract:
Prototypical part neural networks (ProtoPartNNs), namely PROTOPNET and its derivatives, are an intrinsically interpretable approach to machine learning. Their prototype learning scheme enables intuitive explanations of the form, this (prototype) looks like that (testing image patch). But, does this actually look like that? In this work, we delve into why object part localization and associated heat maps in past work are misleading. Rather than localizing to object parts, existing ProtoPartNNs localize to the entire image, contrary to generated explanatory visualizations. We argue that detraction from these underlying issues is due to the alluring nature of visualizations and an over-reliance on intuition. To alleviate these issues, we devise new receptive field-based architectural constraints for meaningful localization and a principled pixel space mapping for ProtoPartNNs. To improve interpretability, we propose additional architectural improvements, including a simplified classification head. We also make additional corrections to PROTOPNET and its derivatives, such as the use of a validation set, rather than a test set, to evaluate generalization during training. Our approach, PIXPNET (Pixel-grounded Prototypical part Network), is the only ProtoPartNN that truly learns and localizes to prototypical object parts. We demonstrate that PIXPNET achieves quantifiably improved interpretability without sacrificing accuracy.
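The pixel space mapping described in the abstract rests on standard receptive-field arithmetic: each latent (feature-map) position can only have been influenced by a bounded window of input pixels, determined by composing the kernel sizes, strides, and paddings of the layers below it. The following is a minimal sketch of that arithmetic, not the authors' implementation; the `receptive_field` helper and the example layer configuration are illustrative assumptions.

```python
def receptive_field(layers):
    """Compose per-layer (kernel, stride, padding) tuples into overall
    receptive-field parameters for the final feature map.

    Returns (rf_size, eff_stride, offset) such that latent index i along
    one spatial axis corresponds to input pixels
    [i * eff_stride - offset, i * eff_stride - offset + rf_size).
    This is the standard single-axis receptive-field recurrence; it is
    NOT the PixPNet codebase, just an illustration of the idea.
    """
    rf, jump, offset = 1, 1, 0  # identity mapping at the input
    for k, s, p in layers:
        rf += (k - 1) * jump      # kernel widens the field by (k-1) input steps
        offset += p * jump        # padding shifts the window into the border
        jump *= s                 # stride multiplies the step between latents
    return rf, jump, offset

# Hypothetical small backbone: two 3x3 convs (stride 1, pad 1),
# then a 2x2 max-pool (stride 2, no pad).
layers = [(3, 1, 1), (3, 1, 1), (2, 2, 0)]
rf, stride, offset = receptive_field(layers)
print(rf, stride, offset)  # → 6 2 2
```

Under this mapping, a prototype activation at latent index `i` is grounded to the pixel interval `[2*i - 2, 2*i + 4)`, rather than to a heat map spread over the whole image. The architectural constraint in the paper amounts to keeping this receptive field small relative to the image, so a latent match genuinely corresponds to an object part.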