Robustified ANNs Reveal Wormholes Between Human Category Percepts

TitleRobustified ANNs Reveal Wormholes Between Human Category Percepts
Publication TypeJournal Article
Year of Publication2023
AuthorsGaziv, G, Lee, MJ, DiCarlo, JJ
JournalarXiv
Date Published08/2023
Type of Articlepreprint
Abstract

The visual object category reports of artificial neural networks (ANNs) are notoriously sensitive to tiny, adversarial image perturbations. Because human category reports (aka human percepts) are thought to be insensitive to those same small-norm perturbations -- and locally stable in general -- this argues that ANNs are incomplete scientific models of human visual perception. Consistent with this, we show that when small-norm image perturbations are generated by standard ANN models, human object category percepts are indeed highly stable. However, in this very same "human-presumed-stable" regime, we find that robustified ANNs reliably discover low-norm image perturbations that strongly disrupt human percepts. These previously undetectable human perceptual disruptions are massive in amplitude, approaching the same level of sensitivity seen in robustified ANNs. Further, we show that robustified ANNs support precise perceptual state interventions: they guide the construction of low-norm image perturbations that strongly alter human category percepts toward specific prescribed percepts. These observations suggest that for arbitrary starting points in image space, there exists a set of nearby "wormholes", each leading the subject from their current category perceptual state into a semantically very different state. Moreover, contemporary ANN models of biological visual processing are now accurate enough to consistently guide us to those portals.

 

URLhttps://arxiv.org/pdf/2308.06887.pdf
DOI10.48550/arXiv.2308.06887 Focus to learn more
Refereed DesignationNon-Refereed

File: