Psychophysics for Computer Vision
Fall 2012 - Present
Description
For many classes of problems, the goal of computer vision is to solve visual challenges for which human observers have effortless expertise - face and object recognition, image segmentation, and medical image analysis, to name just a few. However, there exists a large class of problems where human performance dramatically outshines current efforts. This occurs even in areas where computer vision has been considered to be highly successful, such as the case of face detection. For example, digital cameras identify faces quickly and accurately, yet when compared to human ability to detect faces in challenging views and environments, no extant algorithm comes close to matching human performance.
There is an obvious gap between current state-of-the-art computer vision applications and human performance. While current methods are improving year by year, there is the concern that such methods will asymptote well below the level of human performance. In this work, we provide a new approach that relies on a heretofore untapped source of information, one that significantly improves performance at a rate beyond current methods. In addition, we argue that this method can be of considerable assistance even for emerging solutions that are not well-studied, as it supplies fundamental information likely to be useful for all algorithms.
We find that any reference to human performance is often non-existent or impoverished. If there is any reference, it is simply to compare overall performance, say measuring human accuracy and comparing it with that of the machine for an extended task with many items. There is much more information about human capacities that is of direct value. For example, some images are learnable and some are not. This learnability also varies with experience. Something that is initially not learnable can be learnable at a later training session. And learnability itself can be further fractionated. Some things are easily and quickly learned; some take more time. Such detailed information reflecting human capacity, which we call a perceptual annotation, is something that can be effectively used in conjunction with current algorithms. The key approach to accomplish this is to use the results obtained from the discipline of human psychophysics.
This work was supported by NIH Grant R01 EY01363, NSF IIS Award #0963668, NSF SBIR Award #IIP-1621689, NSF CNS RET Award #1609394, and a gift from the Intel Corporation
Publications
- "Informing Machine Perception With Psychophysics,", , , ,
,Proceedings of the IEEE,February 2024.[pdf] [code][bibtex]@article{dulay2024informing,
title={Informing Machine Perception With Psychophysics},
author={Dulay, Justin and
Poltoratski, Sonia and
Hartmann, Till S and
Anthony, Samuel E and
Scheirer, Walter J},
journal={Proceedings of the IEEE},
volume={112},
number={2},
pages={88--96},
year={2024},
publisher={IEEE}
}
- "Psychophysical-Score: A Behavioral Measure for Assessing the Biological, , , ,
Plausibility of Visual Recognition Models,"
,CogSci 2023,July 2023.[pdf][bibtex]@inProceedings{richardwebster2023psychophysicalscore,
title={Psychophysical-Score: A Behavioral Measure for Assessing the Biological
Plausibility of Visual Recognition Models}, author={Brandon RichardWebster and
Justin Dulay and
Anthony DiFalco and
Elisabetta Caldesi and
Walter J. Scheirer},
booktitle={CogSci},
year={2023},
}
- "Measuring Human Perception to Improve Open Set Recognition,", , , ,IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI),September 2023.[pdf] [supp. material] [code] [data][bibtex]@article{huang2023measuring,
title={Measuring Human Perception to Improve Open Set Recognition},
author={Huang, Jin and Prijatelj, Derek and Dulay, Justin and Scheirer, Walter},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
volume={45},
number={9},
pages={11382--11389},
year={2023},
publisher={IEEE}
}
- "Measuring Human Perception to Improve Handwritten Document Transcription,", , , , , ,
, ,IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI),October 2022.[pdf][bibtex]@article{grieggs2021measuring,
title={Measuring human perception to improve handwritten document transcription},
author={Grieggs, Samuel and
Shen, Bingyu and
Rauch, Greta and
Li, Pei and
Ma, Jiaqi and
Chiang, David and
Price, Brian andi
Scheirer, Walter J},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
volume={44},
number={10},
pages={6594--6601},
year={2021},
publisher={IEEE}
}
- "Self-Driving Vehicles: Key Technical Challenges and Progress Off the Road,", , ,IEEE Potentials,January-February 2020.[pdf][bibtex]@article{MilfordPotentials2020,
author = {Michael Milford and
Samuel E. Anthony and
Walter J. Scheirer},
title = {Self-Driving Vehicles: Key Technical Challenges and Progress Off the Road},
journal = {IEEE Potentials},
volume = {39},
number = {1},
month = {January-February},
year = {2020}
}
- "PsyPhy: A Psychophysics Driven Evaluation Framework for Visual Recognition,", , ,IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI),September 2019.[pdf] [code] [supp. material][bibtex]@article{RichardWebsterPsyPhy2019,
author = {Brandon RichardWebster and
Samuel E. Anthony and
Walter J. Scheirer},
title = {PsyPhy: {A} Psychophysics Driven Evaluation Framework for Visual Recognition},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI)},
volume = {41},
number = {9},
month = {September},
year = {2019}
}
- "Convolutional Neural Networks for Subjective Face Attributes,", , , ,
,Image and Vision Computing (IVC),October 2018.[pdf][bibtex]@article{McCurie_2018_IVC,
author = {Mel McCurie and Fernando Beletti and Lucas Parzianello and Allen Westendorp and
Samuel E. Anthony and Walter J. Scheirer},
title = {Convolutional Neural Networks for Subjective Face Attributes},
journal = {Image and Vision Computing (IVC)},
volume = {78},
month = {October},
year = {2018}
}
- "Visual Psychophysics for Making Face Recognition Algorithms More Explainable,", , , ,
,Proceedings of the European Conference on Computer Vision (ECCV),September 2018.[pdf] [code][bibtex]@InProceedings{RichardWebesterFace2018,
author = {Brandon RichardWebster and
So Yon Kwon and
Christopher Clarizio and
Samuel E. Anthony and
Walter J. Scheirer},
title = {Visual Psychophysics for Making Face Recognition Algorithms More Explainable},
booktitle = {European Conference on Computer Vision (ECCV)},
year = {2018}
}
- "Predicting First Impressions with Deep Learning,", , , ,
,Proceedings of the IEEE Conference on Automatic Face and Gesture Recognition (FG),May 2017.[pdf] [code & data][bibtex]@InProceedings{McCurie_2017_FG,
author = {Mel McCurie and Fernando Beletti and Lucas Parzianello and Allen Westendorp and
Samuel E. Anthony and Walter J. Scheirer},
title = {Predicting First Impressions with Deep Learning},
booktitle = {IEEE Conference on Automatic Face and Gesture Recognition (FG)},
month = {May},
year = {2017}
}
- "Perceptual Annotation: Measuring Human Vision to Improve Computer Vision,", , ,IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI),August 2014.[pdf][bibtex]@article{Scheirer_2014_TPAMI,
author = {Walter J. Scheirer and Samuel E. Anthony and Ken Nakayama and David D. Cox},
title = {Perceptual Annotation: Measuring Human Vision to Improve Computer Vision},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI)},
volume = {36},
issue = {8},
month = {August},
year = {2014}
}
Posters
Code
- The Basic Psychophysically Informed Learning code is available on GitHub
- The MSD-Net with Psychophysical Loss code is available on GitHub
- The PsyPhy code is available on GitHub
- The Perceptual Annotation code is available on GitHub
Data Sets
- Portilla-Simoncelli Images and Perceptual Annotations from TestMyBrain.org
- Biologically-inspired Features (FDDB and AFLW) and Perceptual Annotations (AFLW)
- HOG Features (FDDB and Portilla-Simoncelli Set) and Perceptual Annotations (Portilla-Simoncelli Set)
Press Coverage
- Boston Globe: "Teaching self-driving cars to read minds"
- MIT Technology Review: "Machine-Vision Algorithm Learns to Judge People by Their Faces"