Human-Like Coarse Object Representations in Vision Models
arXiv:2602.12486v1 Announce Type: new Abstract: Humans appear to represent objects for intuitive physics with coarse, volumetric “bodies” that smooth concavities – trading fine visual details for efficient physical predictions – yet their internal structure is largely unknown. Segmentation models, in contrast, optimize pixel-accurate masks that may misalign with such bodies. We ask whether and when these models nonetheless acquire human-like bodies. Using a time-to-collision (TTC) behavioral paradigm, we introduce a comparison pipeline and alignment metric, then vary model […]