The mid-level vision toolbox for computing structural properties of real-world images

Mid-level vision is the intermediate visual processing stage for generating representations of shapes and partial geometries of objects. Our mechanistic understanding of these operations is limited, in part, by a lack of computational tools for analyzing image properties at these levels of represent...

Full description

Bibliographic Details
Main Authors: Dirk B. Walther, Delaram Farzanfar, Seohee Han, Morteza Rezanejad
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-09-01
Series:Frontiers in Computer Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fcomp.2023.1140723/full
_version_ 1797687109748785152
author Dirk B. Walther
Delaram Farzanfar
Seohee Han
Morteza Rezanejad
author_facet Dirk B. Walther
Delaram Farzanfar
Seohee Han
Morteza Rezanejad
author_sort Dirk B. Walther
collection DOAJ
description Mid-level vision is the intermediate visual processing stage for generating representations of shapes and partial geometries of objects. Our mechanistic understanding of these operations is limited, in part, by a lack of computational tools for analyzing image properties at these levels of representation. We introduce the Mid-Level Vision (MLV) Toolbox, an open-source software that automatically processes low- and mid-level contour features and perceptual grouping cues from real-world images. The MLV toolbox takes vectorized line drawings of scenes as input and extracts structural contour properties. We also include tools for contour detection and tracing for the automatic generation of vectorized line drawings from photographs. Various statistical properties of the contours are computed: the distributions of orientations, contour curvature, and contour lengths, as well as counts and types of contour junctions. The toolbox includes an efficient algorithm for computing the medial axis transform of contour drawings and photographs. Based on the medial axis transform, we compute several scores for local mirror symmetry, local parallelism, and local contour separation. All properties are summarized in histograms that can serve as input into statistical models to relate image properties to human behavioral measures, such as esthetic pleasure, memorability, affective processing, and scene categorization. In addition to measuring contour properties, we include functions for manipulating drawings by separating contours according to their statistical properties, randomly shifting contours, or rotating drawings behind a circular aperture. Finally, the MLV Toolbox offers visualization functions for contour orientations, lengths, curvature, junctions, and medial axis properties on computer-generated and artist-generated line drawings. We include artist-generated vectorized drawings of the Toronto Scenes image set, the International Affective Picture System, and the Snodgrass and Vanderwart object images, as well as automatically traced vectorized drawings of set architectural scenes and the Open Affective Standardized Image Set (OASIS).
first_indexed 2024-03-12T01:13:35Z
format Article
id doaj.art-42f18ca06848478e8046df126285f357
institution Directory Open Access Journal
issn 2624-9898
language English
last_indexed 2024-03-12T01:13:35Z
publishDate 2023-09-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Computer Science
spelling doaj.art-42f18ca06848478e8046df126285f3572023-09-13T19:44:33ZengFrontiers Media S.A.Frontiers in Computer Science2624-98982023-09-01510.3389/fcomp.2023.11407231140723The mid-level vision toolbox for computing structural properties of real-world imagesDirk B. WaltherDelaram FarzanfarSeohee HanMorteza RezanejadMid-level vision is the intermediate visual processing stage for generating representations of shapes and partial geometries of objects. Our mechanistic understanding of these operations is limited, in part, by a lack of computational tools for analyzing image properties at these levels of representation. We introduce the Mid-Level Vision (MLV) Toolbox, an open-source software that automatically processes low- and mid-level contour features and perceptual grouping cues from real-world images. The MLV toolbox takes vectorized line drawings of scenes as input and extracts structural contour properties. We also include tools for contour detection and tracing for the automatic generation of vectorized line drawings from photographs. Various statistical properties of the contours are computed: the distributions of orientations, contour curvature, and contour lengths, as well as counts and types of contour junctions. The toolbox includes an efficient algorithm for computing the medial axis transform of contour drawings and photographs. Based on the medial axis transform, we compute several scores for local mirror symmetry, local parallelism, and local contour separation. All properties are summarized in histograms that can serve as input into statistical models to relate image properties to human behavioral measures, such as esthetic pleasure, memorability, affective processing, and scene categorization. In addition to measuring contour properties, we include functions for manipulating drawings by separating contours according to their statistical properties, randomly shifting contours, or rotating drawings behind a circular aperture. Finally, the MLV Toolbox offers visualization functions for contour orientations, lengths, curvature, junctions, and medial axis properties on computer-generated and artist-generated line drawings. We include artist-generated vectorized drawings of the Toronto Scenes image set, the International Affective Picture System, and the Snodgrass and Vanderwart object images, as well as automatically traced vectorized drawings of set architectural scenes and the Open Affective Standardized Image Set (OASIS).https://www.frontiersin.org/articles/10.3389/fcomp.2023.1140723/fullmid-level visionperceptual groupinggestalt grouping rulescontour drawingsmedial axis transformsymmetry
spellingShingle Dirk B. Walther
Delaram Farzanfar
Seohee Han
Morteza Rezanejad
The mid-level vision toolbox for computing structural properties of real-world images
Frontiers in Computer Science
mid-level vision
perceptual grouping
gestalt grouping rules
contour drawings
medial axis transform
symmetry
title The mid-level vision toolbox for computing structural properties of real-world images
title_full The mid-level vision toolbox for computing structural properties of real-world images
title_fullStr The mid-level vision toolbox for computing structural properties of real-world images
title_full_unstemmed The mid-level vision toolbox for computing structural properties of real-world images
title_short The mid-level vision toolbox for computing structural properties of real-world images
title_sort mid level vision toolbox for computing structural properties of real world images
topic mid-level vision
perceptual grouping
gestalt grouping rules
contour drawings
medial axis transform
symmetry
url https://www.frontiersin.org/articles/10.3389/fcomp.2023.1140723/full
work_keys_str_mv AT dirkbwalther themidlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages
AT delaramfarzanfar themidlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages
AT seoheehan themidlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages
AT mortezarezanejad themidlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages
AT dirkbwalther midlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages
AT delaramfarzanfar midlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages
AT seoheehan midlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages
AT mortezarezanejad midlevelvisiontoolboxforcomputingstructuralpropertiesofrealworldimages