Perception is the process through which the brain organizes and interprets sensory information to build a meaningful experience of the world. These processes occur automatically and unconsciously, through the interaction between what we perceive and what we already know.
Perception is therefore subjective: our past experiences, expectations, selective attention, and prior knowledge shape the way we observe the world. Sensory systems receive physical stimuli and transform them into neural signals, which the brain processes, attributing form, meaning, and identity.
In this sense, perception represents the link between a physical object and the meaning we assign to it. Shapes, colors, materials, and movements are interpreted in milliseconds, guiding our reactions even before conscious processing occurs. As James J. Gibson notes, “perception is not an internal representation of the world, but a direct process of exploring the environment” (Gibson, 1979), highlighting the active role of the observer in interacting with objects.
Visual Object Recognition
Visual recognition occurs in two main stages: an initial stage of structural description and a second stage of cognitive processing.
In the first stage, the physical features of the stimulus—such as luminance, color, shape, and orientation—are analyzed. This phase allows the figure to be distinguished from the background, without yet assigning meaning to the perceived object.
The second stage occurs when the perceived shape is compared with memory. It is at this stage that the object is recognized as familiar, engaging attention, memory, language, imagination, and consciousness.
A clear example is Bev Doolittle’s Pintos: at first, only indistinct patches are perceived, but when the horses become recognizable, the perception of the image changes dramatically. As Mecacci observes (2001, p.123), “this ‘knowing’ that we are perceiving something involves another very complex mental process called consciousness: we are aware of perceiving, remembering, and needing to pay attention.”
Bottom-Up and Top-Down Processing
Visual recognition also relies on two complementary modes: bottom-up and top-down processing.
Bottom-up processing starts from sensory data, analyzing individual elements of the stimulus. It dominates when an object is new, ambiguous, or unexpected. For example, you might first notice a buzzing near your ear and then a small moving object, allowing you to recognize a fly.
Top-down processing, on the other hand, relies on knowledge, expectations, and pre-existing mental representations. A common example is moving around your house at night without turning on the lights, navigating based on your knowledge of the objects’ arrangement.
The balance between these modes depends on the familiarity of the object and the context. Familiar objects or those consistent with the environment favor top-down processing, while unusual objects require more careful analysis of sensory data.
Feature Integration Theory
According to Anne Treisman, visual perception occurs in two stages: the first automatically detects elementary features such as color, shape, orientation, and movement; the second integrates these features through attention, forming a coherent whole—the object as we perceive it.
Perception is therefore an active process, with attention playing a central role in making sense of visual information. Understanding these mechanisms is essential for analyzing how objects are perceived, interpreted, and used.
Sources:
• Gibson, J. J. (1979). The Ecological Approach to Visual Perception. Houghton Mifflin.
• Mecacci, L. (2001). Manual of General Psychology. Giunti.
• Britannica, T. Editors of Encyclopaedia (n.d.). Perception. Encyclopaedia Britannica.
• Neuroscienze.net. (n.d.). Visual Perception and Design.
• Santagostino Magazine. (n.d.). Selective Attention.