Midlevel vision Flashcards

Question

Artificial neural networks

Answer 1

inspired by the structure of the brain: They consist of layers of units ("nodes") that mimic neurons. Nodes are interconnected, like axons and synapses in the brain. Learning occurs as the strength of connections changes with experience — similar to synaptic plasticity.

Answer 2

type of ANN with many layers (nodes) — this "depth" allows it to learn very complex patterns. the number of nodes (depth) of a network distinguishes a single neural network from a deep learning mode) Used in AI applications like: Facial recognition Google Home Self-driving cars Medical image analysis (e.g., reading mammograms) Modern advances in computing power and memory allow for deeper networks with millions of parameters. DNNs learn on their own by adjusting connections based on training data — no manual programming required.

Answer 3

1. Input Layer: Image is fed into the system. 2. First Layer: Extracts basic features (like edges, similar to simple cells in the visual cortex). a set of features is extracted from the image (think: simple cells) 3. Pooling Layer: Combines info to detect patterns (like complex cells). 4. Repeated Layers: Each layer extracts higher-order features, based on the previous one. operations create a new image from which the next layer of the DNN will extract features 5. Final Layer: Contains "neurons" that fire in response to specific object categories (similar to the idea of a "grandmother cell" in the brain).

Answer 4

The features of the faces don’t jumble together like the houses do. (we can more easily separate and tell apart overlapping faces from overlapping houses)

Answer 5

Instead of analyzing individual parts (eyes, nose, mouth) - feature-by-feature analysis, we process the face as a whole — a single unified representation. This is called holistic processing. It aligns with Gestalt principles, where “the whole is greater than the sum of its parts.” ➡️ Example: You might not notice if someone gets a haircut or has a blemish, because your brain doesn’t fixate on individual features — it sees the overall facial patten

Answer 6

Face Inversion: Turning a face upside down disrupts holistic processing. You’re forced to look at features individually, like with other objects. This explains the "face inversion effect", where it’s much harder to recognize upside-down faces. Low contrast or unusual lighting can also impair holistic face recognition.

Answer 7

Face Blindness A neurological disorder where someone can’t recognize faces — even familiar ones: 🔸 Types: Congenital (developmental): Person is born with it. Brain has normal face-selective regions/face patches, but connections between them are impaired. Acquired: Caused by damage to the temporal lobes, especially the Fusiform Face Area (FFA) in the ventral "what" pathway. People may know they are looking at a face, can detect emotion or gender, but can’t identify the person. Often use voice, clothing, or other cues to recognize others.

Answer 8

- ext: music -int: thoughts in head

Answer 9

-overt: orient sensory receptors to what you want to pay attention to - look at them -covert: not obviously pay attention - eavesdrop

Answer 10

driving and having convo, 2 things happening at once so attention is divided, multitasking

Answer 11

focus on activity, reading/sewing.. Task keeps attention for long/sustained time

Answer 12

pay specific attention to one stimulus - cocktail effect

Answer 13

In a visual search task, an observer looks for a target item among distractors. These tasks simulate real-world search behaviors. Typically, the target is present in 50% of trials. Reaction time (RT) is measured as the time it takes to say “yes” (target present) or “no” (target absent). -As set size increases (more items on screen), reaction time increases. -Saying "yes" (target present) is usually faster than "no" (target absent) because: To say “no,” you often need to check every item.

Answer 14

Efficiency is measured by the slope of the RT vs. set size graph: Shallow slope = more efficient search Steep slope = less efficient search

Answer 15

Definition: Target differs from distractors by a single, obvious feature (e.g., color, shape, orientation). Example: Finding a red dot among blue dots. Key Traits: Salient features make the target "pop out" Processed in parallel — the brain checks all items at once Reaction time stays the same, even as set size increases ✅ Highly efficient

Answer 16

Definition: Target shares features with distractors (e.g., color and shape) Example: Finding a red circle among red squares and blue circles Key Traits: -Requires examining items one-by-one -Known as a serial self-terminating search Ends when you find the target or finish checking all items Reaction time increases with set size ❌ Less efficient Where’s Waldo: What Type of Search? ✅ Serial self-terminating search -Waldo shares many features with other objects You must scan carefully, item by item — no "pop out" Cannot process all at once = inefficient

Answer 17

Visual search in the real world is not purely parallel or purely serial — it's guided. Attention is directed to the most likely candidates based on basic features (e.g., color, size, orientation). You don’t check every item blindly — you use what you know about the target to narrow down the search. use basic features to identify - colour) 🧠 Example: Looking for a red apple in a fruit bowl? You can ignore all non-red items.

Answer 18

arget is defined by a combination of features (not just one). Example: Searching for a black suitcase with a Canadian flag pin — both color and icon matter. Another example: Looking for tomatoes = red + round shape. Conjunction searches are slower than feature searches, but faster than fully serial searches, thanks to feature-based guidance.

Answer 19

Definition: Exposure to a stimulus facilitates faster or easier recognition of that stimulus (or similar ones) later. In visual search, seeing an object once can "prime" your brain to find it more quickly the next time. 🧠 Example: If you spot a monkey puppet in one part of a scene, you'll be faster at spotting other monkey puppets in later parts — even if you're not consciously thinking about it. ✅ Priming improves reaction time and accuracy.

Answer 20

Your understanding of how typical environments are structured helps guide your attention during search. You use contextual knowledge to focus your search in likely locations. 🧠 Example: If you're looking for a faucet, you’ll naturally search near a sink, not in the middle of the floor. This type of guidance relies on your experience and memory of the real world.

Answer 21

Our brain processes different features of an object (color, shape, motion, orientation) in separate neural circuits. The challenge: How do we combine ("bind") these features to perceive a single, unified object? Feature Integration Theory (Anne Treisman)

Answer 22

Main Idea: Some features (e.g., color, orientation) are processed automatically and in parallel — even before we focus attention. However, correctly binding features to the right object requires focused attention. 🔄 Two Stages of Processing: Preattentive stage: Fast, parallel, Processes basic features (e.g., “there’s something red”), Doesn’t bind features to objects yet Focused attention stage: Required for binding features into a coherent perception (e.g., “that red object is a tomato”) Binding = conscious attention 🧠 Example: You notice red before you know you’re searching for a tomato.

Answer 23

Occur when attention is not fully deployed You miscombine features from different objects Example: See a brown “B” and a red “C” but report a red “B” — features were present, but not correctly bound ✅ Evidence that binding needs attention

Answer 24

Contralateral neglect: Ignore one side of space (typically the left side after right hemisphere damage) Line cancellation task: Only cross out lines on the right side Patient may be unaware that the left side even exists

Answer 25

Patient can detect a stimulus on either side if shown alone But when both sides are stimulated, they fail to notice the contralesional one Competition for attention → stimulus on the damaged side is “extinguished” 👁️ Therapy: Train patients to scan their full visual field