Exam Preparation Deck Flashcards
What is the advantage of using a 3D face representation in face detection? Any disadvantages?
- Pose/viewing angle/illumination invariance may be achieved.
- Enormous computational/memory requirements (up to 1GB per face)
- Amounts to inverse optics: Have to generate 3D face from 2D images.
Why does the world seem to be uniformly coloured and resolved, even though only the fovea can do such detections?
- Internal visual representation built from multiple fovea frames over time.
- Supports “vision is graphics”: Human vision is a result of a complex graphical process. It is not a direct encoding of input signals.
- Shows the importance of data integration over time.
What computation methods are often used to find the active contours?
- Gradient descent
- Simulated annealing
- Partial differential equations
- Iterative numerical methods
Descibe the two major ways of motion detection.
How do they relate spatial and temporal gradients of the image?
- Ratio of local time-derivative to spatial gradient gives estimate of local image velocity.
- Time derivative of Laplacian-Gaussian-convolved image in the vicinity of Laplacian zero-crossing. Amplitude gives speed, sign gives direction (relative to contour normal).
![](https://s3.amazonaws.com/brainscape-prod/system/cm/215/178/651/a_image_thumb.png?1496407057)
Define the 3x3 Laplacian operator.
How is it used? Why is the sum of all taps 0?
Used for edge detection.
Gives no response to areas of uniform brightness.
![](https://s3.amazonaws.com/brainscape-prod/system/cm/215/178/856/a_image_thumb.png?1496407356)
State the compression rate of MPEG (both interframe and intraframe).
How does MPEG compress videos?
Both 50-50%.
Extracted object motions, so predictions of trajectories are possible. This allows a mode of compression.
Describe the three Hadamard conditions for well-posed problem.
- Its solution exists.
- Its solution is unique.
- Its solution depends continuously on input.
What is functional streaming?
- The division-of-labor within the mammalian brain.
- Seems to have different streams for different image processing, such as color/texture processing.
- Different parts of brain specializes in specific tasks.
- But how do they get integrated?
Describe the reflectance map.
- Relates intensities of image to surface orientations of objects.
- Specifies the fraction of incident light reflected, per unit surface area per unit solid angle in camera direction.
- Specified by three parameters: i (illuminant angle), e (emitted ray angle), g (angle between illuminant and emitted ray)
Describe Bayesian inference. How does it work?
Define:
- Prior probability
- Posterior probability
Drawing inferences from data. Takes account of two major information:
- Prior knowledge, usually defined as unconditioned probabilities.
- Conditional probabilities on class conditional data.
The Bayes’ rule often used…. Here P(C_k) is prior.
![](https://s3.amazonaws.com/brainscape-prod/system/cm/215/180/889/a_image_thumb.png?1496410232)
What does the inner/outer plexiform layer of the mammalian retina do? What’s its purpose?
Outer layer:
- Performs spatial centre/surround comparisons
- Uses on-centre / off-surround isotropic receptive field structures.
- Can be seen as edge detection, or some kind of bandpass filtering.
Inner layer:
- Similar function in time. Sensitive to motion or dynamic aspect of images.
What’s the advantage of second order differential operator… over first order ones?
- First order operators detect edges in polar-sensitive way: positive for +right edge, negative for +left edge.
- Second order ones have the advantage of producing zero-crossings at an edge.
What does the expression ‘signal-to-symbol’ converter mean?
- To the human body, the external world exists as physical signals on sensory surfaces.
- Vision converts it into high-level symbols, which is easier to understand and manipulate for humans.
- Shows why computer vision is hard. It cannot be done merely by signal processing methods.
- There has to be a bridge between ‘signal’ and ‘symbol’.
Define the correspondence problem.
Establishing the point-to-point correspondences in two different images.
State the advantages of Fourier Transform in Computer Vision.
- Convolution can be made more efficient, given kernels of size > 5x5.
- Texture detection can be done by Fourier analysis, as textures are well defined by spatial frequency and orientation characteristics.
- Motion can be detected, by exploiting the spectral co-planarity theorem.