Exam Preparation Deck Flashcards
What is the advantage of using a 3D face representation in face detection? Any disadvantages?
- Pose/viewing angle/illumination invariance may be achieved.
- Enormous computational/memory requirements (up to 1GB per face)
- Amounts to inverse optics: Have to generate 3D face from 2D images.
Why does the world seem to be uniformly coloured and resolved, even though only the fovea can do such detections?
- Internal visual representation built from multiple fovea frames over time.
- Supports “vision is graphics”: Human vision is a result of a complex graphical process. It is not a direct encoding of input signals.
- Shows the importance of data integration over time.
What computation methods are often used to find the active contours?
- Gradient descent
- Simulated annealing
- Partial differential equations
- Iterative numerical methods
Descibe the two major ways of motion detection.
How do they relate spatial and temporal gradients of the image?
- Ratio of local time-derivative to spatial gradient gives estimate of local image velocity.
- Time derivative of Laplacian-Gaussian-convolved image in the vicinity of Laplacian zero-crossing. Amplitude gives speed, sign gives direction (relative to contour normal).
Define the 3x3 Laplacian operator.
How is it used? Why is the sum of all taps 0?
Used for edge detection.
Gives no response to areas of uniform brightness.
State the compression rate of MPEG (both interframe and intraframe).
How does MPEG compress videos?
Both 50-50%.
Extracted object motions, so predictions of trajectories are possible. This allows a mode of compression.
Describe the three Hadamard conditions for well-posed problem.
- Its solution exists.
- Its solution is unique.
- Its solution depends continuously on input.
What is functional streaming?
- The division-of-labor within the mammalian brain.
- Seems to have different streams for different image processing, such as color/texture processing.
- Different parts of brain specializes in specific tasks.
- But how do they get integrated?
Describe the reflectance map.
- Relates intensities of image to surface orientations of objects.
- Specifies the fraction of incident light reflected, per unit surface area per unit solid angle in camera direction.
- Specified by three parameters: i (illuminant angle), e (emitted ray angle), g (angle between illuminant and emitted ray)
Describe Bayesian inference. How does it work?
Define:
- Prior probability
- Posterior probability
Drawing inferences from data. Takes account of two major information:
- Prior knowledge, usually defined as unconditioned probabilities.
- Conditional probabilities on class conditional data.
The Bayes’ rule often used…. Here P(C_k) is prior.
What does the inner/outer plexiform layer of the mammalian retina do? What’s its purpose?
Outer layer:
- Performs spatial centre/surround comparisons
- Uses on-centre / off-surround isotropic receptive field structures.
- Can be seen as edge detection, or some kind of bandpass filtering.
Inner layer:
- Similar function in time. Sensitive to motion or dynamic aspect of images.
What’s the advantage of second order differential operator… over first order ones?
- First order operators detect edges in polar-sensitive way: positive for +right edge, negative for +left edge.
- Second order ones have the advantage of producing zero-crossings at an edge.
What does the expression ‘signal-to-symbol’ converter mean?
- To the human body, the external world exists as physical signals on sensory surfaces.
- Vision converts it into high-level symbols, which is easier to understand and manipulate for humans.
- Shows why computer vision is hard. It cannot be done merely by signal processing methods.
- There has to be a bridge between ‘signal’ and ‘symbol’.
Define the correspondence problem.
Establishing the point-to-point correspondences in two different images.
State the advantages of Fourier Transform in Computer Vision.
- Convolution can be made more efficient, given kernels of size > 5x5.
- Texture detection can be done by Fourier analysis, as textures are well defined by spatial frequency and orientation characteristics.
- Motion can be detected, by exploiting the spectral co-planarity theorem.