Geometry Flashcards
Why should we remove pincushion and barrel distortion before processing
The geometry used in this class assumes a pinhole camera model, this model has no distortion
Briefly describe Zhangs method for camera calibration
- Take at least 3 images of a known surface (checkerboard)
- Use these images to calculate homographies.
- Estimate K from these homographies
- Use iterative, non-linear methods to calculate the full set of intrinsic parameters
If we estimate the camera pose from the K matrix and corresponding world/image points mathematically, the rotation part will typically not be a true rotation matrix. How can we fix this?
We can use the SVD to get the closest true rotation matrix in Frobenius-norm.
What is the difference between PnP methods and iterative methods for camera pose estimation from 3D data?
The PnP methods are non-iterative, faster and need fewer data points.
What do we need to know to extract 3D information from single view cameras?
We need objects/ regions with a known 3D structure, like planar regions, parallel lines, horizontal surfaces, vertical structures.
What is the vanishing point of a line
Infinite lines in the real world have finite length in the image and vanish at a point. This is the vanishing point.
How is the vanishing point for parallel lines related
Parallel lines in the same plane have the same vanishing point
What is the vanishing line, and how is this related to vanishing points
The vanishing line is where planes disappear in the image (Like the horizon). All vanishing points for lines in the plane intersect the vanishing line of that plane.
If the vanishing line of the horizontal plane is straight and runs through the center of the image, how is the camera rotated?
The camera is straight and level.
Which parameters determine the epipolar plane for two view geometry?
The position of the cameras and the observed points
What are the epipoles?
The epipoles are where the baseline intersects the images
What is the Q matrix used for in stereo geometry
The Q matrix is used to reproject points in the image to the world, it gives the world coordinates from image coordinates.
What is the DSI (Disparity Space Image) used in Stereo processing
DSI is a mapping R3->R with pixel coordinates u,v and disparity d as input and output indicating how well it matches.
What is E, the essential matrix, in two view geometry
The essential matrix relates a point in the first normalized image plane to an epipolar line in the second normalized image plane.
How many point correspondences do we need to estimate E, the essential matrix
At least 5
What is F, the fundamental matrix, in two view geometry
The fundamental matrix relates a point in one image to an (epipolar) line in the second image plane
How many point correspondences do we need to estimate F, the fundamental matrix
At least 7, but 8 is often used.
Describe the 8-point algorithm for computing F, the fundamental matrix.
- Normalize using similarity transforms
- Compute A from the point correspondences
- Use SVD and extract F_mark from the right Singular value vector
- Calculate the SVD of F_mark.
- Set s_33 = 0 to enforce a true Fundamental matrix
- Denormalize F
What is the fundamental difference between the 8 and 7 point algorithm for computing F?
The 7 point algorithm will give a 2-D nullspace of Ah. F can be found through a linear combination of the basis vectors with the constraint det(F) = 0. This constraint gives rise to a cubic polynomial which gives 1 or 3 solutions for F.
What is the theory behind two-view triangulation and why can’t we use this directly in practice.
Theoretically, two image points can be back-projected into the world and we can determine the intersection of the two lines formed. In practical applications, noise will usually result in these two lines not intersecting.
What is the problem with minimizing geometric error in triangulation, and what linear alternatives do we have?
Minimizing the geometric error will usually not minimize the reprojection error and the method doesn’t naturally extend to more than two cameras.
We could instead minimize the algebraic error using a least squares approach
How do we reduce the non-linear reprojection minimization problem of triangulation from 3 parameters of X, the point in the world, to 1 parameter?
We utilize that the epipolar lines can be described with 1 parameter, and minimize the error from u, u’ to their polar line.