vision Flashcards
How to define the camera position
The camera position can be defined by a translation [Tx, Ty,Tz] and a rotation in the 3D space which is represented by 3*3 matrix
How to define the reotation in the 3D space
it can be defined by 3 values, euler angles that can be represented by a 33 matrix, know that all the 3D rotations can be represented by a 33 matrix but not all the 3*3 matrices are 3D rotations
what are the Euler angles
the describe 3angles that define the orientation of a rigid body
what are the extrinsic parameters
correspond to the combination of the rotation and translation to define the camera
what is the main idea behind a pinhole camera
project 3D points in the space to a 2D one, such that every 3D point is represented by 2 points of the sensor
what is the difference between extrinsic and intrinsic parameters
extensive is more related to the camera position and intensity are more related to the project of the points in the space
what is the focal length
it is the distance between the the lens and the sensor , we use it to represent the sensor points in term of space points
how can we represent the world coordinates to pixel coordinates
one can use the mixture of intrinsic and extrinsic parameters
explain the stereovision settings
The image is seen by two cameras, they are both on the same epipolar line, the distance between them is called the baseline and their optical view is parallel to each other
why the angle of the eye change and is not parallel all the time
through constantly modifying the angles leads to obtain wider fields between the intersection of the cameras projection
why we may need the stereovision
stereovision allows acquiring the geometry of the object
what is the disparity
reflects the difference in the projection
what is the relation between the distance and the dipary
The higher is the distance the higher is the disparity
How the objects are detecion happen
the idea is that when both camera projections meet, that field is captured
How the objects are detected happen
the idea is that when both camera projections meet, that field is captured