Computer Vision Terminology Flashcards
Computer vision
Computer vision is a subfield of artificial intelligence and computer science that focuses on enabling computers to understand and interpret the visual world. Essentially, it’s about teaching computers to “see” and understand digital images or videos.
Simultaneous localization and mapping (SLAM)
Simultaneous Localization and Mapping, or SLAM, is a computational problem in the field of robotics. As the name implies, it’s about doing two things at the same time:
Localization: Determining where a robot is located in an environment.
Mapping: Building a map of that environment.
vSLAM: Initialization
“Initialization” refers to the process of setting up the initial conditions or starting point for the algorithm.
vSLAM: Local mapping
Local mapping is where the robot builds a smaller, more immediate map of its surroundings, often referred to as a local map.
vSLAM: Loop closure
The idea of loop closure is to correct this drift by recognizing when the robot returns to a place it has visited before. When the robot recognizes such a place, it can “close the loop”, correcting its current position estimate and map to align with the previous visit.
vSLAM: Relocalization
It refers to the ability of a robot to determine its current location in a map that it previously built or in a known environment, particularly after it has lost track of its position due to an error, disturbance, or after it has been manually moved (also known as the “kidnapped robot” problem).
vSLAM: Tracking
“Tracking” typically refers to the process of continuously estimating the robot’s motion and position over time based on its sensor data.
Human pose estimation (HPE)
Human pose estimation (HPE) is a computer vision task that involves determining the position and orientation of the human body, along with the positions of various body parts such as the head, arms, legs, and so on, usually in real-time.
Rigid pose estimation (RPE)
Rigid Pose Estimation (RPE) is a concept in computer vision and robotics that involves determining the position and orientation (the “pose”) of an object that does not deform or change shape — in other words, a “rigid” object. The term ‘rigid’ indicates that the distance between any two points on the object remains constant over time, regardless of the object’s movement or orientation.
Global map optimization
Process of improving the accuracy and consistency of a map that a robot has created of its environment.
Global positioning system (GPS) signal
The Global Positioning System (GPS) is a satellite-based navigation system that provides location and time information in all weather conditions, anywhere on or near the Earth where there is an unobstructed line of sight to four or more GPS satellites.
GPS-degraded environment
A GPS-degraded environment refers to any situation or location where the Global Positioning System (GPS) signals are unreliable, weak, or completely unavailable.
GPS-denied environment
A GPS-denied environment is a location or situation where the Global Positioning System (GPS) signals are not available at all.
Dead reckoning data
Dead reckoning is a process used in navigation to determine one’s current position based on a previously known position, or fix, and advancing that position based upon known or estimated speeds over a period of time, and the direction in which the person or vehicle is known or estimated to have moved.
Robot drift
“Robot drift” is a term often used in the context of robotics and refers to the accumulated error in a robot’s estimated position and orientation over time.