Computer Vision Glossary Flashcards
Optical Character Recognition (OCR)
The technology that enables machines to convert images of text into machine-readable text data.
Azure AI Vision
A service on the Azure platform that provides AI-powered vision capabilities, including OCR.
Read API
The OCR engine within Azure AI Vision, used to extract text from images, PDFs, and TIFF files.
Machine Learning Model
Algorithms trained on data to recognize patterns and make predictions, used in OCR to identify text elements.
Bounding Box
A rectangular region that marks the location of an object within an image, described by its coordinate points.
Vision Studio
A graphical user interface within Azure that allows users to access and experiment with AI vision capabilities without needing to code.
REST API
A standardized way to interact with web services using HTTP requests, used to programmatically access the Read API.
SDK (Software Development Kit)
A set of tools and resources that developers can use to build applications, used for accessing the Read API through programming languages.
JSON
A lightweight, text-based data interchange format used to represent data structures, commonly used in APIs to return structured data.
Natural Language Processing (NLP)
A field of AI focused on enabling computers to understand, interpret, and generate human language.
Face Detection
The process of identifying the presence and location of human faces within an image or video.
Facial Analysis
The process of examining specific facial features to derive additional information.
Facial Recognition
The process of identifying individuals from their facial features using trained models.
Azure AI Face Service
A Microsoft Azure service that provides pre-built algorithms for face detection, recognition, and analysis.
Accessories
Objects such as glasses, masks, or headwear that can be detected on a face.