Computer Vision Glossary Flashcards

1
Q

Optical Character Recognition (OCR)

A

The technology that enables machines to convert images of text into machine-readable text data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Azure AI Vision

A

A service on the Azure platform that provides AI-powered vision capabilities, including OCR.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Read API

A

The OCR engine within Azure AI Vision, used to extract text from images, PDFs, and TIFF files.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Machine Learning Model

A

Algorithms trained on data to recognize patterns and make predictions, used in OCR to identify text elements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bounding Box

A

A rectangular region that marks the location of an object within an image, described by its coordinate points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Vision Studio

A

A graphical user interface within Azure that allows users to access and experiment with AI vision capabilities without needing to code.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

REST API

A

A standardized way to interact with web services using HTTP requests, used to programmatically access the Read API.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SDK (Software Development Kit)

A

A set of tools and resources that developers can use to build applications, used for accessing the Read API through programming languages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

JSON

A

A lightweight, text-based data interchange format used to represent data structures, commonly used in APIs to return structured data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Natural Language Processing (NLP)

A

A field of AI focused on enabling computers to understand, interpret, and generate human language.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Face Detection

A

The process of identifying the presence and location of human faces within an image or video.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Facial Analysis

A

The process of examining specific facial features to derive additional information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Facial Recognition

A

The process of identifying individuals from their facial features using trained models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Azure AI Face Service

A

A Microsoft Azure service that provides pre-built algorithms for face detection, recognition, and analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Accessories

A

Objects such as glasses, masks, or headwear that can be detected on a face.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Occlusion

A

The blocking of a face in an image by an object, impacting accuracy of detection and analysis.

17
Q

Limited Access

A

A policy restricting access to advanced features of the Azure AI Face service, requiring approval from Microsoft.

18
Q

Responsible AI

A

Microsoft’s approach to AI development, which includes ethical guidelines for the design and implementation of AI technologies.

19
Q

Liveness Detection

A

The process of determining whether an input source is real or fake to prevent manipulation or spoofing.

20
Q

Convolutional Neural Network (CNN)

A

A type of deep learning architecture commonly used in computer vision, which uses filters to extract feature maps from images.

21
Q

Deep Learning

A

A subset of machine learning that uses neural networks with many layers to learn complex patterns from data.

22
Q

Feature Map

A

An array of numeric values that result from applying a filter to an image, used in deep learning models.

23
Q

Filter (Kernel)

A

An array of numeric values used to perform convolutional filtering, modifying pixel values in an image.

24
Q

Image Classification

A

The process of predicting the category or class of an image.

25
Q

Multi-Modal Model

A

An AI model trained using multiple types of data, encapsulating relationships between image features and text embeddings.

26
Q

Object Detection

A

The process of detecting and locating specific objects within an image, and classifying them.

27
Q

Pixel

A

A single point of color in a digital image, represented by numerical values.

28
Q

Resolution

A

The dimensions of an image, measured in pixels, indicating its quality or clarity.

29
Q

Transformer

A

A type of neural network architecture commonly used in NLP, encoding words as vector-based embeddings.

30
Q

Vector-based Embedding

A

An array of numeric values that represent semantic attributes of a word or token.