Computer Vision Flashcards by Dia3 Kims

What is computer vision?

A field of artificial intelligence that enables computers to interpret and understand visual information from the world, such as images and videos.

How do computers see images?

Images are represented as a grid of pixels, with each pixel having a numerical value corresponding to its color or intensity.
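
As a rough illustration of the pixel grid described above, a tiny grayscale image can be written as a list of rows. The pixel values and variable names here are made up for the example; real projects usually load images into arrays with a library rather than typing them by hand.

```python
# A tiny 3x4 grayscale image stored as a grid (a list of rows).
# Each number is one pixel's intensity: 0 is black, 255 is white.
image = [
    [0,   64, 128, 255],
    [32,  96, 160, 224],
    [255, 192, 128,  0],
]

height = len(image)      # number of rows
width = len(image[0])    # number of columns

# Pixels are addressed as image[row][column].
top_right = image[0][3]

# A color pixel holds three values (red, green, blue) instead of one.
red_pixel = (255, 0, 0)

print(height, width, top_right)
```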

What is a grayscale image?

An image where each pixel has a single value representing its light or dark intensity, typically ranging from 0 (black) to 255 (white).
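
One common way to obtain that single intensity value from a color pixel is a weighted sum of its red, green, and blue channels. The sketch below assumes the widely used 0.299 / 0.587 / 0.114 luminance weights and 8-bit channels; other weightings exist, and the function name is hypothetical.

```python
def rgb_to_gray(r, g, b):
    """Weighted sum of the channels, rounded and clamped to the 0-255 range."""
    value = round(0.299 * r + 0.587 * g + 0.114 * b)
    return max(0, min(255, value))

black = rgb_to_gray(0, 0, 0)
white = rgb_to_gray(255, 255, 255)
red = rgb_to_gray(255, 0, 0)

print(black, white, red)
```

Green contributes most to the result because the weights approximate how bright each color appears to the human eye.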

What is pixel analysis in computer vision?

The process of breaking down an image into tiny dots called pixels, each with a specific color and brightness value.

What is feature extraction?

The analysis of an image to identify patterns, shapes, edges, and other features.
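
Edges are one of the simplest features to extract. As a minimal sketch (not any particular library's method), the hypothetical function below marks places where brightness changes sharply between horizontally neighboring pixels.

```python
def horizontal_edges(image):
    """Absolute difference between each pixel and its right-hand neighbor."""
    return [
        [abs(row[x + 1] - row[x]) for x in range(len(row) - 1)]
        for row in image
    ]

# Dark region on the left, bright region on the right.
image = [
    [10, 10, 200, 200],
    [10, 10, 200, 200],
]

edges = horizontal_edges(image)
print(edges)  # [[0, 190, 0], [0, 190, 0]]
```

The large value in the middle column marks the boundary between the dark and bright regions.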

What is the purpose of pattern recognition in computer vision?

To compare identified features to known patterns or objects that the computer has been trained to recognize.
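
A simple way to picture this comparison is nearest-neighbor matching: pick the known pattern whose feature vector is closest to the one just extracted. The feature values, pattern names, and function names below are hypothetical and chosen only for illustration.

```python
def squared_distance(a, b):
    """Sum of squared differences between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest_pattern(features, known_patterns):
    """Return the name of the known pattern closest to the given features."""
    return min(
        known_patterns,
        key=lambda name: squared_distance(features, known_patterns[name]),
    )

# Hypothetical features: [roundness, number of corners].
known_patterns = {
    "circle": [1.0, 0.0],
    "square": [0.0, 4.0],
}

match = nearest_pattern([0.9, 0.2], known_patterns)
print(match)  # circle
```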

What role does machine learning play in computer vision?

It uses statistical and computational methods to train models on large image datasets, allowing computers to understand visual data by learning from patterns.

What is a Convolutional Neural Network (CNN)?

A type of artificial neural network designed to process images by passing them through multiple layers, including convolutional layers that learn to detect patterns such as edges and textures.
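
The core operation of a convolutional layer can be sketched in plain Python: slide a small kernel over the image and take a weighted sum at each position. This is a simplified illustration with no padding and a stride of 1. In a real CNN the kernel values are learned during training, whereas the kernel here is set by hand.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            total = 0
            for di in range(kh):
                for dj in range(kw):
                    total += image[i + di][j + dj] * kernel[di][dj]
            row.append(total)
        output.append(row)
    return output

def relu(feature_map):
    """Replace negative responses with zero, as a typical activation does."""
    return [[max(0, v) for v in row] for row in feature_map]

image = [
    [0, 0, 9, 9],
    [0, 0, 9, 9],
    [0, 0, 9, 9],
]

# Hand-set kernel that responds to dark-to-bright vertical edges.
vertical_edge_kernel = [
    [-1, 1],
    [-1, 1],
]

feature_map = relu(conv2d(image, vertical_edge_kernel))
print(feature_map)  # [[0, 18, 0], [0, 18, 0]]
```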

What is the significance of the year 1957 in computer vision?

The first known digital image scanner, the ‘Cyclograph,’ was developed, transforming images into grids of numbers.

Who are David Hubel and Torsten Wiesel?

Researchers whose experiments on the visual cortex of cats revealed how the brain processes visual information.

What is Optical Character Recognition (OCR)?

A technology that recognizes text in images, whatever font or typeface it is printed in, and converts it into machine-readable text.

What is image classification?

The ability to classify an image into a specific category, such as identifying a dog or a person’s face.
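
In practice a classifier usually produces one score per category, and the predicted category is the one with the highest probability. The sketch below assumes a softmax over made-up scores; the labels and score values are hypothetical and stand in for the output of a trained model.

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]  # shift for numerical stability
    total = sum(exps)
    return [e / total for e in exps]

labels = ["cat", "dog", "person"]   # hypothetical categories
scores = [1.0, 3.0, 0.5]            # hypothetical model outputs

probs = softmax(scores)
predicted = labels[probs.index(max(probs))]
print(predicted)  # dog
```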

What is object detection?

The process of identifying and locating objects within an image or video.
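
Locations are commonly expressed as bounding boxes, and a standard way to compare a predicted box with a reference box is intersection over union (IoU). The sketch below assumes boxes given as (x_min, y_min, x_max, y_max); the box coordinates are made up.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Width and height are zero when the boxes do not overlap.
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

predicted = (0, 0, 10, 10)
reference = (5, 5, 15, 15)
overlap = iou(predicted, reference)
print(round(overlap, 3))  # 0.143
```

An IoU of 1.0 means the boxes coincide exactly, and 0.0 means they do not overlap at all.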

What does image segmentation involve?

Dividing an image into distinct segments, each representing a specific object or region.

What is semantic segmentation?

Classifying each pixel in an image into a specific class or category without distinguishing between different instances.

What is instance segmentation?

Identifying and distinguishing individual objects within an image, providing precise masks for each object.
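
The difference from semantic segmentation is easiest to see in the masks themselves. In this toy sketch (class and instance IDs are hypothetical), two cats share one class label in the semantic mask but receive separate IDs in the instance mask.

```python
# Semantic mask: 0 = background, 1 = "cat". Both cats share the same label.
semantic_mask = [
    [1, 1, 0, 1, 1],
    [1, 1, 0, 1, 1],
]

# Instance mask: 0 = background, 1 = first cat, 2 = second cat.
instance_mask = [
    [1, 1, 0, 2, 2],
    [1, 1, 0, 2, 2],
]

def unique_labels(mask):
    """Sorted non-background labels that appear in the mask."""
    return sorted({v for row in mask for v in row} - {0})

def binary_mask(mask, label):
    """Per-object mask: 1 where the pixel belongs to the given label."""
    return [[1 if v == label else 0 for v in row] for row in mask]

print(unique_labels(semantic_mask))   # [1]
print(unique_labels(instance_mask))   # [1, 2]
second_cat = binary_mask(instance_mask, 2)
print(second_cat)                     # [[0, 0, 0, 1, 1], [0, 0, 0, 1, 1]]
```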

What is panoptic segmentation?

A hybrid method that combines semantic and instance segmentation: every pixel gets a class label, and pixels that belong to countable objects also get an instance ID.

What is keypoint detection?

Identifying specific, important points in an image to understand shapes, poses, or movements.

What is image captioning?

Combining computer vision and natural language processing to generate descriptive text for an image.

What are some applications of computer vision?

Self-driving cars
Medical imaging
Google Translate
Optical Character Recognition
Facial Recognition
Machine inspection / Surveillance
Fingerprint recognition and biometrics
QR Code Scanning
3D model building

What is a major challenge in computer vision related to data?

Training deep learning models requires large amounts of labeled data, which can be time-consuming and expensive to collect.

What is domain adaptation in computer vision?

Techniques to improve model performance on data from a different domain, such as different lighting or camera angles.

What programming language is often recommended for computer vision projects?

Python, due to its ease of use, versatility, and extensive libraries for computer vision applications.

What is the ImageNet dataset?

A large-scale dataset with millions of labeled images across thousands of categories used for object recognition.

What is OpenCV?

A library for image processing and traditional computer vision tasks.

True or False: Convolutional Neural Networks (CNNs) are used in video applications.

True. CNNs are primarily used for image understanding, but they are also applied to video, for example frame by frame; Recurrent Neural Networks (RNNs) are often added to model how the frames change over time.

What is the benefit of using machine learning in computer vision?

It allows computers to learn from patterns in data, improving their ability to classify and detect objects without being explicitly programmed for each case.

Fill in the blank: The first known digital image scanner was developed in the year _____.

1957

What is Google's Open Images dataset?

A massive dataset with millions of images annotated for object detection, segmentation, and visual relationships.
Footnote: Supports a wide range of vision tasks.

What is OpenCV used for?

Image processing and traditional computer vision tasks.
Footnote: A widely-used library in the field.

Name two popular deep learning frameworks for training and deploying models.

* TensorFlow
* PyTorch
Footnote: These frameworks are commonly used for various machine learning applications.

What is Keras?

A high-level neural network API that works well with TensorFlow.
Footnote: Often used for prototyping.

What is Scikit-image?

A Python library for image processing with tools for filtering, segmentation, and feature extraction.
Footnote: Useful for various image analysis tasks.

What are NumPy and Pandas used for?

Data manipulation and preprocessing.
Footnote: Essential libraries in Python for data science.

What libraries are used for data visualization and plotting results?

* Matplotlib
* Seaborn
Footnote: These libraries help in visualizing data and results effectively.

What is PyTorch Hub?

A library of pretrained models for PyTorch.
Footnote: Provides easy access to state-of-the-art models for various tasks.

What is TensorFlow Hub?

A repository of pretrained machine learning models for TensorFlow.
Footnote: Models can be easily reused and fine-tuned for various tasks.

What is YOLO known for?

Real-time object detection.
Footnote: Can detect multiple objects in a single pass.

What are the versions of YOLO mentioned?

* YOLOv6
* YOLOv7
Footnote: Popular for applications requiring fast, accurate detection.

What is ResNet used for?

Image classification tasks.
Footnote: Notable for its deep architecture and residual connections.
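
A residual connection adds a block's input back onto its output, so the block only has to learn a correction to the input. The toy sketch below uses a hypothetical stand-in for the block's layers; it illustrates the idea only and is not ResNet's actual layer stack.

```python
def residual_block(x, transform):
    """Return transform(x) + x, added element by element."""
    fx = transform(x)
    return [a + b for a, b in zip(fx, x)]

def toy_transform(x):
    """Hypothetical stand-in for the block's convolutional layers."""
    return [0.1 * v for v in x]

def zero_transform(x):
    """A block that has learned nothing yet: outputs all zeros."""
    return [0.0 for _ in x]

out = residual_block([1.0, 2.0, 3.0], toy_transform)
passthrough = residual_block([1.0, 2.0, 3.0], zero_transform)
print(passthrough)  # [1.0, 2.0, 3.0]
```

When the transform outputs zeros, the block simply passes its input through, which is one reason very deep stacks of such blocks remain trainable.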

What is Mask R-CNN?

A deep learning model for object instance segmentation.
Footnote: Provides detailed masks for each instance of objects in an image.

True or False: Mask R-CNN is particularly popular in fields like medical imaging.

True
Footnote: It is also used in applications requiring precise object localization.

Computer Vision Flashcards

(42 cards)