Deep Learning-based Deep Neural Networks and Applications Flashcards

Question

What particular image recognition application are CNNs used for?

Answer 1

car systems

Answer 2

Neocognitron (1980), LeNet-5 (1989-1998), AlexNet (2012)

Answer 3

measures the likelihood of an object's identity in an image

Answer 4

a neural network created to mimic the way the eye processes image information

Answer 5

its more complex but more effective for image processing

Answer 6

the neurons that are activated depend on where the "cat" is looking at the screen (only specific areas of neurons are activated at certain times)

Answer 7

there's a neural network that corresponds to neurons for each region on the screen

Answer 8

an operation that multiplies the weights of surrounding pixel values and adds them then uses this as a new pixel

Answer 9

using convolutional filters

Answer 10

distance to apply the kernel (if stride=1 then move kernel 1 pixel at a time, stride=2 skip one pixel each move, etc)

Answer 11

(m*n)/(s+1) * (m*n)/(s+1) (aka 1/stride)

Answer 12

numbers listed in the filter (correlates to the weight of FNN)

Answer 13

process of modifying weight (modifying numbers inside the filter) which allows features to be effectively extracted from the image

Answer 14

stack CNNs like perceptrons allowing you to learn simple info in low layers and abstract info in high layers

Answer 15

the information processed at each layer of multiple stacked CNNs that is organized and expressed graphically

Answer 16

activation map

Answer 17

the result of extracting features from the input image

Answer 18

for object recognition

Answer 19

false, it becomes more abstract

Answer 20

by connecting multiple layers

Answer 21

for video data, because CNN has excellent ability to compress video data but the output of CNN is only 2D so the FNN helps with that

Answer 22

pooling aka subsampling

Answer 23

calculations are faster when the layer size is smaller and the probability of overfitting decreases with smaller neural network parameters

Answer 24

hand-crafted features

Answer 25

because it is designed with human intuition

Answer 26

learning feature extraction and classification is simultaneous

Answer 27

since the entire process from input to output is learned at once

Answer 28

convolution layer and pooling layer

Answer 29

end-to-end learning, feature learning, uses tens to hundreds of layers, maintains original structure of data, and partial connectivity and weight sharing

Answer 30

dramatically reduces the number of operations

Answer 31

a recurrent neural network is a type of ANN that is capable of processing sequential data

Answer 32

a feedback loop

Answer 33

sequential data such as time series, speech recognition, and handwriting recognition

Answer 34

ordered data (either temporal or spatial)

Answer 35

stock prices, text and audio data

Answer 36

to remember historical data

Answer 37

standard neural networks don't remember data from the distant past

Answer 38

ability to: handle variable-length inputs, track long-term dependencies, and keep information about the order

Answer 39

data used to train recurrent neural networks

Answer 40

cutting data to certain lengths to make multiple training sample

Answer 41

when past learning influences future learning outcomes

Answer 42

language modeling and generation, music composition, speech recognition, and time series anomaly detection

Answer 43

ability to recognize features or objects and recognize emotions based off of facial expressions, mainly focuses on discernment abilities

Answer 44

multi-layer perceptron, CNN, RNN, etc

Answer 45

AI that imitates human handwriting, art styles, etc.

Answer 46

HMM and deep learning based generation model

Answer 47

classification models determine which classes inputs belong to, and generative models learn patterns from input data

Answer 48

a neural network that automatically discovers patterns in the training data and creates new samples that resemble the probability distribution of the training data

Answer 49

learn the latent space representation of training data

Answer 50

a neural network that learns input data and performs encoding and decoding to produce an output identical to the input

Answer 51

feature learning, dimensionality reduction, representation learning

Answer 52

anomaly detection, abnormal financial transactions, visualization and restoration of data, meaning extraction, and image search

Answer 53

encoder, latent space, decoder, loss function

Answer 54

encodes input into the latent space

Answer 55

high-level features that compress and represent the original pattern

Answer 56

unpacks the latent space and restores it to the input

Answer 57

uses MSE of input and output images

Answer 58

GAN and cycleGAN

Answer 59

Restricted Boltzmann Machine, a model proposed by Hinton and is an unsupervised generative model that captures higher-order correlations in the data

Answer 60

extracts important features and combines features to form patterns

Answer 61

feature extraction, dimensionality reduction, e.g. spectrum analysis

Answer 62

shallow neural network with only 2 layers (visible layer and hidden layer)

Answer 63

Generative Adversarial Network

Answer 64

Ian Goodfellow and others in 2014

Answer 65

2 competing neural networks in a zero-sum game where one aims to fool the other by producing realistic candidates

Answer 66

create fake images or videos that look real

Answer 67

the generator (DCNN) and the discriminator (CNN)

Answer 68

creates new data instances and is trained to produce data that is similar to the training set

Answer 69

evaluates the data instances for authenticity and is trained to distinguish between the generated data and real data

Answer 70

adversarial training

Answer 71

when is begins, the generator doesn't know what to generate so random noise is supplied as input, and it makes random noise output, then the discriminator has an easy time distinguishing between real and fake, but as training goes on, the replicas and distinguishing gets better

Answer 72

using average pixel value calculation we can produce images but they will be the same every times, so generative models need stochastic random elements to influence the output, essentially the random noise acts as a seed

Answer 73

expose it to 10 million randomly chosen youtube video thumbnails over 3 days

Answer 74

unsupervised learning to identify relevant parts of a photo based on data patterns

Answer 75

1 billion connections and 16,000 computers

Answer 76

adjust difficulty, personalize, and improve immersion

Answer 77

based on the player's performance it adjusts the difficulty to balance the difficulty and player skill level in order to improve player engagement

Answer 78

analyze player data and behavior to create tailored experiences and content recommendations

Answer 79

improve text to speech, speech to text, and speech synthesis

Answer 80

GAN uses 2 adversarial neural networks, and cycleGAN uses 4 (2 generators and 2 discriminators)

Answer 81

google, MS, facebook, twitter, and baidu

Answer 82

voice recognition, translation, cat image recognition, youtube recommendations, automatic tagging, etc.

Answer 83

released TensorFlow source code

Answer 84

detecting breed of dog from photo, simultaneous interpretation technology for Cortana and Skype

Answer 85

Adam Project

Answer 86

creating DeepFace to recognize human faces, faces can be recognized from various angles or lighting based on user-uploaded pictures, translator tool

Answer 87

photo analysis

Answer 88

Prof. Andrew Ng, the leader of the Brain project at Google, moved to Baidu in 2014

Answer 89

Naver, Daum Kakao, Samsung, LG, etc

Answer 90

voice recognition, news summaries

Answer 91

image recognition

Answer 92

accuracy increased and recognition time decreased

Answer 93

XAI (Explainable AI) which means in addition to recognition results, the reason is also explained

Answer 94

in text, video, and voice recognition

Answer 95

learning on large scale data, range getting wider, accuracy improving, scope of application is expanding (many high expectations for the 4th industrial revolution)

Answer 96

they can performs multiple computations simultaneously

Answer 97

they have thousands of cores

Answer 98

3 times faster when processing parallel tasks

Answer 99

parallelism because it requires millions of calculations

Answer 100

NPU (neural processing unit)

Answer 101

a processing unit specifically designed to speed up neural networks also called IPUs (intelligent processing unit)

Answer 102

NPU is specialized for AI tasks and GPU is versatile

Answer 103

graphics rendering, scientific simulations, and computation heavy tasks

Answer 104

the AI semiconductor market

Answer 105

next generation NPUs

Answer 106

it has convenient software kits that returns results when you put information into the library

Answer 107

2015 by Google

Answer 108

a deep learning framework implemented in C++ with multiple interfaces that can be accessed from multiple languages through Python

Answer 109

dataflow graphs

Answer 110

TensorFlow

Answer 111

feedforward, convolutional, and recurrent neural networks and combinations of them

Answer 112

CPU and GPU

Answer 113

sequential linear stack model

Answer 114

search, ads, google maps, street view, translation, YouTube, etc

Answer 115

recognition or cursive letters or numbers, voice, image, NLP and machine translation

Answer 116

Facebook's AI team

Answer 117

a Python-based open source ML library

Answer 118

2016, then stabilized in April 2019

Deep Learning-based Deep Neural Networks and Applications Flashcards

(152 cards)