11 - The Eyes of a Machine Flashcards
Who are the co-founders of the Department of Neurobiology at Harvard?
David Hubel and Torsten Wiesel
They were awarded the Nobel Prize in Physiology or Medicine in 1981.
What was the main focus of Hubel and Wiesel’s research?
The visual system of cats
Their work involved mapping the visual cortex and understanding neural responses to visual stimuli.
What invention did Hubel create in 1957 for recording neuron activity?
A tungsten electrode
This electrode was preferred over glass micropipettes and steel electrodes for its durability.
What anesthetic was used on cats during Hubel and Wiesel’s experiments?
Intraperitoneal thiopental sodium
This was administered to keep the cats under anesthesia during the experiments.
What is a ‘visual field’?
The region in front of us that our eyes are sensitive to
It changes as we move our eyes and focus on different objects.
Define ‘receptive field’.
The portion of the visual field that triggers a single neuron
The size of the receptive field can vary significantly among different neurons.
What are retinal ganglion cells?
Neurons that monitor the image on the retina
They are the first layer of neurons receiving inputs from the retina.
What is the function of a ‘simple cell’ in the visual cortex?
It fires only when all its connected retinal ganglion cells fire together
Simple cells are sensitive to specific orientations of edges.
What does a ‘complex cell’ do?
It fires in response to an edge regardless of its position in the receptive field
This indicates spatial or translational invariance.
What does ‘invariance’ refer to in the context of vision?
The ability to recognize stimuli regardless of their position or orientation
This includes translational and rotational invariance.
What is a ‘hypercomplex cell’?
A cell that fires for an edge of a specific length and orientation
It can detect complex shapes and patterns.
What was the name of the first neural network-based image recognition system?
Cognitron
Developed by Kunihiko Fukushima in 1975.
How does the learning algorithm in the cognitron work?
It strengthens synaptic connections based on neuron firing activity
This is akin to Hebbian learning principles.
What limitation did the cognitron have?
It was not translation invariant
This means it recognized patterns differently based on their position in the visual field.
What architecture did the neocognitron adopt from Hubel and Wiesel’s work?
It includes S-cells and C-cells
These cells model simple and complex cells found in the visual cortex.
What is the role of S-cells in the neocognitron?
They respond to specific features like vertical edges
S-cells feed into C-cells to indicate edge presence.
What does the term ‘grandmother cell’ refer to?
A hypothetical neuron that fires when seeing a specific complex stimulus
This concept is often used to illustrate the idea of specific neural responses to familiar faces.
What are C-cells in the context of the visual processing model?
C-cells pool the responses of S-cells tuned to the same feature (say, a vertical edge) across different patches of the visual field.
What is the role of S-cells in the visual processing architecture?
S-cells detect specific features within small patches of the visual field; successive layers of S-cells build on the pooled outputs of the C-cells below them, so features are detected anywhere in the visual field.
What does translation invariance refer to in the neocognitron?
The ability to detect patterns regardless of their position or distortions.
Who developed the neocognitron and what was its significance?
Fukushima developed the neocognitron, which advanced pattern recognition in machines.
What was the main limitation of the neocognitron’s training algorithm?
It adjusted only the weights of the S-cells and was cumbersome.
What breakthrough did Yann LeCun achieve in the field of neural networks?
He developed the convolutional neural network (CNN) using the backpropagation algorithm.
How did the meeting between LeCun and Fukushima impact the field?
Fukushima was surprised to learn that LeCun had been working in parallel on the same problem.
Who were the key figures in the debate about cognitive capabilities, and what were their positions?
Piaget believed in learning during development; Chomsky argued for innate capabilities.
What was Papert’s stance during the Piaget-Chomsky debate?
Papert supported Piaget, arguing that Chomsky underestimated the role of learning.
What is a perceptron?
An early type of artificial neural network used to analyze learning and cognition.
What was LeCun’s key realization regarding learning algorithms?
A learning algorithm should minimize an objective function.
What is the difference between a loss function and an objective function?
The objective function includes a regularizer to prevent overfitting, while the loss function does not.
What is the purpose of a regularizer in machine learning?
To prevent overfitting by adding an extra term to the loss function.
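The loss-versus-objective split can be sketched in a few lines of Python. This is a minimal illustration: the quadratic loss, the L2 penalty, and the strength `lam` are common choices assumed here, not details from the text.

```python
# Sketch: objective = loss + regularizer.
# mse_loss measures fit to the data; the extra L2 term penalizes
# large weights to discourage overfitting.

def mse_loss(y_pred, y_true):
    """Mean squared error over a list of predictions."""
    return sum((p - t) ** 2 for p, t in zip(y_pred, y_true)) / len(y_true)

def objective(y_pred, y_true, w, lam=0.1):
    """Objective = loss + lam * sum of squared weights (L2 regularizer)."""
    return mse_loss(y_pred, y_true) + lam * sum(wi ** 2 for wi in w)

loss = mse_loss([1.0, 2.0], [1.0, 0.0])          # loss alone: 2.0
obj = objective([1.0, 2.0], [1.0, 0.0], [3.0])   # loss plus 0.1 * 9 penalty
```

Minimizing `objective` instead of `mse_loss` trades a slightly worse fit for smaller, better-behaved weights.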
What was the focus of LeCun’s doctoral work?
Developing a learning algorithm related to backpropagation for multi-layer neural networks.
What did LeCun present at a conference in 1985?
A poorly written paper in French on his learning algorithm.
What significant software did LeCun and Bottou create?
SN, which later evolved into Lush, an early neural network simulation tool.
What major dataset did LeCun work with at Bell Labs?
Images of handwritten digits from the U.S. Postal Service.
What is the convolution operation in the context of image processing?
An operation that combines two functions to produce a third, particularly for feature extraction.
What are kernels in convolutional neural networks?
Small matrices used to filter images and detect features such as edges.
What is the effect of convolution on the size of an image?
Convolution typically reduces the size of the output image.
What is the stride in convolution operations?
The number of pixels by which the kernel moves across the image during convolution.
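The sliding-kernel operation can be sketched in plain Python. This is a naive illustration (like most CNN libraries, it actually computes cross-correlation); the `conv2d` name and the edge-detecting kernel are assumptions for the example.

```python
# Naive 2D convolution: the kernel slides across the image by `stride`
# pixels, and each output value is the weighted sum of the pixels under it.

def conv2d(image, kernel, stride=1):
    ih, iw = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for r in range(0, ih - kh + 1, stride):
        row = []
        for c in range(0, iw - kw + 1, stride):
            row.append(sum(image[r + i][c + j] * kernel[i][j]
                           for i in range(kh) for j in range(kw)))
        out.append(row)
    return out

# A vertical-edge kernel on a 4x4 image with a sharp left/right boundary:
img = [[0, 0, 9, 9]] * 4
k = [[1, -1], [1, -1]]       # responds where brightness changes left-to-right
print(conv2d(img, k))        # [[0, -18, 0]] * 3: large magnitude at the edge
```

Note that the 4×4 input shrinks to a 3×3 output, and a larger stride would shrink it further.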
What is the formula for calculating the output image size after convolution?
((i - k) / s) + 1, rounded down, where i is the input size, k is the kernel size, and s is the stride.
True or False: The original image size remains the same after applying a convolution operation.
False. Without padding, the output is smaller than the input.
What is the term used for the distance the kernel moves in a convolution operation?
Stride
The stride can affect the size of the output image.
How is the output image size calculated in a convolution operation?
((i - k) / s) + 1
Where i is the input image size, k is the kernel filter size, and s is the stride.
What is padding in the context of convolutional networks?
Dummy pixels added around the input image
Padding can affect the output size and feature extraction.
What is the output size when using a 3×3 kernel on a 28×28 image with a stride of 1?
26×26
This is calculated without padding.
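The size formula, extended with the usual padding term ((i + 2p - k) / s) + 1, can be checked with a small helper. A sketch; `conv_output_size` is an illustrative name, not from the text.

```python
# Output side length after convolving an i x i image with a k x k kernel.
# Integer division implements the "rounded down" in the flashcard formula.

def conv_output_size(i, k, stride=1, padding=0):
    return (i + 2 * padding - k) // stride + 1

print(conv_output_size(28, 3))             # 26: the 28x28, 3x3, stride-1 case
print(conv_output_size(28, 3, padding=1))  # 28: padding preserves the size
print(conv_output_size(5, 2))              # 4: the 5x5, 2x2 case (16 neurons)
```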
What represents the weighted sum of pixel values in a convolution operation?
The output of a neuron
The weights correspond to the kernel values.
How many neurons are needed for a 5×5 image with a 2×2 kernel and a stride of 1?
16 neurons
This produces a 4×4 output image.
Define receptive field in the context of convolutional neural networks.
The specific area of the image that a neuron responds to
Each neuron has its own region of interest.
What are simple cells and complex cells in the hierarchy of visual processing?
Simple cells respond to simple features and complex cells respond to compositions of those features
This hierarchy was posited by Hubel and Wiesel.
What is max pooling in convolutional neural networks?
An operation that outputs the largest pixel value from a region under the filter
Max pooling reduces image size and increases the receptive field.
What is the effect of max pooling on the size of an image?
Reduces the size of the image
For example, a 4×4 image with a 2×2 filter and stride of 2 results in a 2×2 image.
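The 4×4-to-2×2 example can be reproduced with a short sketch (illustrative code; the `max_pool` name is an assumption).

```python
# Max pooling: slide a window over the image and keep only the largest
# value in each region, shrinking the image.

def max_pool(image, size=2, stride=2):
    ih, iw = len(image), len(image[0])
    return [[max(image[r + i][c + j]
                 for i in range(size) for j in range(size))
             for c in range(0, iw - size + 1, stride)]
            for r in range(0, ih - size + 1, stride)]

img = [[1, 3, 2, 4],
       [5, 6, 7, 8],
       [9, 2, 1, 0],
       [3, 4, 5, 6]]
print(max_pool(img))  # [[6, 8], [9, 6]] -- the 4x4 image becomes 2x2
```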
What is the purpose of backpropagation in training neural networks?
To calculate gradients and update weights
This process minimizes the error between expected and actual outputs.
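The gradient-based update at the heart of backpropagation can be illustrated on a toy one-weight model (an assumed minimal setup, not LeNet itself; names and the learning rate are illustrative).

```python
# One neuron, one weight: y = w * x, squared error (y - target)^2.
# Each step computes the gradient of the error with respect to w
# and nudges w downhill.

def train_step(w, x, target, lr=0.1):
    y = w * x                    # forward pass
    grad = 2 * (y - target) * x  # d/dw of (y - target)^2
    return w - lr * grad         # gradient descent update

w = 0.0
for _ in range(50):
    w = train_step(w, x=1.0, target=3.0)
# w now approaches 3.0, driving the error toward zero
```

In a multi-layer network, backpropagation applies the chain rule to compute this same kind of gradient for every weight in every layer.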
What is the role of activation functions in neural networks?
To introduce non-linearity and enable backpropagation
Activation functions must be differentiable.
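Two common activations and their derivatives can be sketched directly; the sigmoid identity σ′(x) = σ(x)(1 − σ(x)) is standard, and the differentiability is what lets gradients flow backward through the network.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1.0 - s)   # sigma'(x) = sigma(x) * (1 - sigma(x))

def relu(x):
    return max(0.0, x)

def relu_grad(x):
    return 1.0 if x > 0 else 0.0  # flat (zero gradient) for negative inputs

print(sigmoid(0.0), sigmoid_grad(0.0))  # 0.5 0.25
print(relu(-2.0), relu_grad(3.0))       # 0.0 1.0
```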
What are hyperparameters in the context of neural networks?
Parameters not learned during training that influence performance
Examples include the number of layers, kernel sizes, and activation functions.
What was LeNet used for in the banking industry?
To read and recognize digits on checks
LeNet was one of the early successful applications of convolutional neural networks.
What does the output layer of a neural network for digit recognition typically consist of?
10 neurons
Each neuron corresponds to a digit from 0 to 9.
What happens during stochastic gradient descent?
A subset of images is used for each pass through the network
This is a method to optimize the learning process.
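The mini-batch idea can be sketched on a toy one-dimensional model (an assumed setup; the function name, batch size, and learning rate are illustrative).

```python
import random

# Stochastic gradient descent sketch: each pass updates the weight using
# only a small random subset (mini-batch) of the data, not the whole
# dataset. Toy model: y = w * x with squared-error loss.

def sgd(data, w=0.0, lr=0.05, batch_size=2, epochs=200, seed=0):
    rng = random.Random(seed)
    for _ in range(epochs):
        batch = rng.sample(data, batch_size)   # random subset per pass
        grad = sum(2 * (w * x - y) * x for x, y in batch) / batch_size
        w -= lr * grad
    return w

data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]    # generated by y = 2x
w = sgd(data)                                  # converges near 2.0
```

Averaging the gradient over a small batch makes each update cheap while still pointing roughly downhill.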
What was a significant limitation of convolutional neural networks in the 1990s?
Lack of general-purpose software to build CNNs
AT&T did not allow its software to be released as open source.
What algorithms were outperforming CNNs for low-resolution images in the 1990s?
Conventional techniques
CNNs were still not widely adopted due to skepticism in the computer vision community.
What hardware advancement in the 2000s significantly changed deep learning?
Graphical Processing Units (GPUs)
GPUs were originally designed for rendering 3D graphics but proved vital for deep learning tasks.
What dataset did Jürgen Schmidhuber’s team use to train multi-layer perceptrons?
MNIST images
They achieved low error rates of 0.35 percent using deep learning techniques.
What was the name of the first massive CNN built by Hinton’s lab?
AlexNet
AlexNet demonstrated that conventional methods for image recognition could not compete.
What was the purpose of vector maps in Hinton and Mnih’s research?
To teach neural networks how to label pixels in aerial images
Vector maps provided clear information about road locations.
What programming interface allowed GPUs to be used for general-purpose tasks?
CUDA
CUDA enabled engineers to perform tasks beyond just graphics acceleration.
What breakthrough did Hinton’s lab achieve in speech recognition?
Using CUDAMat to program deep neural networks
This demonstrated the versatility of GPUs in different machine learning tasks.
What was Sutskever’s view on the limitations of support vector machines (SVMs)?
The ceiling for SVMs is low compared to neural networks
He believed neural networks had a higher potential if trained correctly.
What was the ImageNet challenge introduced in 2010?
Train a system to categorize 1.2 million images into 1,000 categories
It aimed to advance the field of computer vision.
What was the top-5 error rate achieved by AlexNet in the ImageNet competition?
17 percent
This was significantly lower than the previous winners’ rates.
What activation function did AlexNet use for its neurons?
Rectified Linear Unit (ReLU)
This was a departure from the traditional sigmoid function.
What impact did deep neural networks have on various fields?
Revolutionized computer vision, natural language processing, machine translation, and more
Their applications are vast and continually growing.
What did Mikhail Belkin compare the current state of machine learning to in physics?
The emergence of quantum mechanics
He suggested that deep neural networks are leading to new theories in machine learning.