11 - The Eyes of a Machine Flashcards

1
Q

Who are the co-founders of the Department of Neurobiology at Harvard?

A

David Hubel and Torsten Wiesel

They were awarded the Nobel Prize in Physiology or Medicine in 1981.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What was the main focus of Hubel and Wiesel’s research?

A

The visual system of cats

Their work involved mapping the visual cortex and understanding neural responses to visual stimuli.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What invention did Hubel create in 1957 for recording neuron activity?

A

A tungsten electrode

This electrode was preferred over glass micropipettes and steel electrodes for its durability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What anesthetic was used on cats during Hubel and Wiesel’s experiments?

A

Intraperitoneal thiopental sodium

This was administered to keep the cats under anesthesia during the experiments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a ‘visual field’?

A

The region in front of us that our eyes are sensitive to

It changes as we move our eyes and focus on different objects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define ‘receptive field’.

A

The portion of the visual field that triggers a single neuron

The size of the receptive field can vary significantly among different neurons.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are retinal ganglion cells?

A

Neurons that monitor the image on the retina

They are the first layer of neurons receiving inputs from the retina.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the function of a ‘simple cell’ in the visual cortex?

A

It fires only when all its connected retinal ganglion cells fire together

Simple cells are sensitive to specific orientations of edges.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does a ‘complex cell’ do?

A

It fires in response to an edge regardless of its position in the receptive field

This indicates spatial or translational invariance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does ‘invariance’ refer to in the context of vision?

A

The ability to recognize stimuli regardless of their position or orientation

This includes translational and rotational invariance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a ‘hypercomplex cell’?

A

A cell that fires for an edge of a specific length and orientation

It can detect complex shapes and patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What was the name of the first neural network-based image recognition system?

A

Cognitron

Developed by Kunihiko Fukushima in 1975.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How does the learning algorithm in the cognitron work?

A

It strengthens synaptic connections based on neuron firing activity

This is akin to Hebbian learning principles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What limitation did the cognitron have?

A

It was not translation invariant

This means it recognized patterns differently based on their position in the visual field.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What architecture did the neocognitron adopt from Hubel and Wiesel’s work?

A

It includes S-cells and C-cells

These cells model simple and complex cells found in the visual cortex.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the role of S-cells in the neocognitron?

A

They respond to specific features like vertical edges

S-cells feed into C-cells to indicate edge presence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does the term ‘grandmother cell’ refer to?

A

A hypothetical neuron that fires when seeing a specific complex stimulus

This concept is often used to illustrate the idea of specific neural responses to familiar faces.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are C-cells in the context of the visual processing model?

A

C-cells respond to vertical edges in different patches of the visual field.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the role of S-cells in the visual processing architecture?

A

S-cells collate outputs from C-cells to detect edges anywhere in the visual field.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does translation invariance refer to in the neocognitron?

A

The ability to detect patterns regardless of their position or distortions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Who developed the neocognitron and what was its significance?

A

Fukushima developed the neocognitron, which advanced pattern recognition in machines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What was the main limitation of the neocognitron’s training algorithm?

A

It adjusted only the weights of the S-cells and was cumbersome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What breakthrough did Yann LeCun achieve in the field of neural networks?

A

He developed the convolutional neural network (CNN) using the backpropagation algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How did the meeting between LeCun and Fukushima impact the field?

A

Fukushima was surprised to learn that LeCun’s work was parallel to his own on the same topic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Who were the key figures in the debate about cognitive capabilities, and what were their positions?

A

Piaget believed in learning during development; Chomsky argued for innate capabilities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What was Papert’s stance during the Piaget-Chomsky debate?

A

Papert supported Piaget, arguing that Chomsky underestimated the role of learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is a perceptron?

A

An early type of artificial neural network used to analyze learning and cognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What was LeCun’s key realization regarding learning algorithms?

A

A learning algorithm should minimize an objective function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is the difference between a loss function and an objective function?

A

The objective function includes a regularizer to prevent overfitting, while the loss function does not.

30
Q

What is the purpose of a regularizer in machine learning?

A

To prevent overfitting by adding an extra term to the loss function.

31
Q

What was the focus of LeCun’s doctoral work?

A

Developing a learning algorithm related to backpropagation for multi-layer neural networks.

32
Q

What did LeCun present at a conference in 1985?

A

A poorly written paper in French on his learning algorithm.

33
Q

What significant software did LeCun and Bottou create?

A

SN, which later evolved into Lush, an early neural network simulation tool.

34
Q

What major dataset did LeCun work with at Bell Labs?

A

Images of handwritten digits from the U.S. Postal Service.

35
Q

What is the convolution operation in the context of image processing?

A

An operation that combines two functions to produce a third, particularly for feature extraction.

36
Q

What are kernels in convolutional neural networks?

A

Small matrices used to filter images and detect features such as edges.

37
Q

What is the effect of convolution on the size of an image?

A

Convolution typically reduces the size of the output image.

38
Q

What is the stride in convolution operations?

A

The number of pixels by which the kernel moves across the image during convolution.

39
Q

What is the formula for calculating the output image size after convolution?

A

((i - k) / s) + 1, where i is the input size, k is the kernel size, and s is the stride.

40
Q

True or False: The original image size remains the same after applying a convolution operation.

41
Q

What is the term used for the distance the kernel moves in a convolution operation?

A

Stride

The stride can affect the size of the output image.

42
Q

How is the output image size calculated in a convolution operation?

A

(( i - k ) / s ) + 1

Where i is the input image size, k is the kernel filter size, and s is the stride.

43
Q

What is padding in the context of convolutional networks?

A

Dummy pixels added around the input image

Padding can affect the output size and feature extraction.

44
Q

What is the output size when using a 3×3 kernel on a 28×28 image with a stride of 1?

A

26×26

This is calculated without padding.

45
Q

What represents the weighted sum of pixel values in a convolution operation?

A

The output of a neuron

The weights correspond to the kernel values.

46
Q

How many neurons are needed for a 5×5 image with a 2×2 kernel and a stride of 1?

A

16 neurons

This produces a 4×4 output image.

47
Q

Define receptive field in the context of convolutional neural networks.

A

The specific area of the image that a neuron responds to

Each neuron has its own region of interest.

48
Q

What are simple cells and complex cells in the hierarchy of visual processing?

A

Simple cells respond to simple features and complex cells respond to compositions of those features

This hierarchy was posited by Hubel and Wiesel.

49
Q

What is max pooling in convolutional neural networks?

A

An operation that outputs the largest pixel value from a region under the filter

Max pooling reduces image size and increases the receptive field.

50
Q

What is the effect of max pooling on the size of an image?

A

Reduces the size of the image

For example, a 4×4 image with a 2×2 filter and stride of 2 results in a 2×2 image.

51
Q

What is the purpose of backpropagation in training neural networks?

A

To calculate gradients and update weights

This process minimizes the error between expected and actual outputs.

52
Q

What is the role of activation functions in neural networks?

A

To introduce non-linearity and enable backpropagation

Activation functions must be differentiable.

53
Q

What are hyperparameters in the context of neural networks?

A

Parameters not learned during training that influence performance

Examples include the number of layers, kernel sizes, and activation functions.

54
Q

What was LeNet used for in the banking industry?

A

To read and recognize digits on checks

LeNet was one of the early successful applications of convolutional neural networks.

55
Q

What does the output layer of a neural network for digit recognition typically consist of?

A

10 neurons

Each neuron corresponds to a digit from 0 to 9.

56
Q

What happens during stochastic gradient descent?

A

A subset of images is used for each pass through the network

This is a method to optimize the learning process.

57
Q

What was a significant limitation of convolutional neural networks in the 1990s?

A

Lack of general-purpose software to build CNNs

AT&T did not allow the distribution of their software open source.

58
Q

What algorithms were outperforming CNNs for low-resolution images in the 1990s?

A

Conventional techniques

CNNs were still not widely adopted due to skepticism in the computer vision community.

59
Q

What hardware advancement in the 2000s significantly changed deep learning?

A

Graphical Processing Units (GPUs)

GPUs were originally designed for rendering 3D graphics but proved vital for deep learning tasks.

60
Q

What dataset did Jürgen Schmidhuber’s team use to train multi-layer perceptrons?

A

MNIST images

They achieved low error rates of 0.35 percent using deep learning techniques.

61
Q

What was the name of the first massive CNN built by Hinton’s lab?

A

AlexNet

AlexNet demonstrated that conventional methods for image recognition could not compete.

62
Q

What was the purpose of vector maps in Hinton and Mnih’s research?

A

To teach neural networks how to label pixels in aerial images

Vector maps provided clear information about road locations.

63
Q

What programming interface allowed GPUs to be used for general-purpose tasks?

A

CUDA

CUDA enabled engineers to perform tasks beyond just graphics acceleration.

64
Q

What breakthrough did Hinton’s lab achieve in speech recognition?

A

Using CUDAMat to program deep neural networks

This demonstrated the versatility of GPUs in different machine learning tasks.

65
Q

What was Sutskever’s view on the limitations of support vector machines (SVMs)?

A

The ceiling for SVMs is low compared to neural networks

He believed neural networks had a higher potential if trained correctly.

66
Q

What was the ImageNet challenge introduced in 2010?

A

Train a system to categorize 1.2 million images into 1,000 categories

It aimed to advance the field of computer vision.

67
Q

What was the top-5 error rate achieved by AlexNet in the ImageNet competition?

A

17 percent

This was significantly lower than the previous winners’ rates.

68
Q

What activation function did AlexNet use for its neurons?

A

Rectified Linear Unit (ReLU)

This was a departure from the traditional sigmoid function.

69
Q

What impact did deep neural networks have on various fields?

A

Revolutionized computer vision, natural language processing, machine translation, and more

Their applications are vast and continually growing.

70
Q

What did Mikhail Belkin compare the current state of machine learning to in physics?

A

The emergence of quantum mechanics

He suggested that deep neural networks are leading to new theories in machine learning.