2 - We Are All Just Numbers Here... Flashcards

Question 1

Q

Who was William Rowan Hamilton?

Answer

A

An Irish mathematician known for his work on quaternions.

Question 2

Q

What significant event happened on October 16, 1843?

Answer

A

Hamilton had a flash of inspiration for the quaternion formula while walking along the Royal Canal.

Question 3

Q

What is the fundamental formula for quaternion multiplication?

Answer

A

i² = j² = k² = ijk = -1.

Question 4

Q

What did Hamilton etch on the stone of Brougham Bridge?

Answer

A

The fundamental formula for quaternion multiplication.

Question 5

Q

Define a scalar quantity.

Answer

A

A stand-alone number that represents magnitude only.

Question 6

Q

Define a vector.

Answer

A

A quantity that has both magnitude and direction.

Question 7

Q

What are the components of a vector?

Answer

A

The x-component and y-component.

Question 8

Q

How can the magnitude of a vector be calculated?

Answer

A

Using the Pythagorean theorem: √(x² + y²).

Question 9

Q

What does Newton’s Second Law of Motion state?

Answer

A

Acceleration is proportional to the force acting on an object and they have the same direction.

Question 10

Q

What geometrical shape is used to represent vector addition?

Answer

A

A parallelogram.

Question 11

Q

What is the resultant vector in the example of a man walking from (0,0) to (6,9)?

Answer

A

The vector from (0,0) to (6,9).

Question 12

Q

What is the net distance in the xy coordinate space from the origin to (6,9)?

Answer

A

10.82 miles.

Question 13

Q

What happens when you subtract vectors?

Answer

A

It indicates if one force is acting against another.

Question 14

Q

What is the effect of multiplying a vector by a scalar?

Answer

A

It scales the vector’s magnitude.

Question 15

Q

Define a unit vector.

Answer

A

A vector with a magnitude of 1.

Question 16

Q

What is the dot product of two vectors?

Answer

A

The magnitude of one vector multiplied by the projection of another onto it.

Question 17

Q

What does a dot product of zero indicate?

Answer

A

The two vectors are orthogonal (at right angles).

Question 18

Q

How is the dot product calculated using vector components?

Answer

A

a.b = a1b1 + a2b2.

Question 19

Q

What is the significance of Hamilton’s work on quaternions for machine learning?

Answer

A

It laid foundational mathematical concepts important for vector analysis.

Question 20

Q

Fill in the blank: A ______ is a mathematical entity composed of four elements.

Answer

A

quaternion.

Question 21

Q

True or False: The magnitude of a vector can be negative.

Question 22

Q

What does the projection of one vector onto another represent?

Answer

A

The ‘shadow cast’ by one vector onto another.

Question 23

Q

What is the equation for the scalar quantity when dealing with vectors a and b?

Answer

A

a.b = a 1 b 1 + a 2 b 2

Question 24

Q

What do the vectors i and j represent in the context of dot products?

Answer

A

Orthogonal vectors, where i.j and j.i are zero, and both i.i and j.j equal 1

Question 25

Q

What does a perceptron output if the weighted sum of its inputs plus the bias term is greater than 0?

Question 26

Q

What is the output of a perceptron if the weighted sum is less than or equal to 0?

Question 27

Q

In the perceptron model, how can the weights be represented?

Answer

A

As a vector w = (w1, w2)

Question 28

Q

What geometrical concept does the perceptron use to separate data points into clusters?

Answer

A

A linearly separating hyperplane

Question 29

Q

What is the relationship between the weight vector w and the separating hyperplane?

Answer

A

The vector w is orthogonal to the hyperplane

Question 30

Q

What does the dot product of a data point vector and the weight vector indicate?

Answer

A

The distance of the data point from the hyperplane

Question 31

Q

What happens when a data point lies on the hyperplane?

Answer

A

The dot product with the weight vector equals zero

Question 32

Q

What is the significance of the bias term in a perceptron?

Answer

A

It moves the hyperplane away from the origin without changing its orientation

Question 33

Q

Fill in the blank: The perceptron learning algorithm guarantees to find one separating hyperplane, but not necessarily the _____ one.

Question 34

Q

What is the mathematical representation of a one-column matrix with two elements?

Answer

A

A column matrix indexed by numbers 1 and 2

Question 35

Q

What is the process of flipping a column matrix on its side called?

Answer

A

Taking the transpose of a matrix

Question 36

Q

What is the notation for the transpose of matrix A?

Question 37

Q

In the context of matrices, what is a vector?

Answer

A

A particular form of matrix with either one row or one column

Question 38

Q

What is the relationship between the number of columns in the first matrix and the number of rows in the second for taking a dot product?

Answer

A

They must be equal

Question 39

Q

How can the weighted sum of inputs in a perceptron be concisely written?

Answer

A

As the dot product w T x

Question 40

Q

What does the perceptron learn from a set of input data vectors?

Answer

A

The weight vector that represents a hyperplane separating the data into two clusters

Question 41

Q

What is the significance of the hyperplane in the context of classification?

Answer

A

It determines the classification of new data points based on their position relative to it

Question 42

Q

True or False: The perceptron can classify data points as ‘obese’ or ‘not-obese’ based on their position relative to the hyperplane.

Question 43

Q

What is the role of modern deep neural networks in relation to the perceptron?

Answer

A

They build upon the foundational concepts established by the perceptron

Question 44

Q

What is a perceptron learning algorithm?

Answer

A

A computationally viable algorithm for binary classification that involves finding a hyperplane to separate data into two groups.

Question 45

Q

What defines a ‘solution’ in the context of perceptrons?

Answer

A

A hyperplane that linearly separates the data into two groups.

Question 46

Q

Who developed a significant proof regarding the perceptron learning algorithm in 1962?

Answer

A

Henry David Block.

Question 47

Q

What did Block’s proof establish?

Answer

A

Upper bounds for the number of mistakes made by the perceptron learning algorithm.

Question 48

Q

What is the focus of Minsky and Papert’s book ‘Perceptrons’?

Answer

A

A class of computations that make decisions by weighing evidence.

Question 49

Q

What was a notable criticism made by Block in his review of ‘Perceptrons’?

Answer

A

He objected to Minsky and Papert’s implication that cyberneticists should have known about earlier convergence proofs.

Question 50

Q

What is the significance of the term ‘cybernetics’?

Answer

A

The study of control and communication in the animal and the machine.

Question 51

Q

What are the six variables used to categorize patients in the discussed pandemic scenario?

Answer

A

x1 = age
x2 = body mass index
x3 = has difficulty breathing (yes = 1/no = 0)
x4 = has fever (yes/no)
x5 = has diabetes (yes/no)
x6 = chest CT scan (0 = clear, 1 = mild infection, 2 = severe infection)

Question 52

Q

What does the outcome ‘y’ represent for each patient?

Answer

A

y = -1 (did not need ventilator support) or y = 1 (needed ventilator support).

Question 53

Q

What is the goal of training a perceptron in this context?

Answer

A

To find a separating hyperplane for the data points.

Question 54

Q

What is the first step in the perceptron training algorithm?

Answer

A

Initialize the weight vector to zero: set w = 0.

Question 55

Q

What condition necessitates updating the weight vector in the perceptron algorithm?

Answer

A

If y w^T x ≤ 0.

Question 56

Q

How does the perceptron determine if the weights are correct?

Answer

A

If the expression y w^T x is positive.

Question 57

Q

What does the convergence proof by Minsky and Papert establish?

Answer

A

The perceptron will converge to a solution in a finite number of steps if one exists.

Question 58

Q

What is the significance of the dot product of weight vectors during training?

Answer

A

It indicates how closely the weight vector aligns with the desired weight vector.

Question 59

Q

What does the term ‘XOR problem’ refer to in perceptrons?

Answer

A

A problem that cannot be solved by a single layer of perceptrons, as it cannot be linearly separated.

Question 60

Q

What is the relationship between lower and upper bounds in computational complexity?

Answer

A

Lower bounds indicate what is impossible, while upper bounds measure resource limits for solutions.

Question 61

Q

What does the weight vector w represent in the perceptron model?

Answer

A

The parameters that define the hyperplane separating the data.

Question 62

Q

Fill in the blank: The perceptron learning algorithm updates the weight vector by adding _______.

Question 63

Q

True or False: The perceptron algorithm guarantees a solution for all types of data.

Question 64

Q

What major assumption is made about the data in the context of perceptrons?

Answer

A

The data are linearly separable.

Answer 57

A

A simple type of artificial neuron used in machine learning

Answer 58

A

The XOR problem

Answer 59

A

(0, 0) * (1, 0) * (1, 1) * (0, 1)

Answer 60

A

Perceptrons stacked such that the output of one feeds into the input of another

Answer 61

A

Backpropagation

Answer 62

A

Calculus and optimization theory

Answer 63

A

A physicist’s unique solution to a biological problem re-energized the field

Answer 64

A

To find a linearly separating hyperplane

Answer 65

A

Initialize the weight vector to zero: set w = 0

Answer 66

A

If y w^T x ≤ 0

Answer 67

A

w_new = w_old + y x

Answer 68

A

The distance between the linear separating hyperplane and the closest data point

Answer 69

A

It grows by at least γ

Answer 70

A

It grows by at most 1

Answer 71

A

M is always a finite quantity

Answer 72

A

1 over γ²

Answer 73

A

A hyperplane that separates different classes of data points

Answer 74

A

It cannot solve problems like XOR that are not linearly separable

Answer 75

A

It ensures all input data points have magnitudes less than or equal to 1