ml intro Flashcards

pass exam

1
Q

The intersection of two convex sets is always convex.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The union of two convex sets is always convex.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The function f(x)=7+3x1+999x2 is linear.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A logistic function determines the threshold according to which the output of a linear model is classified.

A

FALSE Because YOU decide the threshold.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In order to predict y, x2 is more informative than x1.

A

FALSE It’s exactly the other way around: X1 is more informative. This is because:
I(x1|y)=1/4log(1/4/(1/4+1/2))+…+…+…=1/4+1/4+1/4+1/4=1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The mutual information between x1 and y is 1.

A

TRUE Because we have exactly 2 0 and 2 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The linear correlation coefficient between xi and yi may change if x values are centered by subtracting their mean value μ in the following manner: xi−μ

A

FALSE Centering does not affect the linear correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The entropy of a uniform probability distribution of n events is log2(n).

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The use of Chi-square test to deny statistical independence means that, for example, a term in a phrase should not be used as a feature because the square of the number of nearest neighbors is too large.

A

FALSE No relationship between Chi-square test and nearest neighbors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If the Pearson correlation coefficient between two data features is zero, then such features are independent.

A

FALSE Pearson correlation ahd Mutual information are not related. They measures two completely different things.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

If the Pearson correlation coefficient between two data features is zero, the Mutual Information between such features is also zero.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

If the number of input variables is 99, and one starts training from 55 different examples, the parameters of the linear model obtaining zero error on the examples can always be determined.

A

TRUE

The number of input variables need to be equal or greater than the number of examples for interpolation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The measurements of two phenomena are different in a statistically significant way if one can demonstrate in a theorem that the two measurements will never be equal.

A

FALSE

Because we only care of statistically significant results, not theorems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A result is statistically significant when it is obtained by democratic means, asking for the opinion of the largest possible number of experts.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

For every k>0k>0, in leave-one-out cross-validation, one of the k partitions is left out as validation data and the other partitions are used as training data.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Goodness functions take measurements as input. No

A

TRUE

17
Q

Measurements can be only numerical.

A

FALSE

18
Q

Standard mathematical optimization requires the
existence of goodness functions to be optimized,
and the de nition of these functions is usually
feasible in the real world.

A

FALSE

19
Q

Machine learning requires the de nition of a

goodness function to be optimized

A

FALSE

20
Q

Machine learning techniques can build good

models only if abundant data is available.

A

TRUE

21
Q

The goodness function to be optimized is Gold(x),
the quantity of gold extracted from a mine at
position x.

A

TRUE

22
Q

The measurements are the gold quantities

extracted in di􀃗erent points.

A

FALSE

23
Q

Kriging is an example of descriptive analytics. No

A

FALSE

24
Q

The model of the goodness function, built using
measurements, aims at generalizing the results
obtained during the experiments.

A

TRUE

25
Q

The output of the model of an unknown point is
the average of the known values of its neighbors,
weighted by the neighbors’ distance to the
unknown point.

A

TRUE

26
Q

Images are an example of structured data.

A

FALSE

27
Q

Text is an example of unstructured data.

A

TRUE

28
Q

Vectors are an example of structured data.

A

TRUE

29
Q

Comma Separated Values (CSV) are an example of structured data.

A

TRUE

30
Q

Predictive analytics goes further than descriptive

and prescriptive analytics.

A

FALSE

31
Q

Descriptive analytics involves the analysis of

historical data in order to provide useful insights.

A

TRUE

32
Q

Predictive analytics tries to anticipate the e􀃗ects of
decisions, by creating models based on historical
data

A

TRUE

33
Q

Prescriptive analytics uses optimization and
simulation algorithms to consider the e􀃗ect of
many possible decisions, in order to take the
decision that it is expected to optimize a certain
goodness function.

A

TRUE