Domande Flashcards

1
Q

According to the scientific visualization rules presented in class, is it possible to plot a graphical representation of the confidence level of one single figure of merit (like the accuracy) of your trained model?

  1. No, the confidence intervals data have different units and meaning and hence can not be represented in the same plot
  2. Yes, the confidence interval data have different units and meaning but they can be represented in the same plot using different visual attributes like “slope” and “area”
  3. Yes, the confidence interval data have the same units and meaning and they can be represented in the same plot
A

Yes, the confidence interval data have the same units and meaning and they can be represented in the same plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The number of parameters to be fixed during a complete training in a deep learning model like the VGGNet presented in the course is about:

  1. < 100000
  2. > 100 Million
  3. about 1 Million
  4. about 10 Million
A

> 100 Million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Considering the class discussing about the basic metrics in data similarity, given a vector A, vector B, a real number alpha, and the cosine metrics cos(A,B) it is possible to say that

  1. alpha * cos(A,B) = cos(alpha*A, B)
  2. cos(A,B) = cos(alphaA, alphaB)*
  3. cos(A,B) = cos(alphaA, B) = cos(A, alphaB)
  4. alpha * cos(A,B) = cos(alphaA, alphaB)
A

cos(A,B) = cos(alphaA, alphaB)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Referring to the class discussion on data leakage what is the worst situation?

  1. The unwanted leakage of data from training dataset to test data set
  2. None of the other options since transferring data from test and/or training dataset is normal when the accuracy of the model is tested
  3. The unwanted leakage of data from test dataset to training data set since you are subtracting data to the generalization test, making the situation more pessimistic
  4. The unwanted leakage of data from test dataset to training data set since you are subtracting data to the generalization test, making the situation more optimistic
A

The unwanted leakage of data from test dataset to training data set since you are subtracting data to the generalization test, making the situation more optimistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What task of an intelligent vision system is associated to following description: split or separate an image into regions using features, patterns and colors to facilitate recognition, understanding, and Region Of Interests (ROI) processing and measurements.

  1. Model training
  2. Post processing
  3. Enhancing
  4. Segmentation
  5. Feature engineering
A

Segmentation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

According to the class discussion, text prefiltering is often used as input for a neural network to deal with a large text input making the networks able to classifiy the input.

  1. True, using the hamming distance as prefilering
  2. True, using the cosine distance as prefilering
  3. True, using the string approximate match distance as prefilering
  4. True, using the discrete gradient descent as prefilering
  5. True, using the so-called “word embeddings” technique
A

True, using the so-called “word embeddings” technique

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

According to the class discussion, what is Greedy Layer-Wise Training?

  1. A supervised training step to improve auto-encoders
  2. An unsupervised training step to classical feedforward networks
  3. An unsupervised training step to improve auto-encoders
  4. A supervised training step to classical feedforward networks
A

An unsupervised training step to improve auto-encoders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The following activity: a) Data Selection; b) Data Filtering; c) Data Enhancing …

  1. Are part of the classical machine learning approaches and they are (correctly) used also in deep learning applications
  2. All the other options are correct*
  3. Are part of the job of the artificial intelligent specialist in normal activities
  4. Contribute to keep lower the complexity of the learning task
    a), b) and c) are extremely important in the final behavior of the trained model and the complexity of the training task
A

All the other options are correct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The Inception-v3 deep learning pretrained model discussed during the course is a model for

  1. Post processing
  2. None of the other options
  3. Segmentation
  4. Image enhancing
  5. Image classification*
A

Image classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Intelligent vision systems can achieve Semantic segmentation by

  1. A hybrid approach by blob detection to select candidate ROIs and then image classification of the ROIs
  2. A complete fully convolutional solution
  3. A hybrid approach by blob detection to select candidate ROIs and then image segmentation of the ROIs
  4. None of the other options
A

A complete fully convolutional solution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Considering the possible Intelligent Vision tasks which is the correct option?

  1. Instance Segmentation is more complex than Object Detection
  2. Instance Segmentation is less complex than Object Detection
  3. Instance Segmentation and Object Detection have a similar complexity
  4. The other otpions are not Intelligent Vision tasks
A

Instance Segmentation is more complex than Object Detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In a given picture ImmA you see 1 car and 5 people in a city background. Considering the Intelligent systems IS processing the image ImmA and producing in output the label “humans”, what Intelligent Vision task is performing?

  1. Image classification
  2. Instance segmentation
  3. Object detection
  4. Semantic segmentation
A

Image classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

According to the class discussion, considering the training of deep learning models on standard CPUs and standard commercial GPUs boards, what is the gain in training performance (time) and efficiency (energy) for a medium/large-size project?

  1. About 100x in performance and 10x in efficiency
  2. More than 100x in performance and more than 5x in efficiency
  3. About 10x in performance and 5x in efficiency
  4. About 2x in performance and 2x in efficiency
A

About 10x in performance and 5x in efficiency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A basic industrial setup for Intelligent vision systems is typically composed by the following elements

  1. Standard industrial smart camera with optics, external processing HW and SW units, illumination system
  2. Standard industrial camera with optics, illumination system
  3. Just a standard industrial camera with optics
  4. Standard industrial camera with optics, processing HW and SW units, illumination system
  5. Standard industrial camera with optics, processing HW and SW units
A

Standard industrial camera with optics, processing HW and SW units, illumination system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In agreement to the class discussion, what kind of labelling error is generally the worst case for the accuracy of the generalization of the model? ERR1 = Duplications with same labels, EER2 = Duplications with different labels

  1. ERR1 is equalt to EER2 by definition
  2. ERR2 is the worst case
  3. ERR1 is the worst case
  4. ERR1 is roughly equalt to EER2 in general
A

ERR2 is the worst case

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

According to the discussion presented in class about the data visualization, and considering the following steps of the design workflow 1) Get Data, 2) Clean Manipulate Data, 3) Train models, 4) Test Data, 5) Improve the design, which are the main step/steps where data visualization should be involved?

  1. # 2 and #5
  2. # 4
  3. # 3 and #5
  4. # 5
  5. # 1
A

2 and #5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

According to the class discussion, the convolution/correlation operations are of foundamental relevance for many deep learning models. What is the characteristic of the autocorrelation map produced by a generic image?

  1. It is not possible to create an autocorrelation map from one single images, two different images are needed
  2. None of the other options
  3. A flat and noisy central plateau
  4. An evident spike at the center with a very well defined maximum
A

An evident spike at the center with a very well defined maximum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Considering the class discussion about feature preprocessing/engineering, alogarithmic scaling to one feature values is typically applied in a case of

  1. A very large range in the values (>0)
  2. Input coming from the preprocessing of long texts
  3. Negative values
  4. Outliers presence
  5. Feature values are integer numbers
    Feedback
A

A very large range in the values (>0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

According to the notation used in class, which kind of a model is described by the equation
f(x) = sgn(w x + b)

  1. Liner regressor
  2. Soft-max neuron
  3. Liner classifier
  4. Sigmoidal neuron
  5. Gradient descent formula
  6. Number of the model’s parameters
A

Liner classifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

A tensor processing unit (TPU) is

  1. A part of a model of the Convolutional Neural Network used to process dedicated tensorial activation functions in the neurons
  2. An internal unit of the Arm processor architecture introduced to support 8-bit fixed-point matrix multiplication for deep learning models
  3. An AI accelerator application-specific integrated circuit (ASIC) and the related board developed specifically for neural network machine learning
  4. None of the other options
A

An AI accelerator application-specific integrated circuit (ASIC) and the related board developed specifically for neural network machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

You have a feature in your dataset with the following values F2 = [ -13 0 1 2 4 128 ], which normalization will give you the following F2_norm = [0 0 1 2 4 10 ]

  1. Z-score
  2. Min-MAX
  3. Clipping
  4. A different type of normalization
A

Clipping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

You have a feature in your dataset with the following values F2 = [ -13 0 1 2 4 128 ], which normalization will give you the following F2_norm = [0 0 1 2 4 10 ]

  1. Z-score
  2. Min-MAX
  3. Clipping
  4. A different type of normalization
A

Clipping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

The design of intelligent systems for Industry 4.0 applications should be compliant to the following main design principles.

  1. Interoperability, Information transparency, Improved technical assistance, Decentralized decisions
  2. Interoperability, Information transparency, Improved technical assistance
  3. Interoperability, Information transparency, Improved technical assistance, Wireless connectivity
  4. Interoperability, Information transparency, Decentralized decisions
A

La risposta corretta è: Interoperability, Information transparency, Improved technical assistance, Decentralized decisions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Machine Learning on CPUs offer the following advantages

  1. Ease of portability and use-case flexibility, Market availability at different performance and prices
  2. Ease of portability and use-case flexibility, Market availability at different performance and prices, Deployment across a wide spectrum of devices
  3. Ease of portability and use-case flexibility, Deployment across a wide spectrum of devices
  4. Market availability at different performance and prices, Deployment across a wide spectrum of devices
A

Ease of portability and use-case flexibility, Market availability at different performance and prices, Deployment across a wide spectrum of devices

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

The GoogLeNet deep learning pretrained model discussed during the course is model for

  1. Post processing
  2. None of the other options
  3. Image Enhancing
  4. Image classification
  5. Segmentation
A

Image classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Considering IoT devices as source of data for external intelligent systems (IS is not intended to be embedded into the IoT device), what kind of IoT devices can be really used?

  1. Passive data IoT devices
  2. Active data IoT devices
  3. Dynamic data IoT devices
  4. All of the above
  5. None of the above
A

All of the above

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Referring to the class discussion, the (correct) design practice for neural networks considers

  1. Start with deep learning models since they are the cutting edge and most advanced technology that we have now
  2. Start with deep learning models since they are the cutting edge and most advanced technology we have now, and then use classicals method as reference
  3. Start with simple neural networks before to consider deep learning models
A

Start with simple neural networks before to consider deep learning models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

The missing values can also be occupied by computing mean, mode or median of the observed given values.

  1. This is very unusual and not common in practice
  2. This is a very simple and effective solution in case the learning method is not capable to deal with missing data
  3. This is not possible, since that is just descriptive statistics about the features, and cannot be used to fill missing data
A

This is a very simple and effective solution in case the learning method is not capable to deal with missing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Referring to the class discussion on data leakage what is the worst situation?

  1. The unwanted leakage of data from test dataset to training data set
  2. The unwanted leakage of data from training dataset to test data set
  3. None of the above since transferring data from test and/or training dataset is normal when the accuracy of the model is tested
A

The unwanted leakage of data from test dataset to training data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

An additional information can allow the model to learn or know something that it otherwise would not know and in turn invalidate the estimated performance of the model being constructed. This is called:

  1. Data leakage
  2. Data pre-processing
  3. Data harmonization
  4. Data wrangling
A

Data leakage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

The degrees of freedom for a given problem are the number of independent problem variables which must be specified to uniquely determine a solution. Hence the #DoF is important to be considered

  1. To design the number of vectors in the learning dataset.
  2. To avoid overfitting problem in the model
  3. All the above
  4. None of the above
A

All the above

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

About the cosine metrics it is possible to say that:

  1. Two vectors with the same orientation have a cosine similarity of 1
  2. Two vectors oriented at 90° relative to each other have a similarity of 0
  3. All of the above
  4. None of the above
A

All of the above

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What similarity feature/features discussed in class offers/offer the property to allow a fast comparison based on a short 1D vector of elements or bits

  1. phash
  2. ahash
  3. All the above
  4. Cross-correlation
A

All the above

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

In agreement to the class discussion, which description better describes the design activity?

  1. Similarity in the dataset requires more space and processing time
  2. Similarity in the dataset can improve generalization
  3. Both of the above
  4. None of the above
A

Both of the above

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

In agreement to the class discussion, in a dataset of 1100 labelled images, the search for duplications is typically achieved…

  1. by manual exploration of the dataset for better results since the number of images is not critical
  2. by automatic iterations
A

by automatic iterations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

In agreement to the class discussion, what kind of labelling error is generally the worst case for the accuracy of the generalization of the model? ERR1 = Duplications with same labels EER2 = Duplications with different labels

  1. ERR1
  2. ERR2
  3. ERR1 = EE2
A

ERR2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

According to the class discussion, what is the characteristic of the self-correlation (𝑂 = 𝑥𝑐𝑜𝑟2(𝐴, 𝐴)) map produced by a generic image?

  1. A flat and noisy central plateau
  2. An evident spike at the center with a very well-defined maximum
  3. It is not possible to create an autocorrelation map from one single images, two different images are needed
A

An evident spike at the center with a very well-defined maximum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

According to the class discussion, about the relationship between the operation of cross-correlation and convolution it is possible to say that:

  1. They are very similar in meaning and mathematical expression
  2. Despite the mathematical expression is similar, the meaning and their use is completely different
  3. There is no specific relationship since they are different in meaning and mathematical expressions
A

They are very similar in meaning and mathematical expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

If your data set contains extreme outliers, it better to use as preprocessing

  1. Feature clipping
  2. Min-max normalization
  3. Z’ norm
A

Feature clipping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

A logarithmic scaling to one feature values is typically applied in a case of

  1. Outliers’ presence
  2. Negative values
  3. A very large range in the values (>0)
A

A very large range in the values (>0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

According to the scientific visualization rules presented in class, if you are plotting many figures of merit obtained by your trained neural network on a new dataset, which is the correct ranking of visual attributes to be used? Left: low accuracy Right: HIGH ACCURACY

  1. Color intensity > Hue > Length
  2. Area > Length > Hue
  3. Slope > Angle > Volume
  4. Hue > Area > Length
A

Hue > Area > Length

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

According to the scientific visualization rules presented in class, is it possible to plot a graphical representation of the confidence level of your figures of merit of your trained model?

  1. No, it is a statistical index with different units and meaning and hence cannot be represented in the same plot
  2. Yes, the confidence interval data have the same units and meaning, and they can be represented in the same plot
A

Yes, the confidence interval data have the same units and meaning, and they can be represented in the same plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

According to the discussion presented in class about the data visualization, and considering the following steps of the design workflow 1) Get Data, 2) Clean Manipulate Data, 3) Train models, 4) Test Data, 5) Improve the design, which are the main step/steps where data visualization should be involved?

  1. # 1
  2. # 5
  3. # 1 and #5
  4. # 2, #3 and #5
A

2, #3 and #5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

According to the discussion presented in class about the similarity, consider an image 𝐴(𝑥, 𝑦) with internal similarity (repetitions of patterns). What happens to the output of the self-cross correlation (𝑂 = 𝑥𝑐𝑜𝑟𝑟2(𝐴, 𝐴))

  1. It is not possible to apply the cross correlation to the same image
  2. Output O tends to be a flat plateau with one clear central peak
  3. Output O tends to have many peaks and one evident maximum
  4. Output O tends to have many equivalent peaks with the same maximum value
A

Output O tends to have many peaks and one evident maximum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Nowadays, the usage of classical feature extraction and data analysis methods is outdated since the capability of the recent deep learning models and methods made them obsolete and not more present in the common practice.

  1. True
  2. False
A

False

46
Q

Artificial Intelligence can be applied to the following sectors.

  1. Robotics
  2. Information Extraction
  3. All the above
A

All the above

47
Q

Artificial neural networks are capable to learn human biases.

  1. False: the achievable complexity of the artificial neural networks is so far from the complexity of the human brain to make impossible to mimic this characteristic
  2. False: human biases are not reproducible nor measurable
  3. True
A

True

48
Q

Recent artificial intelligence models can solve analogy puzzles.

  1. True
  2. False
A

True

49
Q

Considering the “Data knowledge spectrum plot” discussed in class, the minimum amount of data required is in the following case.

  1. No knowledge about the model generating the data is available
  2. A statistical model of the process is available
  3. A mathematical model of the process is available
A

A mathematical model of the process is available

50
Q

It is possible to think to the single datum in input to the neural network as a point in the “input space” of the model, even if the input is a single value, a N dimensional vector, or an image.

a. True
b. False

A

True

51
Q

It is correct to say the one of the key features of an intelligent artificial system is the capability to learn (even if only a limited sense) and/or get better in time.

  1. True
  2. False
A

True

52
Q

According to the Andries Engelbrecht definition of Computational intelligence what of
the following is not included?

  1. Artificial Neural Networks
  2. Evolutionary Computing
  3. Swarm Intelligence
  4. Artificial immune system e. Fuzzy Systems
  5. All the above are included
A

All the above are included

53
Q

According to the class discussion of the Gestalt capability, what of the following sentences is more correct?

  1. The Gestalt capability is a typical feature present by-design in the model of classical neural networks
  2. The Gestalt capability is a typical feature present by-design in the model of deep learning neural networks
  3. The Gestalt capability is a typical human feature not well (yet) mimicked in current artificial networks
A

The Gestalt capability is a typical human feature not well (yet) mimicked in current artificial networks

54
Q

The following activity: Data Selection, Data Filtering, Data Enhancing

  1. Are part of the job of the artificial intelligent specialist in normal activities
  2. Contribute to keep lower the complexity of the learning task
  3. All the above
  4. Are part of the classical machine learning approaches and they are (correctly) no longer used in deep learning applications
A

All the above

55
Q

The Mean Squared Error is typically present in what step of the design.

  1. Representation
  2. Evaluation
A

Evaluation

56
Q

Considering IoT devices as source of data for external intelligent systems (IS is not intended to be embedded into the IoT device), what kind of IoT devices can be really used?

  1. Passive data IoT devices
  2. Active data IoT devices
  3. Dynamic data IoT devices
  4. All of the above
  5. None of the above
A

All of the above

57
Q

Referring to the class discussion, the (correct) design practice for neural networks considers

  1. Start with deep learning models since they are the cutting edge and most advanced technology that we have now
  2. Start with deep learning models since they are the cutting edge and most advanced technology we have now, and then use classicals method as reference
  3. Start with simple neural networks before to consider deep learning models
A

Start with simple neural networks before to consider deep learning models

58
Q

The missing values can also be occupied by computing mean, mode or median of the observed given values.

  1. This is very unusual and not common in practice
  2. This is a very simple and effective solution in case the learning method is not capable to deal with missing data
  3. This is not possible, since that is just descriptive statistics about the features, and cannot be used to fill missing data
A

This is a very simple and effective solution in case the learning method is not capable to deal with missing data

59
Q

Referring to the class discussion on data leakage what is the worst situation?

  1. The unwanted leakage of data from test dataset to training data set
  2. The unwanted leakage of data from training dataset to test data set
  3. None of the above since transferring data from test and/or training dataset is normal when the accuracy of the model is tested
A

The unwanted leakage of data from test dataset to training data set

60
Q

An additional information can allow the model to learn or know something that it otherwise would not know and in turn invalidate the estimated performance of the model being constructed. This is called

a. Data leakage
b. Data pre-processing
c. Data harmonization
d. Data wrangling

A

Data leakage

61
Q

The degrees of freedom for a given problem are the number of independent problem variables which must be specified to uniquely determine a solution. Hence the #DoF is important to be considered

  1. To design the number of vectors in the learning dataset.
  2. To avoid overfitting problem in the model
  3. All the above
  4. None of the above
A

All the above

62
Q

About the cosine metrics it is possible to say that

  1. Two vectors with the same orientation have a cosine similarity of 1
  2. Two vectors oriented at 90° relative to each other have a similarity of 0
  3. All of the above
  4. None of the above
A

All of the above

63
Q

What similarity feature/features discussed in class offers/offer the property to allow a fast comparison based on a short 1D vector of elements or bits

  1. phash
  2. ahash
  3. All the above
  4. Cross-correlation
A

All the above

64
Q

In agreement to the class discussion, which description better describes the design
activity?

  1. Similarity in the dataset requires more space and processing time
  2. Similarity in the dataset can improve generalization
  3. Both of the above
  4. None of the above
A

Both of the above

65
Q

In agreement to the class discussion, in a dataset of 1100 labelled images, the search for duplications is typically achieved…

  1. by manual exploration of the dataset for better results since the number of images is not critical
  2. by automatic iterations
A

by automatic iterations

66
Q

In agreement to the class discussion, what kind of labelling error is generally the worst case for the accuracy of the generalization of the model? ERR1 = Duplications with same labels EER2 = Duplications with different labels

  1. ERR1
  2. ERR2
  3. ERR1 = EE2
A

ERR2

67
Q

According to the class discussion, about the relationship between the operation of cross-correlation and convolution it is possible to say that:

  1. They are very similar in meaning and mathematical expression
  2. Despite the mathematical expression is similar, the meaning and their use is completely different
  3. There is no specific relationship since they are different in meaning and mathematical expressions
A

They are very similar in meaning and mathematical expression

68
Q

According to the class discussion, what is the characteristic of the self-correlation (𝑂 = 𝑥𝑐𝑜𝑟2(𝐴, 𝐴)) map produced by a generic image?

  1. A flat and noisy central plateau
  2. An evident spike at the center with a very well-defined maximum
  3. It is not possible to create an autocorrelation map from one single images, two different images are needed
A

An evident spike at the center with a very well-defined maximum

69
Q

If your data set contains extreme outliers, it better to use as preprocessing

  1. Feature clipping
  2. Min-max normalization
  3. Z’ norm
A

Feature clipping

70
Q

A logarithmic scaling to one feature values is typically applied in a case of

  1. Outliers’ presence
  2. Negative values
  3. A very large range in the values (>0)
A

A very large range in the values (>0)

71
Q

According to the scientific visualization rules presented in class, if you are plotting many figures of merit obtained by your trained neural network on a new dataset,which is the correct ranking of visual attributes to be used? Left: low accuracy Right: HIGH ACCURACY

  1. Color intensity > Hue > Length
  2. Area > Length > Hue
  3. Slope > Angle > Volume
  4. Hue > Area > Length
A

Hue > Area > Length

72
Q

According to the scientific visualization rules presented in class, is it possible to plot a graphical representation of the confidence level of your figures of merit of your trained model?

  1. No, it is a statistical index with different units and meaning and hence cannot be represented in the same plot
  2. Yes, the confidence interval data have the same units and meaning, and they can be represented in the same plot
A

Yes, the confidence interval data have the same units and meaning, and they can be represented in the same plot

73
Q

According to the discussion presented in class about the data visualization, and considering the following steps of the design workflow 1) Get Data, 2) Clean Manipulate Data, 3) Train models, 4) Test Data, 5) Improve the design, which are the main step/steps where data visualization should be involved?

  1. # 1
  2. # 5
  3. # 1 and #5
  4. # 2, #3 and #5
A

2, #3 and #5

74
Q

According to the discussion presented in class about the similarity, consider an image 𝐴(𝑥, 𝑦) with internal similarity (repetitions of patterns). What happens to the output of the self-cross correlation (𝑂 = 𝑥𝑐𝑜𝑟𝑟2(𝐴, 𝐴))

  1. It is not possible to apply the cross correlation to the same image
  2. Output O tends to be a flat plateau with one clear central peak
  3. Output O tends to have many peaks and one evident maximum
  4. Output O tends to have many equivalent peaks with the same maximum
    value
A

Output O tends to have many peaks and one evident maximum

75
Q

You have a dataset X of 1000 samples and number of features F = 4 features. You want to reduce the number of features F to 2 for data visualization. According to the goal, consider the following options. OPTION A: Apply PCA to X and select only the first 2 Principal Components. OPTION B: Apply the Feedforward Feature Selection to X and select only the first 2 more relevant features.

  1. Option A is possible. Option B is possible.
  2. Option A is NOT possible. Option B is possible.
  3. Option A is possible. Option B is NOT possible.
  4. Option A is NOT possible. Option B is NOT possible.
A

Option A is possible. Option B is possible.

76
Q

You have a feature in your dataset with the following values F1 = [-5, 0, +5], which normalization will give you the following F1_norm = [0, 0.5, 1]

  1. Min-MAX
  2. Z-score
  3. Clipping
  4. A different type of normalization
A

Min-MAX

77
Q

According to the class discussion, in general for a given small dataset X, if you train a feed-forward neural models (of the same type) with an increasing number of neurons, which case is more probable?

  1. None of the below
  2. The training error and the validation will decrease indefinitely
  3. The training error will increase
  4. The validation error will decrease indefinitely
A

None of the below

78
Q

According to the class discussion, in a cross-validation single test, which train/test partition of the samples will provide the lower training error but the lower confidence in the test results?

  1. Training set = 99%, Test Set = 01%
  2. Training set = 75%, Test Set = 25%
  3. Training set = 50%, Test Set = 50%
  4. Training set = 25%, Test Set = 75%
  5. Training set = 01%, Test Set = 99%
A

Training set = 99%, Test Set = 01%

79
Q

According to the class discussion, what kind of activity can be performed on the test set?

  1. All the below
  2. Mean test error estimation
  3. Mean test error estimation and standard deviation
  4. Confusion matrix test
A

All the below

80
Q

According to the class discussion, what kind of activity can be performed on the train set?

  1. All the other options
  2. Design of the #of neurons
  3. Design of the #of layers
  4. Normalization
  5. PCA
A

All the other options

81
Q

According to the class discussion, where can be performed the feature engineering?

  1. Only on the train set
  2. Only on the test set
  3. On the train set and the test set
  4. Not on the train, not on the test set, but only on a different dataset.
A

Only on the train set

82
Q

A simple k-Fold Cross Validation procedure may

  1. Lead to disarranging the proportion of examples from each class in the test partitions
  2. Making impossible to process the test error
  3. Get stuck into one the local minima
  4. Produce severe overfitting
  5. None of the other answers
A

Lead to disarranging the proportion of examples from each class in the test partitions

83
Q

Which option is correct?

  1. From the confusion matrix is possible to process the classification error
  2. From the confusion matrix is possible to process the classification error and vice versa
  3. The confusion matrix is applicable only to binary classification systems
  4. The classification error is equal to the sum of the diagonal elements of the confusion matrix
A

From the confusion matrix is possible to process the classification error

84
Q

According to the notation used in class, which kind of a model is described by the
equation 𝑓(𝑥) = 𝑠𝑔𝑛(𝑤 ∙ 𝑥 + 𝑏)

  1. Liner classifier
  2. Liner regressor
  3. Soft-max neuron
  4. Sigmoidal neuron
  5. Gradient descent formula
  6. Number of the model’s parameter
A

Liner classifier

85
Q

According to the notation used in class, which kind of a classifier is better described by the following definition: “the output is the label produced by the most probable classifier”

  1. Bayes Optimal Classifier
  2. Supervised Classifier
  3. K-means
  4. None of the other options
A

Bayes Optimal Classifier

86
Q

According to the class discussion the kNN classifier, what kind of learning is it?

  1. Instance-based Learning
  2. Eager Learning
  3. Hard-limited Learning
  4. Unsupervised Clustering
  5. None of the other options
A

Instance-based Learning

87
Q

According to the class discussion, what is the classifier with the following properties: not based on neural techniques; it’s deterministic with no random initialization; perfect repeatability; a minimum number of parameters is needed; learning is very simple but effective; perfect explain ability

  1. kNN
  2. Linear classifier
  3. Decision Tree
  4. K-means
  5. None of the other options
A

kNN

88
Q

According to the class discussion on kNN classifiers about the k parameter and its relationship to regularization of the decision boundaries and the computational complexity, what is the correct option about larger values of k?

  1. More regularization and more complexity
  2. Less regularization and more complexity
  3. More regularization and less complexity
  4. Less regularization and less complexity
  5. The parameter k is not related to regularization and complexity
A

More regularization and more complexity

89
Q

According to the class discussion on PCA what is the correct option?

  1. PCA vectors are originating from the center of mass of the points
  2. All subsequent principal component vectors are orthogonal
  3. All the other options
A

All the other options

90
Q

According to the class discussion on PCA what is the correct option?

  1. All subsequent principal component vectors are orthogonal
  2. The variance of the data projection on the first PCA vectors is maximized
  3. All the other options
A

All the other options

91
Q

According to the class discussion about unsupervised learning, what is the method with the following properties: you need to specify the number of clusters k in advance, is unable to handle noisy data and outliers, it is not suitable to discover clusters with non-convex shapes

  1. K-means
  2. kNN
  3. Decision tree
  4. None of the other options
A

K-means

92
Q

According to the class discussion, considering the equation of the backpropagation in
a feedforward neural network of weight 𝑤!” connected to the following output
neuron 𝑘, which is the missing term? 𝐷𝐸𝐿𝑇𝐴𝑊 = ? ∗ 𝑦 ∗ 𝑑𝑒𝑙𝑡𝑎

  1. ??? = alfa (the regularization term < 1)
  2. ??? = alfa (the regularization term > 1)
  3. ??? = x_j (the input vector)
  4. ??? = x_j (the input vector error)
A

??? = alfa (the regularization term < 1)

93
Q

According to the class discussion, considering a general CNN architecture, what is the sequence of modules which is more likely

  1. Input layer → Convolution → Relu → Max Pooling → Softmax → Output layer
  2. Input layer → Relu → Convolution → Max Pooling → Softmax → Output layer
  3. Input layer → Relu → Max Pooling Convolution → Softmax → Output layer
  4. Input layer → Relu → Max Pooling → Softmax → Convolution → Output layer
A

Input layer → Convolution → Relu → Max Pooling → Softmax → Output layer

94
Q

According to the class discussion, considering a standard intelligent vision system, which capability can be processed onboard on a recent smart industrial camera?

  1. Segmentation
  2. Segmentation, Measurement
  3. Segmentation, Measurement, Classification with trained non-deep models
  4. Segmentation, Measurement, Classification with trained deep models
  5. Segmentation, Measurement, Classification with trained deep models and training of deep models
A

Segmentation, Measurement, Classification with trained non-deep models

95
Q

According to the class discussion, Traditional Segmentation methods are quite useful to produce blobs or object candidates to be further processed by deep models for classification or measurements. Traditional Segmentation methods can be partitioned in

  1. Global knowledge, Edge-based
  2. Edge-based, Region-based
  3. Global knowledge, Edge-based, Region-based
  4. None of the other options
A

Global knowledge, Edge-based, Region-based

96
Q

According to the class discussion referred to edge computing, is it possible to process images with trained deep learning models on external small, dedicated devices connect via USB connection?

  1. True: the usage of dedicated processors and the USB bandwidth make this option possible
  2. False: the USB bandwidth make this option not possible
  3. False: the needed computational complexity needed to run trained deep learning models make this option not possible
  4. False: the bandwidth and the computational complexity need to process images with trained deep learning model is not adequate
A

True: the usage of dedicated processors and the USB bandwidth make this option possible

97
Q

According to the class discussion what is Greedy Layer-Wise Training?

  1. A supervised training step to improve auto-encoders
  2. A supervised training step to classical feedforward networks
  3. An unsupervised training step to classical feedforward networks
  4. An unsupervised training step to improve auto-encoders
A

An unsupervised training step to improve auto-encoders

98
Q

An AI model is processing an input RGB image to evaluate the age expressed in years of the face present in the image. What kind of model is it?

  1. Classifier model
  2. Regressor model
  3. Clustering model
  4. Reinforced Learning model
  5. Non of the above
A

Regressor model

99
Q

Recent artificial intelligence models can solve analogy puzzles like “Paris is to France as Tokyo is to?” producing the correct answer “Japan”

  1. True
  2. False
A

True

100
Q

According to class discussion the theory of intelligent system should include the following designing steps

  1. Representation
  2. Representation, Evaluation
  3. Representation, Evaluation, Optimization
  4. None of the other option
A

Representation, Evaluation, Optimization

101
Q

Cluster always requires a supervision dataset

  1. Yes
  2. No
A

No

102
Q

According to the class discussion, using a black box solution is

  1. Bad practice for a ML designer
  2. Can be used under specific circumstances
  3. Since all state of the art models tend to be quite large and un-explainable, it is current good practice to adopt the black box approach since you get the best models
A

Can be used under specific circumstances

103
Q

According to the class discussion to the classification systems and their decision boundaries, it is possible in general to optimize during the training/opitimizations step

  1. The accuracy
  2. The margin
  3. Both
A

Both

104
Q

According to the class discussion about AI regulation in EU, the regulation approach is based

  1. List of use cases
  2. Risk assessment of the application
  3. Both
  4. None of the above
A

Both

105
Q

According to the discussion presented in class the EU regulatory framework for AI is

  1. Mainly focused on public services
  2. Mainly focused on health-related applications
  3. Mainly focused on data privacy
  4. None of the other options [since it is quite larger]
A

None of the other options [since it is quite larger]

106
Q

Considering the “Data knowledge spectrum plot discussed in class, which of the following cases is correct when just some parameters of the mathematical model must be tuned/fitted?

  1. When the model of the process generating the data is is available and a huge quantity of data is required to traing the model
  2. When the model of the process generating the data is is available and a limited quantity of data is required to fit the parameters
  3. When no a-priori information is available and a huge quantity of data is required to traing the model
  4. When no a-priori information is available and a limited quantity of data is required to traing the model
A

When the model of the process generating the data is is available and a limited quantity of data is required to fit the parameters

107
Q

Numero di parametri vgg16

A

?? > 1 milione

108
Q

Vantaggio della CPU nel ML

A

?? Tutti

109
Q

Una domanda ad esercizio dove avevamo come risultato un vettore di valori e bisognava indicare quale strategia di normalizzazione è stata applicata, solo che i valori del vettore erano diversi da quelli della simulazione. Erano tipo (0,0,5,10). Io ho messo clipping.

A

??

110
Q

Una domanda dove chiedeva se un tot di tecniche erano tecniche di intelligent visual system: lo erano tutte.

A

??

111
Q

Domanda su cosa fossero vgg, inceptiov3 etc… Ho risposto image classifier

A

??