Domande Flashcards by Noemi Ossola

According to the scientific visualization rules presented in class, is it possible to plot a graphical representation of the confidence level of one single figure of merit (like the accuracy) of your trained model?

No, the confidence intervals data have different units and meaning and hence can not be represented in the same plot
Yes, the confidence interval data have different units and meaning but they can be represented in the same plot using different visual attributes like “slope” and “area”
Yes, the confidence interval data have the same units and meaning and they can be represented in the same plot

Yes, the confidence interval data have the same units and meaning and they can be represented in the same plot

How well did you know this?

Not at all

Perfectly

The number of parameters to be fixed during a complete training in a deep learning model like the VGGNet presented in the course is about:

< 100000
> 100 Million
about 1 Million
about 10 Million

> 100 Million

How well did you know this?

Not at all

Perfectly

Considering the class discussing about the basic metrics in data similarity, given a vector A, vector B, a real number alpha, and the cosine metrics cos(A,B) it is possible to say that

alpha * cos(A,B) = cos(alpha*A, B)
cos(A,B) = cos(alphaA, alphaB)*
cos(A,B) = cos(alphaA, B) = cos(A, alphaB)
alpha * cos(A,B) = cos(alphaA, alphaB)

cos(A,B) = cos(alphaA, alphaB)

How well did you know this?

Not at all

Perfectly

Referring to the class discussion on data leakage what is the worst situation?

The unwanted leakage of data from training dataset to test data set
None of the other options since transferring data from test and/or training dataset is normal when the accuracy of the model is tested
The unwanted leakage of data from test dataset to training data set since you are subtracting data to the generalization test, making the situation more pessimistic
The unwanted leakage of data from test dataset to training data set since you are subtracting data to the generalization test, making the situation more optimistic

The unwanted leakage of data from test dataset to training data set since you are subtracting data to the generalization test, making the situation more optimistic

How well did you know this?

Not at all

Perfectly

What task of an intelligent vision system is associated to following description: split or separate an image into regions using features, patterns and colors to facilitate recognition, understanding, and Region Of Interests (ROI) processing and measurements.

Model training
Post processing
Enhancing
Segmentation
Feature engineering

Segmentation

How well did you know this?

Not at all

Perfectly

According to the class discussion, text prefiltering is often used as input for a neural network to deal with a large text input making the networks able to classifiy the input.

True, using the hamming distance as prefilering
True, using the cosine distance as prefilering
True, using the string approximate match distance as prefilering
True, using the discrete gradient descent as prefilering
True, using the so-called “word embeddings” technique

True, using the so-called “word embeddings” technique

How well did you know this?

Not at all

Perfectly

According to the class discussion, what is Greedy Layer-Wise Training?

A supervised training step to improve auto-encoders
An unsupervised training step to classical feedforward networks
An unsupervised training step to improve auto-encoders
A supervised training step to classical feedforward networks

An unsupervised training step to improve auto-encoders

How well did you know this?

Not at all

Perfectly

The following activity: a) Data Selection; b) Data Filtering; c) Data Enhancing …

Are part of the classical machine learning approaches and they are (correctly) used also in deep learning applications
All the other options are correct*
Are part of the job of the artificial intelligent specialist in normal activities
Contribute to keep lower the complexity of the learning task
a), b) and c) are extremely important in the final behavior of the trained model and the complexity of the training task

All the other options are correct

How well did you know this?

Not at all

Perfectly

The Inception-v3 deep learning pretrained model discussed during the course is a model for

Post processing
None of the other options
Segmentation
Image enhancing
Image classification*

Image classification

How well did you know this?

Not at all

Perfectly

Intelligent vision systems can achieve Semantic segmentation by

A hybrid approach by blob detection to select candidate ROIs and then image classification of the ROIs
A complete fully convolutional solution
A hybrid approach by blob detection to select candidate ROIs and then image segmentation of the ROIs
None of the other options

A complete fully convolutional solution

How well did you know this?

Not at all

Perfectly

Considering the possible Intelligent Vision tasks which is the correct option?

Instance Segmentation is more complex than Object Detection
Instance Segmentation is less complex than Object Detection
Instance Segmentation and Object Detection have a similar complexity
The other otpions are not Intelligent Vision tasks

Instance Segmentation is more complex than Object Detection

How well did you know this?

Not at all

Perfectly

In a given picture ImmA you see 1 car and 5 people in a city background. Considering the Intelligent systems IS processing the image ImmA and producing in output the label “humans”, what Intelligent Vision task is performing?

Image classification
Instance segmentation
Object detection
Semantic segmentation

Image classification

How well did you know this?

Not at all

Perfectly

According to the class discussion, considering the training of deep learning models on standard CPUs and standard commercial GPUs boards, what is the gain in training performance (time) and efficiency (energy) for a medium/large-size project?

About 100x in performance and 10x in efficiency
More than 100x in performance and more than 5x in efficiency
About 10x in performance and 5x in efficiency
About 2x in performance and 2x in efficiency

About 10x in performance and 5x in efficiency

How well did you know this?

Not at all

Perfectly

A basic industrial setup for Intelligent vision systems is typically composed by the following elements

Standard industrial smart camera with optics, external processing HW and SW units, illumination system
Standard industrial camera with optics, illumination system
Just a standard industrial camera with optics
Standard industrial camera with optics, processing HW and SW units, illumination system
Standard industrial camera with optics, processing HW and SW units

Standard industrial camera with optics, processing HW and SW units, illumination system

How well did you know this?

Not at all

Perfectly

In agreement to the class discussion, what kind of labelling error is generally the worst case for the accuracy of the generalization of the model? ERR1 = Duplications with same labels, EER2 = Duplications with different labels

ERR1 is equalt to EER2 by definition
ERR2 is the worst case
ERR1 is the worst case
ERR1 is roughly equalt to EER2 in general

ERR2 is the worst case

How well did you know this?

Not at all

Perfectly

According to the discussion presented in class about the data visualization, and considering the following steps of the design workflow 1) Get Data, 2) Clean Manipulate Data, 3) Train models, 4) Test Data, 5) Improve the design, which are the main step/steps where data visualization should be involved?

# 2 and #5
# 4
# 3 and #5
# 5
# 1

2 and #5

How well did you know this?

Not at all

Perfectly

According to the class discussion, the convolution/correlation operations are of foundamental relevance for many deep learning models. What is the characteristic of the autocorrelation map produced by a generic image?

It is not possible to create an autocorrelation map from one single images, two different images are needed
None of the other options
A flat and noisy central plateau
An evident spike at the center with a very well defined maximum

An evident spike at the center with a very well defined maximum

How well did you know this?

Not at all

Perfectly

Considering the class discussion about feature preprocessing/engineering, alogarithmic scaling to one feature values is typically applied in a case of

A very large range in the values (>0)
Input coming from the preprocessing of long texts
Negative values
Outliers presence
Feature values are integer numbers
Feedback

A very large range in the values (>0)

How well did you know this?

Not at all

Perfectly

According to the notation used in class, which kind of a model is described by the equation
f(x) = sgn(w x + b)

Liner regressor
Soft-max neuron
Liner classifier
Sigmoidal neuron
Gradient descent formula
Number of the model’s parameters

Liner classifier

How well did you know this?

Not at all

Perfectly

A tensor processing unit (TPU) is

A part of a model of the Convolutional Neural Network used to process dedicated tensorial activation functions in the neurons
An internal unit of the Arm processor architecture introduced to support 8-bit fixed-point matrix multiplication for deep learning models
An AI accelerator application-specific integrated circuit (ASIC) and the related board developed specifically for neural network machine learning
None of the other options

An AI accelerator application-specific integrated circuit (ASIC) and the related board developed specifically for neural network machine learning

How well did you know this?

Not at all

Perfectly

You have a feature in your dataset with the following values F2 = [ -13 0 1 2 4 128 ], which normalization will give you the following F2_norm = [0 0 1 2 4 10 ]

Z-score
Min-MAX
Clipping
A different type of normalization

Clipping

How well did you know this?

Not at all

Perfectly

You have a feature in your dataset with the following values F2 = [ -13 0 1 2 4 128 ], which normalization will give you the following F2_norm = [0 0 1 2 4 10 ]

Z-score
Min-MAX
Clipping
A different type of normalization

Clipping

How well did you know this?

Not at all

Perfectly

The design of intelligent systems for Industry 4.0 applications should be compliant to the following main design principles.

Interoperability, Information transparency, Improved technical assistance, Decentralized decisions
Interoperability, Information transparency, Improved technical assistance
Interoperability, Information transparency, Improved technical assistance, Wireless connectivity
Interoperability, Information transparency, Decentralized decisions

La risposta corretta è: Interoperability, Information transparency, Improved technical assistance, Decentralized decisions

How well did you know this?

Not at all

Perfectly

Machine Learning on CPUs offer the following advantages

Ease of portability and use-case flexibility, Market availability at different performance and prices
Ease of portability and use-case flexibility, Market availability at different performance and prices, Deployment across a wide spectrum of devices
Ease of portability and use-case flexibility, Deployment across a wide spectrum of devices
Market availability at different performance and prices, Deployment across a wide spectrum of devices

Ease of portability and use-case flexibility, Market availability at different performance and prices, Deployment across a wide spectrum of devices

How well did you know this?

Not at all

Perfectly

The GoogLeNet deep learning pretrained model discussed during the course is model for 1. Post processing 2. None of the other options 3. Image Enhancing 4. Image classification 5. Segmentation

Image classification

Considering IoT devices as source of data for external intelligent systems (IS is not intended to be embedded into the IoT device), what kind of IoT devices can be really used? 1. Passive data IoT devices 2. Active data IoT devices 3. Dynamic data IoT devices 4. All of the above 5. None of the above

All of the above

Referring to the class discussion, the (correct) design practice for neural networks considers 1. Start with deep learning models since they are the cutting edge and most advanced technology that we have now 2. Start with deep learning models since they are the cutting edge and most advanced technology we have now, and then use classicals method as reference 3. Start with simple neural networks before to consider deep learning models

Start with simple neural networks before to consider deep learning models

The missing values can also be occupied by computing mean, mode or median of the observed given values. 1. This is very unusual and not common in practice 2. This is a very simple and effective solution in case the learning method is not capable to deal with missing data 3. This is not possible, since that is just descriptive statistics about the features, and cannot be used to fill missing data

This is a very simple and effective solution in case the learning method is not capable to deal with missing data

Referring to the class discussion on data leakage what is the worst situation? 1. The unwanted leakage of data from test dataset to training data set 2. The unwanted leakage of data from training dataset to test data set 3. None of the above since transferring data from test and/or training dataset is normal when the accuracy of the model is tested

The unwanted leakage of data from test dataset to training data set

An additional information can allow the model to learn or know something that it otherwise would not know and in turn invalidate the estimated performance of the model being constructed. This is called: 1. Data leakage 2. Data pre-processing 3. Data harmonization 4. Data wrangling

Data leakage

The degrees of freedom for a given problem are the number of independent problem variables which must be specified to uniquely determine a solution. Hence the #DoF is important to be considered 1. To design the number of vectors in the learning dataset. 2. To avoid overfitting problem in the model 3. All the above 4. None of the above

All the above

About the cosine metrics it is possible to say that: 1. Two vectors with the same orientation have a cosine similarity of 1 2. Two vectors oriented at 90° relative to each other have a similarity of 0 3. All of the above 4. None of the above

All of the above

What similarity feature/features discussed in class offers/offer the property to allow a fast comparison based on a short 1D vector of elements or bits 1. phash 2. ahash 3. All the above 4. Cross-correlation

All the above

In agreement to the class discussion, which description better describes the design activity? 1. Similarity in the dataset requires more space and processing time 2. Similarity in the dataset can improve generalization 3. Both of the above 4. None of the above

Both of the above

In agreement to the class discussion, in a dataset of 1100 labelled images, the search for duplications is typically achieved... 1. by manual exploration of the dataset for better results since the number of images is not critical 2. by automatic iterations

by automatic iterations

ERR2

According to the class discussion, what is the characteristic of the self-correlation (𝑂 = 𝑥𝑐𝑜𝑟2(𝐴, 𝐴)) map produced by a generic image? 1. A flat and noisy central plateau 2. An evident spike at the center with a very well-defined maximum 3. It is not possible to create an autocorrelation map from one single images, two different images are needed

An evident spike at the center with a very well-defined maximum

According to the class discussion, about the relationship between the operation of cross-correlation and convolution it is possible to say that: 1. They are very similar in meaning and mathematical expression 2. Despite the mathematical expression is similar, the meaning and their use is completely different 3. There is no specific relationship since they are different in meaning and mathematical expressions

They are very similar in meaning and mathematical expression

If your data set contains extreme outliers, it better to use as preprocessing 1. Feature clipping 2. Min-max normalization 3. Z’ norm

Feature clipping

A logarithmic scaling to one feature values is typically applied in a case of 1. Outliers’ presence 2. Negative values 3. A very large range in the values (>0)

A very large range in the values (>0)

According to the scientific visualization rules presented in class, if you are plotting many figures of merit obtained by your trained neural network on a new dataset, which is the correct ranking of visual attributes to be used? Left: low accuracy Right: HIGH ACCURACY 1. Color intensity > Hue > Length 2. Area > Length > Hue 3. Slope > Angle > Volume 4. Hue > Area > Length

Hue > Area > Length

According to the scientific visualization rules presented in class, is it possible to plot a graphical representation of the confidence level of your figures of merit of your trained model? 1. No, it is a statistical index with different units and meaning and hence cannot be represented in the same plot 2. Yes, the confidence interval data have the same units and meaning, and they can be represented in the same plot

Yes, the confidence interval data have the same units and meaning, and they can be represented in the same plot

#2, #3 and #5

According to the discussion presented in class about the similarity, consider an image 𝐴(𝑥, 𝑦) with internal similarity (repetitions of patterns). What happens to the output of the self-cross correlation (𝑂 = 𝑥𝑐𝑜𝑟𝑟2(𝐴, 𝐴)) 1. It is not possible to apply the cross correlation to the same image 2. Output O tends to be a flat plateau with one clear central peak 3. Output O tends to have many peaks and one evident maximum 4. Output O tends to have many equivalent peaks with the same maximum value

Output O tends to have many peaks and one evident maximum

Nowadays, the usage of classical feature extraction and data analysis methods is outdated since the capability of the recent deep learning models and methods made them obsolete and not more present in the common practice. 1. True 2. False

False

Artificial Intelligence can be applied to the following sectors. 1. Robotics 2. Information Extraction 3. All the above

All the above

Artificial neural networks are capable to learn human biases. 1. False: the achievable complexity of the artificial neural networks is so far from the complexity of the human brain to make impossible to mimic this characteristic 2. False: human biases are not reproducible nor measurable 3. True

True

Recent artificial intelligence models can solve analogy puzzles. 1. True 2. False

True

Considering the “Data knowledge spectrum plot” discussed in class, the minimum amount of data required is in the following case. 1. No knowledge about the model generating the data is available 2. A statistical model of the process is available 3. A mathematical model of the process is available

A mathematical model of the process is available

It is possible to think to the single datum in input to the neural network as a point in the “input space” of the model, even if the input is a single value, a N dimensional vector, or an image. a. True b. False

True

It is correct to say the one of the key features of an intelligent artificial system is the capability to learn (even if only a limited sense) and/or get better in time. 1. True 2. False

True

According to the Andries Engelbrecht definition of Computational intelligence what of the following is not included? 1. Artificial Neural Networks 2. Evolutionary Computing 3. Swarm Intelligence 4. Artificial immune system e. Fuzzy Systems 5. All the above are included

All the above are included

According to the class discussion of the Gestalt capability, what of the following sentences is more correct? 1. The Gestalt capability is a typical feature present by-design in the model of classical neural networks 2. The Gestalt capability is a typical feature present by-design in the model of deep learning neural networks 3. The Gestalt capability is a typical human feature not well (yet) mimicked in current artificial networks

The Gestalt capability is a typical human feature not well (yet) mimicked in current artificial networks

The following activity: Data Selection, Data Filtering, Data Enhancing 1. Are part of the job of the artificial intelligent specialist in normal activities 2. Contribute to keep lower the complexity of the learning task 3. All the above 4. Are part of the classical machine learning approaches and they are (correctly) no longer used in deep learning applications

All the above

The Mean Squared Error is typically present in what step of the design. 1. Representation 2. Evaluation

Evaluation

All of the above

Start with simple neural networks before to consider deep learning models

This is a very simple and effective solution in case the learning method is not capable to deal with missing data

The unwanted leakage of data from test dataset to training data set

An additional information can allow the model to learn or know something that it otherwise would not know and in turn invalidate the estimated performance of the model being constructed. This is called a. Data leakage b. Data pre-processing c. Data harmonization d. Data wrangling

Data leakage

All the above

About the cosine metrics it is possible to say that 1. Two vectors with the same orientation have a cosine similarity of 1 2. Two vectors oriented at 90° relative to each other have a similarity of 0 3. All of the above 4. None of the above

All of the above

All the above

Both of the above

by automatic iterations

ERR2

They are very similar in meaning and mathematical expression

An evident spike at the center with a very well-defined maximum

If your data set contains extreme outliers, it better to use as preprocessing 1. Feature clipping 2. Min-max normalization 3. Z’ norm

Feature clipping

A logarithmic scaling to one feature values is typically applied in a case of 1. Outliers’ presence 2. Negative values 3. A very large range in the values (>0)

A very large range in the values (>0)

According to the scientific visualization rules presented in class, if you are plotting many figures of merit obtained by your trained neural network on a new dataset,which is the correct ranking of visual attributes to be used? Left: low accuracy Right: HIGH ACCURACY 1. Color intensity > Hue > Length 2. Area > Length > Hue 3. Slope > Angle > Volume 4. Hue > Area > Length

Hue > Area > Length

Yes, the confidence interval data have the same units and meaning, and they can be represented in the same plot

#2, #3 and #5

Output O tends to have many peaks and one evident maximum

You have a dataset X of 1000 samples and number of features F = 4 features. You want to reduce the number of features F to 2 for data visualization. According to the goal, consider the following options. OPTION A: Apply PCA to X and select only the first 2 Principal Components. OPTION B: Apply the Feedforward Feature Selection to X and select only the first 2 more relevant features. 1. Option A is possible. Option B is possible. 2. Option A is NOT possible. Option B is possible. 3. Option A is possible. Option B is NOT possible. 4. Option A is NOT possible. Option B is NOT possible.

Option A is possible. Option B is possible.

You have a feature in your dataset with the following values F1 = [-5, 0, +5], which normalization will give you the following F1_norm = [0, 0.5, 1] 1. Min-MAX 2. Z-score 3. Clipping 4. A different type of normalization

Min-MAX

According to the class discussion, in general for a given small dataset X, if you train a feed-forward neural models (of the same type) with an increasing number of neurons, which case is more probable? 1. None of the below 2. The training error and the validation will decrease indefinitely 3. The training error will increase 3. The validation error will decrease indefinitely

None of the below

According to the class discussion, in a cross-validation single test, which train/test partition of the samples will provide the lower training error but the lower confidence in the test results? 1. Training set = 99%, Test Set = 01% 2. Training set = 75%, Test Set = 25% 3. Training set = 50%, Test Set = 50% 4. Training set = 25%, Test Set = 75% 5. Training set = 01%, Test Set = 99%

Training set = 99%, Test Set = 01%

According to the class discussion, what kind of activity can be performed on the test set? 1. All the below 2. Mean test error estimation 3. Mean test error estimation and standard deviation 4. Confusion matrix test

All the below

According to the class discussion, what kind of activity can be performed on the train set? 1. All the other options 2. Design of the #of neurons 3. Design of the #of layers 4. Normalization 5. PCA

All the other options

According to the class discussion, where can be performed the feature engineering? 1. Only on the train set 2. Only on the test set 3. On the train set and the test set 4. Not on the train, not on the test set, but only on a different dataset.

Only on the train set

A simple k-Fold Cross Validation procedure may 1. Lead to disarranging the proportion of examples from each class in the test partitions 2. Making impossible to process the test error 3. Get stuck into one the local minima 4. Produce severe overfitting 5. None of the other answers

Lead to disarranging the proportion of examples from each class in the test partitions

Which option is correct? 1. From the confusion matrix is possible to process the classification error 2. From the confusion matrix is possible to process the classification error and vice versa 3. The confusion matrix is applicable only to binary classification systems 4. The classification error is equal to the sum of the diagonal elements of the confusion matrix

From the confusion matrix is possible to process the classification error

According to the notation used in class, which kind of a model is described by the equation 𝑓(𝑥) = 𝑠𝑔𝑛(𝑤 ∙ 𝑥 + 𝑏) 1. Liner classifier 2. Liner regressor 3. Soft-max neuron 4. Sigmoidal neuron 5. Gradient descent formula 6. Number of the model’s parameter

Liner classifier

According to the notation used in class, which kind of a classifier is better described by the following definition: “the output is the label produced by the most probable classifier” 1. Bayes Optimal Classifier 2. Supervised Classifier 3. K-means 4. None of the other options

Bayes Optimal Classifier

According to the class discussion the kNN classifier, what kind of learning is it? 1. Instance-based Learning 2. Eager Learning 3. Hard-limited Learning 4. Unsupervised Clustering 5. None of the other options

Instance-based Learning

According to the class discussion, what is the classifier with the following properties: not based on neural techniques; it’s deterministic with no random initialization; perfect repeatability; a minimum number of parameters is needed; learning is very simple but effective; perfect explain ability 1. kNN 2. Linear classifier 3. Decision Tree 4. K-means 5. None of the other options

kNN

According to the class discussion on kNN classifiers about the k parameter and its relationship to regularization of the decision boundaries and the computational complexity, what is the correct option about larger values of k? 1. More regularization and more complexity 2. Less regularization and more complexity 3. More regularization and less complexity 4. Less regularization and less complexity 5. The parameter k is not related to regularization and complexity

More regularization and more complexity

According to the class discussion on PCA what is the correct option? 1. PCA vectors are originating from the center of mass of the points 2. All subsequent principal component vectors are orthogonal 3. All the other options

All the other options

According to the class discussion on PCA what is the correct option? 1. All subsequent principal component vectors are orthogonal 2. The variance of the data projection on the first PCA vectors is maximized 3. All the other options

All the other options

According to the class discussion about unsupervised learning, what is the method with the following properties: you need to specify the number of clusters k in advance, is unable to handle noisy data and outliers, it is not suitable to discover clusters with non-convex shapes 1. K-means 2. kNN 3. Decision tree 4. None of the other options

K-means

According to the class discussion, considering the equation of the backpropagation in a feedforward neural network of weight 𝑤!" connected to the following output neuron 𝑘, which is the missing term? 𝐷𝐸𝐿𝑇𝐴𝑊 = ? ∗ 𝑦 ∗ 𝑑𝑒𝑙𝑡𝑎 1. ??? = alfa (the regularization term < 1) 2. ??? = alfa (the regularization term > 1) 3. ??? = x_j (the input vector) 4. ??? = x_j (the input vector error)

??? = alfa (the regularization term < 1)

According to the class discussion, considering a general CNN architecture, what is the sequence of modules which is more likely 1. Input layer → Convolution → Relu → Max Pooling → Softmax → Output layer 2. Input layer → Relu → Convolution → Max Pooling → Softmax → Output layer 3. Input layer → Relu → Max Pooling Convolution → Softmax → Output layer 4. Input layer → Relu → Max Pooling → Softmax → Convolution → Output layer

Input layer → Convolution → Relu → Max Pooling → Softmax → Output layer

According to the class discussion, considering a standard intelligent vision system, which capability can be processed onboard on a recent smart industrial camera? 1. Segmentation 2. Segmentation, Measurement 3. Segmentation, Measurement, Classification with trained non-deep models 4. Segmentation, Measurement, Classification with trained deep models 5. Segmentation, Measurement, Classification with trained deep models and training of deep models

Segmentation, Measurement, Classification with trained non-deep models

According to the class discussion, Traditional Segmentation methods are quite useful to produce blobs or object candidates to be further processed by deep models for classification or measurements. Traditional Segmentation methods can be partitioned in 1. Global knowledge, Edge-based 2. Edge-based, Region-based 3. Global knowledge, Edge-based, Region-based 4. None of the other options

Global knowledge, Edge-based, Region-based

According to the class discussion referred to edge computing, is it possible to process images with trained deep learning models on external small, dedicated devices connect via USB connection? 1. True: the usage of dedicated processors and the USB bandwidth make this option possible 2. False: the USB bandwidth make this option not possible 3. False: the needed computational complexity needed to run trained deep learning models make this option not possible 4. False: the bandwidth and the computational complexity need to process images with trained deep learning model is not adequate

True: the usage of dedicated processors and the USB bandwidth make this option possible

According to the class discussion what is Greedy Layer-Wise Training? 1. A supervised training step to improve auto-encoders 2. A supervised training step to classical feedforward networks 3. An unsupervised training step to classical feedforward networks 4. An unsupervised training step to improve auto-encoders

An unsupervised training step to improve auto-encoders

An AI model is processing an input RGB image to evaluate the age expressed in years of the face present in the image. What kind of model is it? 1. Classifier model 2. Regressor model 3. Clustering model 4. Reinforced Learning model 5. Non of the above

Regressor model

Recent artificial intelligence models can solve analogy puzzles like "Paris is to France as Tokyo is to?" producing the correct answer "Japan" 1. True 2. False

True

According to class discussion the theory of intelligent system should include the following designing steps 1. Representation 2. Representation, Evaluation 3. Representation, Evaluation, Optimization 4. None of the other option

Representation, Evaluation, Optimization

Cluster always requires a supervision dataset 1. Yes 2. No

According to the class discussion, using a black box solution is 1. Bad practice for a ML designer 2. Can be used under specific circumstances 3. Since all state of the art models tend to be quite large and un-explainable, it is current good practice to adopt the black box approach since you get the best models

Can be used under specific circumstances

According to the class discussion to the classification systems and their decision boundaries, it is possible in general to optimize during the training/opitimizations step 1. The accuracy 2. The margin 3. Both

Both

According to the class discussion about AI regulation in EU, the regulation approach is based 1. List of use cases 2. Risk assessment of the application 3. Both 4. None of the above

Both

According to the discussion presented in class the EU regulatory framework for AI is 1. Mainly focused on public services 2. Mainly focused on health-related applications 3. Mainly focused on data privacy 4. None of the other options [since it is quite larger]

None of the other options [since it is quite larger]

Considering the "Data knowledge spectrum plot discussed in class, which of the following cases is correct when just some parameters of the mathematical model must be tuned/fitted? 1. When the model of the process generating the data is is available and a huge quantity of data is required to traing the model 2. When the model of the process generating the data is is available and a limited quantity of data is required to fit the parameters 3. When no a-priori information is available and a huge quantity of data is required to traing the model 4. When no a-priori information is available and a limited quantity of data is required to traing the model

When the model of the process generating the data is is available and a limited quantity of data is required to fit the parameters

Numero di parametri vgg16

?? > 1 milione

Vantaggio della CPU nel ML

?? Tutti

Una domanda ad esercizio dove avevamo come risultato un vettore di valori e bisognava indicare quale strategia di normalizzazione è stata applicata, solo che i valori del vettore erano diversi da quelli della simulazione. Erano tipo (0,0,5,10). Io ho messo clipping.

Una domanda dove chiedeva se un tot di tecniche erano tecniche di intelligent visual system: lo erano tutte.

Domanda su cosa fossero vgg, inceptiov3 etc... Ho risposto image classifier