Final Flashcards
Relationship between INCOME (in $) and CONTINENT of birth could be analyzed using an F-test
True
Relationship between HEIGHT (in cm) of respondents and CONTINENT of birth could be analyzed using a Chi-Squared test
False
Relationship between HEIGHT of respondents (in cm) and SEX could be observed using a Scatter Diagram
False
Relationship between HEIGHT of respondents (in cm) and WEIGHT (in kg) could be observed using a Box Plot
False
If we BIN two scale variables, HEIGHT of respondents (in cm) and WEIGHT (in kg), we would get ORDINAL versions that could be analyzed using a crosstab
True
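The binning idea in the card above can be sketched in a few lines. This is only an illustration: the respondents, the cut points, and the labels are hypothetical and do not come from the course material.

```python
from collections import Counter

# Hypothetical respondents: (height in cm, weight in kg). Illustrative values only.
respondents = [(158, 52), (163, 60), (171, 68), (176, 74),
               (182, 85), (190, 92), (168, 81), (185, 70)]

# Assumed cut points and ordinal labels (not taken from the source).
HEIGHT_CUTS, HEIGHT_LABELS = [165, 180], ["short", "medium", "tall"]
WEIGHT_CUTS, WEIGHT_LABELS = [60, 80], ["light", "medium", "heavy"]

def bin_value(value, cut_points, labels):
    """Return the label of the first bin whose upper cut point is above value,
    or the last label if value is at or above every cut point."""
    for cut, label in zip(cut_points, labels):
        if value < cut:
            return label
    return labels[-1]

# Replace each scale value with its ordinal category.
binned = [
    (bin_value(h, HEIGHT_CUTS, HEIGHT_LABELS), bin_value(w, WEIGHT_CUTS, WEIGHT_LABELS))
    for h, w in respondents
]

# The crosstab is the count of cases in each (height bin, weight bin) cell.
crosstab = Counter(binned)

# Print one row per height category, columns in WEIGHT_LABELS order.
for h_label in HEIGHT_LABELS:
    row = [crosstab[(h_label, w_label)] for w_label in WEIGHT_LABELS]
    print(f"{h_label:<8}", row)
```

The resulting cell counts are what a Chi-Squared test of independence would take as input.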
A t-test is usually the right option to explore a bivariate relationship if we have a scale variable and a categorical variable with more than two categories
False
An ANOVA test is sometimes followed by a Post-Hoc Test (Bonferroni, LSD)
True
The Null Hypothesis for an F-test is that the mean of a scale variable is the same across different categories of a categorical variable
True
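As a rough illustration of that null hypothesis, the F statistic of a one-way ANOVA compares the variation between group means with the variation within groups. The sketch below uses made-up incomes per continent and a hypothetical helper named one_way_f; it computes only the statistic and its degrees of freedom, since a p-value would need the F distribution from a statistics library.

```python
from statistics import mean

# Hypothetical incomes (in $) by continent of birth. Illustrative values only.
income_by_continent = {
    "Europe": [30000, 32000, 34000],
    "Asia": [28000, 30000, 32000],
    "Africa": [35000, 37000, 39000],
}

def one_way_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA on a list of groups."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)

    # Between-group sum of squares: how far each group mean is from the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: spread of cases around their own group mean.
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)

    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

f_stat, df_between, df_within = one_way_f(list(income_by_continent.values()))
print(f"F({df_between}, {df_within}) = {f_stat:.2f}")
```

If the group means were all equal, the between-group sum of squares would be zero and so would F; a large F is evidence against the null hypothesis.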
ANOVA is one of the usual inferential tests that complements a SCATTER X/Y graph
False
If we get a p-value of 0.04 in a bivariate INDEPENDENCE test, that means that we have evidence of a relationship with a maximum confidence of 96%
True
CLUSTER is a SUPERVISED CLASSIFICATION method
False
Cluster analysis is mainly used to aggregate CASES, not FIELDS
True
Cluster is one of the main technical resources for PREDICTIVE ANALYTICS
False
The evaluation of a cluster solution is NOT MAINLY a technical assessment task
True
We normally STANDARDIZE metric and categorical variables to run a cluster analysis
False
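One reading of the card above is that standardization applies to the metric variables only, so that fields measured in different units contribute comparably to the distance; categorical fields are left as they are. A minimal sketch of z-score standardization, with hypothetical values and a hypothetical helper named z_scores:

```python
from statistics import mean, pstdev

def z_scores(values):
    """Standardize a metric variable to mean 0 and standard deviation 1."""
    m = mean(values)
    s = pstdev(values)  # population standard deviation
    return [(v - m) / s for v in values]

# Hypothetical metric fields in different units. Illustrative values only.
heights_cm = [160, 170, 180]
weights_kg = [55, 70, 85]

z_heights = z_scores(heights_cm)
z_weights = z_scores(weights_kg)

# After standardization both fields are on the same unit-free scale.
print([round(z, 3) for z in z_heights])
print([round(z, 3) for z in z_weights])
```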
We normally run a HIERARCHICAL cluster only if the number of cases/individuals is relatively small
True
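A toy sketch can show why hierarchical clustering suits small samples: every step compares every remaining pair of clusters, so the work grows quickly with the number of cases. The example assumes single linkage on one hypothetical metric field; the function name and data are illustrative, and real software offers other linkage and distance options.

```python
from itertools import combinations

def single_linkage_schedule(points):
    """Repeatedly merge the two closest clusters (single linkage) and
    return the merges as (kept_cluster, absorbed_cluster, distance)."""
    clusters = {i: [i] for i in range(len(points))}  # each case starts alone
    schedule = []
    while len(clusters) > 1:
        best = None
        # Compare every pair of current clusters: the costly part.
        for a, b in combinations(sorted(clusters), 2):
            d = min(abs(points[i] - points[j])
                    for i in clusters[a] for j in clusters[b])
            if best is None or d < best[0]:
                best = (d, a, b)
        d, a, b = best
        clusters[a] = clusters[a] + clusters.pop(b)  # merge b into a
        schedule.append((a, b, d))
    return schedule

# Hypothetical heights in cm for five cases. Illustrative values only.
heights_cm = [150, 152, 170, 173, 190]
schedule = single_linkage_schedule(heights_cm)
for step, (a, b, d) in enumerate(schedule, start=1):
    print(f"step {step}: merge cluster {b} into cluster {a} at distance {d}")
```

The list of merges printed here plays the role of an agglomeration schedule: a jump in the merge distance suggests a natural place to stop merging.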
We can use a HIERARCHICAL cluster combining metric and categorical variables
False
The AGGLOMERATION schedule is not a piece of interest in a Two-Step Cluster
True
The choice of the distance measure in hierarchical clusters depends, basically, on the type of variables (categorical, scale, …)
True
One of the advantages of Two-Step is its inherent ability to handle outliers
True
Normally, a good cluster solution is shaped with a LARGE number of clustering variables (no fewer than 15)
False
A field/variable can be very relevant to define/distinguish a SPECIFIC CLUSTER, without being of great importance for the solution as a whole
True