Module 1 Flashcards

1
Q

What is Statistical Inference?

A

It is the statistical concept allowing us to generalize a result obtained on a sample onto an entire population?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are Descriptive Statistics?

A

They are methods used to summarize the information contained in a sample using numerical indicators or graphs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 2 broad categories of variables?

A

Qualitative variables and Quantitative variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do Qualitative measures report on?

A

Information of qualitative nature.

For example: the type of device used to consult a website.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What do Quantitative measures report on?

A

Those which are evaluated on a numerical scale.

For example, the time during which a user looks at a specific element is measured in seconds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the 2 sub-groups of Qual Variables?

A

Ordinal and Nominal variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are Ordinal Variables?

A

They correspond to situations where there exists a natural ordering of categories.

For example: the level of familiarity with the tool:
beginner, intermediate, or advanced.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a Nominal Variable?

A

A nominal variable is a variable whose values are simply labels used to name each of the categories.
There is no notion of order.

For Example: the type of device used to visit a website - the information codification one for computer, two for smartphone, and three for tablet - is completely arbitrary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are discrete variables?

A

Discrete variables take values that can be distinctly listed. They are often used to count things.

For instance, the number of clicks made on a website’s button in a day, which can take values zero, one, two, three, etc..

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the 2 types of Quant Variables?

A

Discrete Variables and Continuous Variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are Continous Variables?

A

Continuous variables take values in an interval.

For example, the completion time of task by user can take any value between zero and, potentially, an infinite number of minutes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are Indicator Variables?

A

They are binary variables, meaning they can take only two possible values which are either coded zero or one.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why is the zero-one value used?

A

It facilitates certain statistical analysis, namely when interpreting the results obtained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the 3 popular central tendencies?

A

Mean, Median and Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is the Mean described?

A

The sum of the observations over the number of observations, is known to all.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How is the median calculated?

A

Calculating the median involves ordering the numbers in ascending order.

The median is simply the observation in the middle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How is an even Median Calculated?

A

If the number of observations is even, then the median is defined as the mean of two observations in the middle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How is the mode described?

A

The mode is the most frequent observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The mean is much more sensitive to extreme values than the median?
True or False?

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Different measures of central tendency do not capture exactly the same information and have different properties.
True or False?

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What type of measure is Standard deviation?

A

Standard deviation is a widely used measure of dispersion.

22
Q

How do we measure dispersion?

A

We can measure the dispersion of the data by their deviations from the mean value.

23
Q

What are 2 types of measures of dispersion?

A

Standard Deviation and Range.

24
Q

How is the range dispersion measured?

A

This is the difference between the maximum and the minimum.

25
Q

How is Standard Deviation measured?

A

The standard deviation is the average deviations from the mean

26
Q

What variables are adequet for a histogram?

A

continuous variables and discrete variables

27
Q

What are some Graphical Representations?

A

Pie Chart, Histogram, Bar Graph, temporal graph

28
Q

What does correlation correspond to?

A

correlation corresponds to an association between two variables

29
Q

How do we address the question of correlation?

A

A graphical tool, such as a scatterplot, and a numerical measure, like the Pearson correlation coefficient.

30
Q

What does an increasing line represent?

A

A positive association, a positive linear
relationship

31
Q

What does the correlation coefficient measure?

A

The direction of a relationship and its strength.

32
Q

How is the correlation coefficient defined?

A

This coefficient is defined on a scale ranging from minus one to one.

33
Q

What does a decreasing line represent?

A

A negative correlation coefficient , the line
which best describes the relation is decreasing or has a negative slope.

34
Q

How is the degree of association defined?

A

On a scale of minus one - one

35
Q

What does a correlation near the center indicate?

A

A correlation near the center, therefore, zero
indicates a weak relationship. There is no linear relationship

36
Q

What does a correlation of minus one indicate?

A

A correlation of -0.8 corresponds to a strong negative association.

A correlation of minus one indicates a perfect negative relationship

37
Q

What does a correlation of 1 indicate?

A

When the correlation increases approaching one, for example, when it is at 0.8, we are in the presence of a strong positive linear association.

1 is a perfect positive relationship.

38
Q

Why myst we ensure that the relationship under study is linear in nature?

A

Since correlation is a measure of linear association, it is important to ensure that the relationship under study is linear in nature before calculating or interpreting the correlation coefficient.

39
Q

When is it irrelevant to speak of the correlation coefficient?

A

If the scatterplot rather reveals a relation of another form

40
Q

What does Cronbach’s alpha measure?

A

To measure the degree of coherence or consistency between the measures taken of various aspects of one and the same characteristic.

41
Q

Ordinal Qualitative Variable Examples:

Ordinal Variables = Natural ordering of categories

Ordinal variables are variables that have categories with a specific order or ranking to them, However, they generally don’t demonstrate distances between intervals on the scale.

A

Education level: (Highschool,College, Post Graduate)
Socioeconomic Status: (Low Income, Middle Incom, High Income)
Likert Scale: (agreement or satisfaction)
Movie Ratings: (Poor, Average, Good)
Frequency of Occurrence: (Never, Rarely, Sometimes)
Difficulty Level
Hotel Class
Clothing Sizes

They correspond to situations where there exists a natural ordering of c

42
Q

Nominal Qualitative Variable Examples:

Nominal Variables = Categories without numerical value

Nominal data is a type of qualitative data that is used to label or name variables without providing numeric values. It is the most basic type of measurement scale. It can be sorted into groups, but it has no specific order or hierarchy

A

Nationality
Blood type
Hair color
Preferred mode of transportation

Open-ended questions can be considered nominal because the respondent’s answer is categorized based on their response.

43
Q

Discrete Quantitative Variable Examples:

Discrete Variables = Used to Count Something

Discrete Variable are a type of data that can only take on specific numeric values, and those values have a clear quantitative interpretation. Discrete data is always numerical and can be counted, but not measured.

A

The Population Count
Age
The Number of Students in a Classroom
Customers at a Coffee Shop
Number of Rainy Days in Various Cities

The Number of _________

44
Q

Continuous Quantitative Variable Examples:

Continuous Variable = Numerical / Not Countable

A numeric variable that can take on any value, including values between two values, and is measured with decimal precision. The values are not countable and have an infinite number of possibilities.

A

Height (5.7 feet)
Weight(150.3 lbs)
Temperature(12.8 degrees)
Time (2.5 hours)

Often associated with measurements

45
Q

The grey zone Examples:

It is possible for a variable to be considered of one type or of another

A

Let us take as an example of the completion time of a task. The variable considered is:

Continuous if the time is measured with an accurate stopwatch.
Discrete if the measured time is rounded, for example, to the nearest minute.
Ordinal if the time is coded according to the following categories
1 = Less than 5 minutes
2 = Between 5 and 10 minutes
3 = More than 10 minutes

46
Q

Which variables can the Mode not be used with?

A

The mode cannot be used on continuous variables

47
Q

Which variables can the Mean not be used with?

A

The mean can not be used on nominal variables

48
Q

How do we Interpret the correlation coefficient?

A

The sign – positive vs negative
The magnitude – number between -1 and 1

49
Q

What is the purpose of the Scatter plot?

A

the Scatter plot helps visualize the relationship between two quantitative variables

It is always preferable to study BOTH the scatterplot and the correlatio

50
Q
A