Ts Flashcards

1
Q

What is the purpose of a data analysis plan?

A

The purpose of a data analysis plan is to gather useful information to find solutions to research questions of interest. It may be used to:
- describe data sets;
- determine the degree of relationship of variables;
- determine differences between variables;
- predict outcomes; and
- compare variables.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three types of data analysis strategies?

A

The three types of data analysis strategies are:
1. Exploratory Data Analysis
2. Descriptive Data Analysis
3. Inferential Data Analysis

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the purpose of Exploratory Data Analysis?

A

Exploratory Data Analysis is used when it is not clear what to expect from the data. This strategy uses numerical and visual presentations such as graphs. It helps identify inconsistencies in the data.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the purpose of Descriptive Data Analysis?

A

Descriptive Data Analysis is used to describe, show, or summarize data in a meaningful way, leading to a simple interpretation of data. It does not allow for formulating conclusions beyond the described data.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the purpose of Inferential Data Analysis?

A

Inferential Data Analysis tests hypotheses about a set of data to reach conclusions or make generalizations beyond merely describing the data. It includes tests of significance of difference and tests of relationship.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the four levels of measurement scales?

A

The four levels of measurement scales are:
1. Nominal Scale
2. Ordinal Scale
3. Interval Scale
4. Ratio Scale

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Nominal Scale of measurement used for?

A

A Nominal Scale of measurement is used for labelling variables. It is sometimes called categorical data and is used for variables like gender, religious affiliation, race, or ethnic group.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an Ordinal Scale of measurement used for?

A

An Ordinal Scale of measurement assigns order on items being measured, such as ranking individuals, attitudes, or characteristics. It establishes ranking without specifying differences in order.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an Interval Scale of measurement used for?

A

An Interval Scale of measurement has equal units of measurement, allowing for interpreting the order of scale scores and the distance between them. However, it does not have a ‘true zero’.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a Ratio Scale of measurement used for?

A

A Ratio Scale of measurement is considered the highest level of measurement, having the characteristics of an interval scale with a zero point. All statistical operations can be performed on ratio scales.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the common measures of central tendency?

A

The common measures of central tendency include the mean, median, and mode. These measures help determine the center or middle of a set of data.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the Mean in terms of central tendency?

A

The Mean is often called the arithmetic average of a set of data. It is calculated as the sum of the observed values divided by the number of observations.

Source: Planning Data Analyses Using Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the mean?

A

The mean is the sum of the observed values in the distribution divided by the number of observations. It is frequently used for interval or ratio data. The symbol X (x bar) is used to denote the arithmetic mean.

The mean is calculated by summing up the observations (items, height, scores or responses) and dividing by the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is the mean calculated?

A

The mean is calculated by summing up the observations and dividing by the number of observations.

Formula: X = Σx / n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the weighted mean?

A

The weighted mean is necessary in some situations when you wish to find the mean of all the measures combined into one group. The formula for weighted mean is given by Σ(fx) / n, where f = frequency, x = numerical value or item in a set of data, and n = number of observations in the data set.

Weighted Mean: When the data is grouped into classes, the class midpoint represents the ‘X’ in the formula.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the median?

A

The median is the midpoint of the distribution. It represents the point in the data where 50% of the values fall below that point and 50% fall above it. When the distribution has an even number of observations, the median is the average of the two middle scores. The median is the most appropriate measure of central tendency for ordinal data.

The median may be calculated from ungrouped data by arranging the items from lowest to highest and finding the middle value.

17
Q

What is the median when the number of observations is odd?

A

The Median is 9.

18
Q

What is the median when the number of observations is even?

A

The Median is 20.

19
Q

How to find the median for a set of measurements?

A

The Median is 15.

20
Q

How is the median calculated for grouped data?

A

The Median is 37.77.

21
Q

What is the mode?

A

Mode=25.

22
Q

What is the mode when there are multiple most frequent values?

A

Mode=2 and 4.

23
Q

What is the modal class for the distribution of ages of 100 people?

A

Mode is 31-40.

24
Q

What is bimodal data?

A

Mode=2 and 4

Example 3: Referring to the data on the distribution of the ages of 100 people interviewed for a survey on a topic on national interest, the modal class is 31-40. The mode which corresponds to the class midpoint would be

25
Q

What is the range?

A

The range is the difference between the largest and the smallest values in a set of data.

26
Q

What is average (mean) deviation?

A

This measure of spread is defined as the absolute difference or deviation between the values in a set of data and the mean, divided by the total number of values in the set of data.

27
Q

What is standard deviation?

A

The standard deviation (SD) is a measure of the spread or variation of data about the mean.

28
Q

What percentage of scores falls within one standard deviation of the mean?

A

68% of the scores falls within one standard deviation of the mean

Example: In our example, with a mean of 17.5 and a standard deviation of 6.95, the range would be from 10.5 to 24.45

29
Q

What percentage of scores falls within two standard deviations of the mean?

A

Approximately 95% of the scores falls within two standard deviations of the mean

30
Q

What percentage of scores falls within three standard deviations of the mean?

A

Approximately 99% of the scores falls within three standard deviations of the mean

31
Q

What is the purpose of inferential statistics?

A

Inferential statistics allows us to use samples to make generalizations about the population from which the samples were drawn

32
Q

What is the Test of Significance of Difference (T-test) used for?

A

The T-test is used to measure significant differences between means for independent samples

33
Q

What is ANOVA used for?

A

ANOVA is used to determine the significance of difference of means of two or more groups at one time

34
Q

What does ANOVA rely on to test the hypothesis of equal variances?

A

ANOVA relies on the F-ratio to test the hypothesis that the two variances are equal; that is, the subgroups are from the same population

‘Between groups’ refers to the variation between each group mean and the grand or overall mean

35
Q

When is the Spearman Rank-Order Correlation used?

A

Spearman Rank-Order Correlation is used when data available are expressed in terms of ranks (ordinal variable)

36
Q

When is the Chi-Square Test for Independence used?

A

The Chi-Square Test for Independence is used to test the independence of two categorical variables