Research Methods- Data analysis: Descriptive statistics Flashcards

1
Q

Descriptive statistics -

A

The use of graphs, tables and summary statistics to identify trends and analyse sets of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Measures of central tendency -

A

The general term for any measure of the average value in a set of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Mean -

A

The arithmetic average calculated by adding up all the values in a set of data and dividing by the number of values there are.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Median -

A

The central value in a set of data when values are arranged from lowest to highest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mode -

A

The most frequently occurring value in a set of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are measures of central tendency?

A

Measures of central tendency are averages that provide information about the most typical values in a dataset. The three main measures are the mean, median, and mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the mean?

A

The mean is the arithmetic average, calculated by adding all scores in a dataset and dividing by the total number of scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is the mean calculated?

A

For the dataset: 5, 7, 7, 9, 10, 11, 12, 14, 15, 17, the mean is calculated as (5+7+7+9+10+11+12+14+15+17) ÷ 10 = 10.7.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a strength of the mean?

A

It is the most sensitive measure of central tendency as it includes all scores in the dataset, making it representative of the data as a whole.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a limitation of the mean?

A

It can be easily distorted by extreme values. For example, replacing 17 with 98 changes the mean to 18.8, which is not representative of the dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the median?

A

The median is the middle value in a dataset when scores are arranged in order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is the median calculated?

A

For an even number of scores, the median is the average of the two middle values. In the dataset: 5, 7, 7, 9, 10, 11, 12, 14, 15, 17, the median is (10 + 11) ÷ 2 = 10.5.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a strength of the median?

A

It is not affected by extreme values, making it more reliable for datasets with outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a limitation of the median?

A

It is less sensitive than the mean because it does not include all scores in the calculation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the mode?

A

The mode is the most frequently occurring value in a dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is an example of the mode?

A

In the dataset: 5, 7, 7, 9, 10, 11, 12, 14, 15, 17, the mode is 7.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a limitation of the mode?

A

It is a crude measure and may not be representative of the dataset as a whole.

17
Q

What is a strength of the mode?

A

It is easy to calculate and is the only measure of central tendency that can be used for categorical data.

18
Q

What is a bimodal dataset?

A

A bimodal dataset has two modes, meaning two values occur with the same highest frequency.

19
Q

What are measures of dispersion?

A

Measures of dispersion describe how spread out the scores in a dataset are. The two main measures are the range and standard deviation.

20
Q

What is the range?

A

The range is the difference between the highest and lowest values in a dataset, often calculated as (highest value - lowest value) + 1.

21
Q

How is the range calculated?

A

For the dataset: 5, 7, 7, 9, 10, 11, 12, 14, 15, 17, the range is (17 - 5) + 1 = 13.

22
Q

What is a strength of the range?

A

It is easy to calculate and provides a quick measure of spread.

23
Q

What is a limitation of the range?

A

It only considers the two extreme values, which may not represent the overall spread of the dataset.

24
Q

What is standard deviation?

A

Standard deviation is a measure of how far scores deviate from the mean, providing a precise measure of dispersion.

25
Q

What does a high standard deviation indicate?

A

A high standard deviation indicates that scores are widely spread, suggesting variability in responses or effects.

26
Q

What does a low standard deviation indicate?

A

A low standard deviation indicates that scores are tightly clustered around the mean, suggesting consistency in responses or effects.

27
Q

What is a strength of standard deviation?

A

It includes all values in the calculation, providing a precise measure of dispersion.

28
Q

What is a limitation of standard deviation?

A

It can be distorted by extreme values, similar to the mean.

29
Q

What is an example of a dataset with a misleading range?

A

In the dataset: 0, 47, 49, 50, 50, 50, 51, 53, 54, 56, 56, 57, 100, the range is 101, but most scores are clustered around 50, making the range unrepresentative.

30
Q

What is the purpose of adding 1 to the range?

A

Adding 1 accounts for rounding errors in raw scores, ensuring the range accurately reflects the spread of data.

31
Q

What is the relationship between standard deviation and the mean?

A

Standard deviation measures how far scores deviate from the mean, providing insight into the variability of the dataset.

32
Q

What is an example of a dataset with a low standard deviation?

A

A dataset with scores: 10, 11, 12, 11, 10 has a low standard deviation, indicating scores are close to the mean.

33
Q

What is an example of a dataset with a high standard deviation?

A

A dataset with scores: 5, 20, 35, 50, 65 has a high standard deviation, indicating scores are widely spread.

34
Q

What is the importance of measures of central tendency?

A

They provide a summary of the most typical or central values in a dataset, helping researchers understand the overall pattern of data.

35
Q

What is the importance of measures of dispersion?

A

They describe the spread or variability of scores, providing insight into the consistency or variability of responses in a dataset.

36
Q

What is the difference between the mean and the median?

A

The mean is the arithmetic average, while the median is the middle value. The mean is sensitive to extreme values, whereas the median is not.

37
Q

What is the difference between the range and standard deviation?

A

The range is a simple measure of spread based on extreme values, while standard deviation is a precise measure of how far scores deviate from the mean.

38
Q

What is the role of the mode in categorical data?

A

The mode is the only measure of central tendency that can be used for categorical data, identifying the most frequently occurring category.

39
Q

What is an example of a dataset with no mode?

A

A dataset with scores: 5, 7, 9, 11, 13 has no mode, as all values occur with the same frequency.

40
Q

What is the significance of standard deviation in experiments?

A

A high standard deviation suggests variability in responses to the independent variable, while a low standard deviation indicates consistent effects.

41
Q

What is the impact of outliers on the mean and median?

A

Outliers significantly affect the mean but have little to no effect on the median, making the median more robust for datasets with extreme values.