Data Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the Measures of Central Tendency?

A

Mode
Median
Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is meant by Central Tendency, and Dispersion?

A

Central Tendency - A descriptive statistic that calculate the average almost typical value in the dataset; that is the average score recorded

Dispersion - A descriptive statistic that calculates the spread of scores in the dataset. Measures of central tendency can be misleading without knowing the variation between the scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the Mode?

A

Calculates The most frequent score in a data set. The mode is the value that occurs most frequently

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the Median?

A

The middle number within an ordered set of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the Mean?

A

The average number within a set of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What type of data is the mode, median and mean typically used for?

A

Mode - Nominal
Median - Ordinal
Mean - Interval/Ratio Level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the Strengths + Weaknesses of the Mode?

A

The mode is very easy to determine. It is not affected by extreme scores.

However it is not a useful measure of central tendency on small data sets, with frequently occurring same values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the Strengths + Weaknesses of the Median?

A

The median is not affected by extreme scores or a skewed distribution

However, it is less sensitive than the mean and is not useful on data sets that have a small number of values, as it may not represent the typically score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the Strengths + Weaknesses of the Mean?

A

It is the most powerful measure of central tendency because all of the scores in the data set are used in the calculation

It can be affected by extreme values, or when there is a skewed distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the Measures of Dispersion?

A

Range

Standard Deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the Problems with the Range?

A

The range is affected by extreme scores, so it may not be a useful descriptive statistic if there are outliers in the dataset.

It also doesn’t tell us if the scores or bunched around the mean all more equally distributed around the mean

If the dataset has extreme scores, it is often better to calculate the interquartile range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the Strengths + Weaknesses of the Standard Deviation?

A

D

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the Inferential Statistics Table look like?

A

NO IRC

Chi2, Sign, Chi2
MWU, Wilcoxon, Spearmans Rho

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does it mean by ‘Ordinal +’ ?

A

The levels of measurement are ranked in terms of sophistication- Nominal, then Ordinal, then Interval.

Ordinal + just means ordinal and interval data. The ones used for interval are no longer really used anymore, we just use the ordinal one for them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the types of Graphical Representations of data?

A

Bar Graph
Pie Chart
Histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are Bar Graphs?

A

Bar graphs are used to present data from a categorical variable, such as the main/medium/mode.

The categorical variable is placed on the x axis, and the height represents the value of that variable

17
Q

What are Histograms?

A

The histogram is used to present the distribution of scores by illustrating the frequency of values in the dataset.

Unlike a bar chart, the bars on a histogram are joined to represent continuous data rather than discrete data.

The possible values are presented on the X axes and the height of each bar represents the frequency of the value

18
Q

How would a Graph always be Labelled?

A

Relevant graph title
Relevant X axis title
Relevant Y axis title

Plot anything you want (say for practical), or is relevant to the question.

19
Q

What does it mean to have Normal Distribution?

A

Normal distribution is characterised by symmetry around the midpoint. The mode median and mean should be aligned around the midpoint.

The tail ends should not meet the horizontal axis and we can estimate the percentage of people that fall under the curve at each standard deviation.

20
Q

What is the Normal Distribution curve?

A

http://img.tfd.com/mk/D/X2604-D-41.png

21
Q

What does it mean to have a Negative Skew?

A

If the aptitude of the sample is unusually high, it told me that most people score highly. This leads to a negative skew, where many people score above the average/mean score

22
Q

What does it mean to have a Positive Skew?

A

If the aptitude of the sample was low it will mean that most people will achieve a lower score below the mean. This will lead to a positive skew

22
Q

What does a Positive and Negative Skew look like on a diagram?

A

http://www.statisticshowto.com/wp-content/uploads/2014/02/pearson-mode-skewness.jpg

24
Q

What are the different levels of measurement?

A

Nominal
Ordinal
Interval

25
Q

What is Nominal Data?

A

Where data forms discrete categories. We know nothing about each value within the categories, we just know the category names.

E.g. 1 - hair can only be nominal data because it can only be described in its categories of blonde, brown, red or black.

E.g. 2 - if you divide the class into students under 1.85m tall and over 1.85m tall, and calculate the frequency in each category, you would have nominal data. You would not know the actual heights of each individual student, or their height in relation to one another

26
Q

What is Ordinal Data?

A

A level of measurement where numbers are rankings rather than scores in themselves. It tells us about the position, but does not tell us what was actually achieved or the difference between each rank

E.g. a rank order for attractiveness on a scale of 1 to 5

27
Q

What is Interval/Ratio Data?

A

Data where an individual score for each participant is gathered, and the score can be identified using a recognised scale with equal distances between each score

E.g. cm, seconds, minutes, kilograms, etc

28
Q

How can you judge significance? What does p<0.x mean?

A

For example, p < 0.10

This means the probability of whether the results are due to chance is 10%

29
Q

What are Type I errors?

A

A type one error occurs because the level of significance is too lenient

When the null hypothesis is rejected and alternative hypothesis is supported when the effect was not real

30
Q

What are Type II errors?

A

A Type II error occurs because the level of significance is too stringent

When the alternative hypothesis is rejected and the null retained when there was actually a real effect

31
Q

Which probability do psychologists normally use?

A

(P<0.05)