Statistics Flashcards

1
Q

Are statistics a method of data gathering or data processing?

A

Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the measures of central tendency?

A

The mean, median and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the mean?

A

Average value of a group of statistical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the median?

A

Middle value of the group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the mode?

A

The value that appears most regularly in the group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is standard deviation?

A

The measure of the average amount by which the data differs from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Advantages of standard deviation:

A

+ shows variety
+ easy to describe, as not square units
+ data is likely to be similar and, therefore, easy to process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Disadvantages of standard deviation:

A

Easily affected by anomalies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the standard error of the mean?

A

The measure of the accuracy of a sample mean, as an estimate of the total population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is chi squared

A

Determines the differences between observed and expected data, as well as whether or not this is due to explainable factors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you carry out chi squared?

A

Use formula: compare expected and observed averages. Compare test statistic to 95% significance level. Prove or disprove null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Advantages of chi squared

A

+ Determines a significance level - easy analysis

+ useful if data can be grouped

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is spearman’s rank correlation coefficient

A

A test that measures the strength and direction of a relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you carry out spearman’s rank

A

Use formula: compare the squared value of their differences by the number of values
Determine if positive or negative correlation
Determine if significant or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are spearman’s ranks significance levels?

A
\+ 0-0.19 = weakest correlation 
\+ 0.20-0.39 = weak correlation 
\+ 0.40-0.59 = moderate correlation 
\+ 0.60-0.79 = strong correlation 
\+ 0.80-1 = strongest correlation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Advantages of spearman’s rank:

A

+useful if the data is ordinal (rank able)
+ identifies comparable relationships
+ identifies nature of correlation
+ quicker to calculate than pearson’s

18
Q

Disadvantages of spearman’s rank:

A

+ requires 7 pairs of data
+ ranking of data ignores magnifiques of differences
+ does not give reasons for difference

19
Q

What is pearsons’ product moment correlation coefficient?

A

Similar to spearman’s. Measures strengths if a linear relationship between two variables

20
Q

What are the advantages of Pearson’s coefficient?

A

+ identifies relationship between two variables
+ identifies nature of correlation
+ Uses actual values, not ranked, so raw data is not lost
+ useful if data is normally distributed
+ increasingly precise

21
Q

What are the disadvantages of Pearson’s coefficient?

A

+ time consuming
+ doesn’t explain correlation
+ affected by extreme values
+ assumes all relationships are linear

22
Q

What is the nearest neighbour index?

A

A statistic that measures the distribution between data sets, according to whether they are clustered, random or regular.

23
Q

What is the measuring values of the nearest neighbour index?

A

0 - 2.15

0 is clustered
1 is random
2.15 is regular

24
Q

How do you carry out the nearest neighbour index?

A

Measure distance between the two sites
Compare distance to average distance between sites of entire area
Insert values into formula
Compare to previous values to determine if the distribution is random, regular or clustered.

25
Q

Advantages of nearest neighbour index:

A

+ increasingly objective
+ easy to compare areas
+ allows change over time to be identified

26
Disadvantages of nearest neighbour index:
+ dependent on the side of the area + number does not actually represent pattern, only general overview + some nearest neighbours may be out of sample area + larger areas may end up in exaggerate patterns of clustering
27
What is linear regression analysis?
A statistic that determines the lineal relationship between a dependant and an independent variable. Plotted with line of best fit.
28
Advantages of linear regression analysis:
+ determine strength by measuring surrounding clustering + correlation is visually determined - easy to comprehend + dimple calculation + future trends can be calculated
29
Disadvantages of linear regression:
+ must already have a correlation to present, have to completed spearman’s (for example) beforehand. + no reasoning + no nature of correlation, might be chance
30
What is a location quotient?
A method of quantifying the concentration of a certain variable in one area, compared to the national mean
31
Advantages of a locational quotient:
+ quick and easy to calculate | + clearly shows concentration, when compared with a graph
32
Disadvantages of a locational quotient:
+ raw data is lost as percentages are used + figure may be hard to interpret + no explanation of pattern
33
What is variance?
The measure of how far a data set is spread out
34
Advantages of variance:
+ quick and simple to calculate + squaring data ensures results are not zero + useful if data is normally distributed
35
Disadvantages of variance:
+ doesn’t shows if the correlation is positive or negative + hard to interpret + raw data is lost as units are squared + can be influenced by extreme values
36
Disadvantages of chi squared
+ doesn’t show scale of difference + needs at least 20 data values + complicated formula + trends are not shown by averages and could be affected by anomalies