6. Data Analysis And Ch. 4, 7 of E-book Flashcards

1
Q

On a graph with a normal distribution, where is the mean

A

In the middle

- the normal distribution is symmetric about its mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the standard deviation on a normal distribution

A

How broad the bell shape is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the total area under the normal distribution curve defined as

A

1.0 = One whole unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to work work out a standard normal

A

Z= (x - the mean) / standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What values does the Pearson Correlation Coefficient, r, lie between

A

1 and -1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If r is larger than zero, what is the correlation

A

Positive correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

If r is less than zero, what is the correlation

A

Negative correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If r=0 what is the correlation

A

No correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If r=1 what is the correlation

A

Perfect postive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If r=-1 what is the correlation

A

Perfect negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When not to use Pearson correlation

A

There is a non-linear relationship between variables (see Figure (a) below).
There are outliers (see Glossary)
There are distinct sub-groups, for example, if we mix two samples together such as healthy controls and disease cases (see Figure (b)).
One or both of the variables is not normally distributed.
One or both of the variables is non-numeric.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When can Spearman Rank Correlation Coefficient, Rho, be used

A

This correlation coefficient can be used when the data is not normally distributed, when one or both of the variables are ordinal, or when the sample size is small.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is linear regression

A

term used to describe fitting a straight line to points on a scatterplot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are residuals

A

The residuals are the difference between the observed data and the predicted value from the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Give an assumption and requirement of a regression analysis

A

Relationship must be approximately linear

The ‘residuals’ have to be normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly