Stats and experimental design Flashcards

1
Q

Define correlational studies

A

Observing what naturally goes on in the world without directly interfering with it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define experimental studies

A

one or more variable is systematically manipulated to see their effect (alone or in combination) on an outcome variable. Statements can be made about cause and effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define longitudinal studies

A

This term implies that data come from people at different age points, with different people representing each age point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 4 rules about “cause”?

A
  1. Cause is the producer of an effect, while an effect is produced by a cause.
  2. The cause can be a person, object, situation, or event that can result in something, while an effect is the result of the actions of the person or the outcome of some chain of events that have happened.
  3. The cause will in a way explain the reason why the effect happened in the first place.
  4. The cause naturally precedes an effect, while the effect will always follow it.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a simple hypothesis?

A

Where there is a relationship between two variables one is called independent variable or cause and other is dependent variable or effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a complex hypothesis?

A

Where a relation between variables exists and there is more than one dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an empirical hypothesis?

A

When the theory is put to test using actual observation and experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a null hypothesis?

A

No relationship between dependent and independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an alternative hypothesis?

A

The opposite of a null hypothesis - when there are multiple hypotheses and one is selected which more workable and most efficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a logical hypothesis?

A

When the hypothesis is verified logically. The four cannons are: agreement, disagreement, difference and residue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a statistical hypothesis?

A

When the hypothesis can be verified statistically. Will always be regarded as statistical regardless of it being logical or illogical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Continuous data - interval variable

A

The difference between 1 and 2 is equivalent to the difference between 99 and 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Continuous data - ratio variable

A

The same as an interval variable, but the ratios of scores on the scale must also make sense

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Categorical data - binary variable

A

There are only two categories e.g. dead or alive/sink or swim/0 or 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Categorical data - Nominal variable

A

There are more than two categories e.g. omnivore, vegetarian, vegan, or fruitarian.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Categorical data - Ordinal variable

A

The same as a nominal variable but the categories have a logical order, e.g. whether people got a fail, a pass, a merit or a distinction in their exam.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Quantitative data

A

Measurable/objective measures on numeric value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Qualitative data

A

Pain indices; subjective assessments of “condition”; Morphotypes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Measurement error

A

The discrepancy between the actual value we’re trying to measure, and the number we use to represent that value
e.g. You weigh 75 kg. You stand on your bathroom scales and they say 80 kg. The measurement error is 5 kg.

20
Q

Index scatterplot

A

A simple plot () of data points.

Can be used to detect “runs” or can be plotted in a sorted form.

21
Q

Frequency distribution/histogram

A

A bar graph where all the bars touch. Values of observations on x and a bar showing how many times each value occurred in the data set.

More continuous than a bar graph

22
Q

Dotplot

A

A simple stacking of dots at each measure - histogram but dots that don’t touch rather than bars that touch

23
Q

Boxplot

A

Displays the interquartile ranges, the mean and the median

24
Q

Calculate the Z-score

A

Z = (sample(x) - mean(xbar))
__________________
standard deviation (s)

25
Q

Define kurtosis and what types there are

A
Kurtosis = the 'heaviness' of the tails
Leptokurtic = heavy tails (thin peaked line)
Mesokurtic = middle
Platykurtic = lights tails (wide flat line)
26
Q

What is a skew and how would a positive and a negative skew be describe?

A

The skew is the symmetry of the distribution. A positive skew has bunched data at the low values with the tail pointing to the high values and a negative skew vice versa

27
Q

How to calculate the degrees of freedom

A

Degrees of freedom is the sample size (n) minus the number of parameters estimated from the data
(usually df = n - 1)

28
Q

Calculate variance

A

variance = sum of squares/degrees of freedom

29
Q

What is a student’s t-distribution and how does it relates to parameter estimation

A

Student’s t test is used for comparing two sample means, where the samples are independent, the variances are constant, and errors are normally distributed. The test statistic is the number of standard errors by which the two sample means are separated

30
Q

Describe a Type I Error

A

Occurs when we believe that there is a genuine effect in our population when, in fact, there isn’t. The probability is the α-level (usually .05)

31
Q

Describe a Type II Error

A

Occurs when we believe that there is no effect in the population when, in reality, there is. The probability is the β-level (often .2)

32
Q

What are true positives and true negatives?

A

If it is said that there is an effect on the population and there in-fact is, this is a true positive.

This is vice versa for a true negative.

33
Q

Relative Risk Ratio equation

A

(a/(a+b)) / (c/(c+d))

34
Q

Odds Ratio equation

A

(a/b) / (c/d)

use when any of the values are below 30

35
Q

Odds Risk Probability value meanings

A

OR < 1 = event less likely in group 1
OR = 1 = event equally as likely in both groups
OR > 1 = event more like in group 1

36
Q

Relative Risk Ratio value meanings

A
RR = 0 = none of the cases in group 1 had the event occur while x number of cases in group 2 had the event occur
RR = 1 =  neutral result: the chance of an event occurring for one group is the same for an event occurring for the other group
37
Q

Sum of squares (SS)

A

(X1 - Xbar)*

38
Q

Variance

A

(X1 - Xbar)* / (n-1)

39
Q

Covariance

A

(X1 - Xbar)(Y1 - Ybar) + (X2 - Xbar)(Y2 - Ybar) + (X3 etc) / n -1

40
Q

Standard Deviation (SD/S)

A

sqrt ( (X1 - Xbar)* / (n-1) )

41
Q

Standard Error (SE)

A

s / sqrt(n)

42
Q

Confidence Interval (CI)

A

Xbar +/- t ( s/sqrt(n) )

43
Q

Chi test

A

SUM OF ( (O-E)* / E )

where O = observed frequency and E = expected frequency

44
Q

Mean Square (MS)

A

sum of squares (ss) / degrees of freedom (df)

45
Q

F-value (F)

A

mean square of row 1 / mean square over row 2