Data analysis Flashcards

1
Q

Conditions for chi square

A
  1. Independence
  2. 5+ Sample size
  3. > 1 degree of freedom
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do rows in data represent?

A

Observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do columns in data represent

A

Variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Experimental Study

A

Manipulation of an independent variable to measure the impact on a dependent variable (control group etc.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Observational study

A

Study of the relationship between variables with no manipulation/intervention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Where does the explanatory variable belong

A

Horizontal axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Where does the response variable belong

A

Vertical axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is sample standard deviation

A

Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Calculate the probability (A AND B)

A

P(A) x P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Calculate the probability (A OR B)

A

P(A) + P(B) - P(A AND B)
- e.g. (0.3 + 0.7) - (0.7 x 0.3) = 0.79

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Calculate the Probability A given B

A

If the events are independent, the probability is just P(A) - the occurrence of B does not affect the probability of A

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A given B - conditional

A

P(A and B)/P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Bayes Theorem

A

P(A/B) = P(B/A)P(A) / P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Z score formula

A

x-mean / standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Binomial distribution

A
  • Fixed number of independent trials
  • 2 possible outcomes ‘success’ and ‘failure’
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

T-distribution

A

t = x - μ / s / root(n)

x - sample mean
μ - population mean
s - sample SD
n - Sample size

17
Q

How to calculate degrees of freedom?

A

Sample size (n) - 1

18
Q

What does mutually exclusive mean?

A

When P(A and B) = 0
Events are mutually exclusive when they cannot happen at the same time, and share no basic outcomes

19
Q

Events are independent if…

A

The occurrence of one has no impact on the probability of the other occurring

20
Q

Two tailed or One tailed test

A
  • One tailed - only want to know either lower or higher
  • Two tailed - testing for a difference in either direction
21
Q

Standard error

A
  • Standard deviation / (root)Sample
22
Q

Key notation

A

𝑥- sample mean
𝑝- sample proportion
𝜇- population mean
𝑝- population proportion
𝑠- sample standard deviation
𝜎- population standard deviation
𝑛- sample size
𝛽- population regression coefficient
𝑏- sample regression coefficient
Σ- summation operator

23
Q

Extrapolation

A
  • When you predict about something outside the range of data you already have, based on the value of the predictor variable
  • e.g. using the adults weight for the baby weight model
24
Q

Residual

A

Observed value - predicted value

25
Q

Linear probability model

A
  • ## Odds always between 0 and , which represent the probability of them occurring (yngkid = 0,1)
26
Q
A
27
Q
A