Week 4 Flashcards

1
Q

Descriptive vs Inferential statistics

A

Descriptive statistics describe a sample of population through specific measures: mean, mode, variance

Inferential statistics infer the properties of a population through measures calculated on a sample population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

3 measures of central tendency measurements in descriptive statistics

A

Mean, median, mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Mean vs median vs mode

A

Mean: mathematical center of sample
Mode: most frequently occurring value in sample
Median: value occurring in the center of an ordered sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

3 measures of variability in descriptive statistics

A

Range, variance, std deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Range vs variance vs std deviation

A

Range: Max-min
Variance: ∑(ni - x̅) / N
Std dev: sqrt(variance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

T-tests

A

Aka student’s test is a parametric inferential test that compares if there is significant difference between the means of 2 groups and describing there difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ANOVA

A

Analysis of variance, allows for testing significant difference of means in more than 2 samples

  • Extension of t-tests
  • Samples have normal distribution
  • samples are random and independent
  • Each group has common variance
  • Data are independent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is regression analysis used?

A

It is used to find the relationship between a set of variables in a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Dependent vs independent variable in linear regression?

A

Dependent variable: variable that is being predicted, aka “response variable”, or “outcome variable”

Independent variable: aka “explanatory variable”, “predictor variable”, the variable that is said to influence the dependent variable usually labeled X

ie How does the number of hours studied (X: predictor) affect the student’s test score (Y: response)?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which plot is useful for visualizing linear trends in data sets.

A

Scatter plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the ordinary least squares equation and what is it used for?

A

The least squares equation is used to fit a data set to a line with a given slope (m in y = mx + b), estimating the unknown values of a model on the line.

The equation is defined as :

m = ∑(x - x̄) * (y - ȳ)
——————-
∑ (x - x̄) ²

How well did you know this?
1
Not at all
2
3
4
5
Perfectly