Benchmark info Flashcards

1
Q

describe statistics

A

the science of developing methods for collecting, analyzing, interpreting, and presenting data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the main goal of statistics

A

to use information from a sample to make inferences about the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a variable?

A

a measurable attributs that can vary across entities and is represented by a column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is an observation?

A

the complete set of recorded values for each entity, typically encapsulated in a single row of dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is correlation?

A

it measures the association between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

when should you use mean and how is it affected by outliers?

A
  • use with symmetric distributions without extreme values
  • mean is highly sensitive to outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

when should you use median and how is it affected by outliers?

A
  • use for skewed distributions or data with outliers
  • median is less affected bu outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

when should you use mode and how is it affected by outliers?

A
  • use when identifying the most common category or score
  • mode is not influenced by outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is variance in technical terms?

A
  • the average of the squared differences from the mean
  • the square of the standard deviation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what statistical tests can be used to find outliers?

A

z-score and IQR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

describe histograms

A
  • displays continuous data
  • has touching bars representing frequency across intervals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what does skewness measure?

A

measure asymmetry of distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what does kurtosis measure?

A

“tailedness” or sharpness of the peak of a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

describe a scatterplot

A

displays the relationship between two numerical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what are “measures”?

A
  • quantitative way of representing or summarizing aspects of data
  • tools used to describe, analyze, and make sense of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is internal consistency?

A

the extent to which different items that are trying to measure the same variable are truly related and report similar results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is construct validity?

A

whether the test accurately measures the underlying theoretical concept it is trying to measure

18
Q

what is content validity?

A

whether the test covers all the important aspects of that construct

19
Q

what is criterion-related validity?

A

how well one measure (predictor) correlates with a related outcome measure (the criterion) that is considered an established standard or a relevant benchmark

20
Q

what is a null hypothesis?

A

a statement that claims there is NO difference, NO effect, NO relationship between variable

21
Q

what is a type I error?

A

rejecting the null hypothesis when it is true

22
Q

what is type II error?

A

failing to reject the null hypothesis when it is false

23
Q

describe symbol, meaning, and interpretation of significance level

A
  • α
  • probability of making a type I error
  • risk of false positive
24
Q

describe symbol, meaning, and interpretation of confidence level

A

1-α
- prbability of not making type I error
- certainty in correctly accepting a true null hypothesis

25
Q

describe symbol, meaning, and interpretation of type II error rate

A

β
- probability of making type II error
- risk of a false negative

26
Q

when do you reject the null hypothesis?

A

when your statistic is larger than your critical value

27
Q

what is a critical value?

A

a threshold that defines the boundary of a statistical test’s rejection region

28
Q

what is another name for the normal distribution?

A

Gaussian distribution

29
Q

does the normal distribution curve ever touch the horizontal axis?

A

no

30
Q

what is the purpose of a t-test?

A

to see if there is a significant difference between means or populations

31
Q

what is the purpose of an ANOVA?

A

to test for significant differences between 3 or more groups

32
Q

what is a factorial ANOVA used for?

A

to test for the effects of two or more independent variables (factors) on a dependent variable

33
Q

what does the number of factors in an ANOVA indicate?

A

the number of factors indicates the number of main effects

34
Q

what is the purpose of correlation?

A

it measures the direction and strength of a linear relationship between two variables

35
Q

what does the p-value of a correlation tell you?

A

the probability of observing a correlation at least as extreme as the calculated coefficient (assuming there is no actual correlation)

36
Q

what are the limitations of correlation?

A
  • sensitive to outiers
  • curvilinear relationships (correlation can only measure linear relationships)
  • restriction of range
37
Q

what are the two uses of a chi-squared test?

A

1) test for indepndence
2) test for goodness of fit

38
Q

what kind of data is typically used with a chi-squared test?

A

catagorical data

39
Q

what is Y-hat in linear regressions?

A

the predicted value of Y

40
Q

what are residuals in linear regressions?

A

observed value - predicted value

41
Q

what does R-squared represent in linear regression?

A

how well the independent variable exaplains the variance in the dependent variable