Benchmark info Flashcards

(41 cards)

1
Q

describe statistics

A

the science of developing methods for collecting, analyzing, interpreting, and presenting data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the main goal of statistics

A

to use information from a sample to make inferences about the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a variable?

A

a measurable attributs that can vary across entities and is represented by a column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is an observation?

A

the complete set of recorded values for each entity, typically encapsulated in a single row of dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is correlation?

A

it measures the association between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

when should you use mean and how is it affected by outliers?

A
  • use with symmetric distributions without extreme values
  • mean is highly sensitive to outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

when should you use median and how is it affected by outliers?

A
  • use for skewed distributions or data with outliers
  • median is less affected bu outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

when should you use mode and how is it affected by outliers?

A
  • use when identifying the most common category or score
  • mode is not influenced by outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is variance in technical terms?

A
  • the average of the squared differences from the mean
  • the square of the standard deviation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what statistical tests can be used to find outliers?

A

z-score and IQR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

describe histograms

A
  • displays continuous data
  • has touching bars representing frequency across intervals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what does skewness measure?

A

measure asymmetry of distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what does kurtosis measure?

A

“tailedness” or sharpness of the peak of a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

describe a scatterplot

A

displays the relationship between two numerical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what are “measures”?

A
  • quantitative way of representing or summarizing aspects of data
  • tools used to describe, analyze, and make sense of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is internal consistency?

A

the extent to which different items that are trying to measure the same variable are truly related and report similar results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is construct validity?

A

whether the test accurately measures the underlying theoretical concept it is trying to measure

18
Q

what is content validity?

A

whether the test covers all the important aspects of that construct

19
Q

what is criterion-related validity?

A

how well one measure (predictor) correlates with a related outcome measure (the criterion) that is considered an established standard or a relevant benchmark

20
Q

what is a null hypothesis?

A

a statement that claims there is NO difference, NO effect, NO relationship between variable

21
Q

what is a type I error?

A

rejecting the null hypothesis when it is true

22
Q

what is type II error?

A

failing to reject the null hypothesis when it is false

23
Q

describe symbol, meaning, and interpretation of significance level

A
  • α
  • probability of making a type I error
  • risk of false positive
24
Q

describe symbol, meaning, and interpretation of confidence level

A

1-α
- prbability of not making type I error
- certainty in correctly accepting a true null hypothesis

25
describe symbol, meaning, and interpretation of type II error rate
β - probability of making type II error - risk of a false negative
26
when do you reject the null hypothesis?
when your statistic is larger than your critical value
27
what is a critical value?
a threshold that defines the boundary of a statistical test's rejection region
28
what is another name for the normal distribution?
Gaussian distribution
29
does the normal distribution curve ever touch the horizontal axis?
no
30
what is the purpose of a t-test?
to see if there is a significant difference between means or populations
31
what is the purpose of an ANOVA?
to test for significant differences between 3 or more groups
32
what is a factorial ANOVA used for?
to test for the effects of two or more independent variables (factors) on a dependent variable
33
what does the number of factors in an ANOVA indicate?
the number of factors indicates the number of main effects
34
what is the purpose of correlation?
it measures the direction and strength of a linear relationship between two variables
35
what does the p-value of a correlation tell you?
the probability of observing a correlation at least as extreme as the calculated coefficient (assuming there is no actual correlation)
36
what are the limitations of correlation?
- sensitive to outiers - curvilinear relationships (correlation can only measure linear relationships) - restriction of range
37
what are the two uses of a chi-squared test?
1) test for indepndence 2) test for goodness of fit
38
what kind of data is typically used with a chi-squared test?
catagorical data
39
what is Y-hat in linear regressions?
the predicted value of Y
40
what are residuals in linear regressions?
observed value - predicted value
41
what does R-squared represent in linear regression?
how well the independent variable exaplains the variance in the dependent variable