Introduction and Basics- Lecture 1 Flashcards

Question

what is variance?

Answer 1

variance of set of observations is the average of the squares of the deviations of the observations from their mean

Answer 2

square root of the variance and variance showing how the data varies across collection of sample set. Large standard deviation indicates data points are far from the mean

Answer 3

if there are 9 samples, can be assumed as 1 dataset with N=9 BUT can also be assumed as 3 datasets from 3 independent studies, N=3 mean remains the same but changes standard deviation

Answer 4

standard deviation of sample means and a measure of how representative a sample is likely to be of the population

Answer 5

a lot of variability between the means of different samples, thus sample might not be representative of population

Answer 6

most sample means are similar to the population mean, thus sample is accurate reflection of population

Answer 7

Histogram, but number of bins are important. Histograms not good when you dont have enough data. (too many bins = noisy, too few bins can mask out important features)

Answer 8

1. central bellcurve shape uniform 2. shifted to left (positive) or right (negative) 3. 2 efective central values and 2 populations of responses

Answer 9

used to convert any normal distribution such that: - mean = 0 - standard deviation - 1 important z score: +- 1.96 (removes outlying data- 2.5%)

Answer 10

𝑧=(𝑋−𝑋 ̅)/𝑠

Answer 11

nothing is happening

Answer 12

what you're expecting to happen is happening, trying to disprove null hypothesis

Answer 13

Probability that the observed statistic is equal to or more extreme, than observed result then Ho is true. trying to find at which point you have enough evidence against null hypothesis to support actual alternate hypothesis.

Answer 14

swinging against null hypothesis, further towards end of bell curve, stronger evidence against null hypothesis

Answer 15

in particular condition experiment is set up with, only going one way

Answer 16

not null hypothesis, can go up or down

Answer 17

either + or - but not both (p<0.5) if z>- 1,645)

Answer 18

number that separates blue zone (ends of bell curve) from the middle. To be statistically significant, z score needs to be in the blue curve

Answer 19

large p value, not disproving null hypothesis. Not good indication is that is it 2 significant means.

Answer 20

If 2 sided, P= 0.037 x 2 = 0.07 Does not swing either way, thus not surveying enough people or greater number of population. With this p-value, cannot say we have enough evidence against null hypothesis. Right on borderline and no strong evidence in either direction

Answer 21

a- level = indication error and is set before collection of data, to help set up experiment. defines error we are willing to make to say we made a difference. If we're wrong, its an alpha error p- value = calculated after we gather data Calculated probability of a mistake by saying it works e.g level of significance. Descrives percent of population/ area under the curve in the tail that is beyond our statistic

Answer 22

P ≤ a so p is smaller than alpha

Answer 23

P>a if a -level is small and tight data set, harder to reject null hypothesis

Answer 24

probability of erroneously retaining Ho

Answer 25

erroneous rejection of true Ho

Answer 26

erroneous retention of false Ho

Answer 27

1- B (beta) probability of avoiding a type II error (retaining a false null hypothesis 1- B = Pr (reject Ho i I Hfalse)

Answer 28

used to make a conclusion about a population based on a sample dataset

Answer 29

involves the organisation, summarization, and display of data

Answer 30

summary measure of the overall level of a dataset (mean, median, mode, geometric mean)

Answer 31

the median is less sensitive to outliers (extreme scores) than the mean and thus a better measure than the mean for highly skewed distributions

Answer 32

measures the amount of scatter in a dataset

Answer 33

crude measure of variability

Answer 34

the data points are far from the mean

Answer 35

used to quanitfy the idea of statistical significance of evidence

Answer 36

probability that results gained by chance and chance is the only factor

Answer 37

one group; no concurrent control group

Answer 38

two samples; data points uniquely matched

Answer 39

two samples, separate (unrelated) groups

Answer 40

single sample as just one experiment done and one population

Answer 41

paired sample as comparison and 2 pairs

Answer 42

independent as same thing not measured twice. Two groups with different reatments

Answer 43

number of observations in the data that are free to vary when estimating statistical parameters

Answer 44

the smaller of (n1 – 1) or (n2 – 1)

Answer 45

(𝑥 ̄_1−𝑥 ̄_2)±(𝑡_(𝑑𝑓,1−𝛼/2))(𝑆𝐸_(𝑥 ̄_1−𝑥 ̄_2 ))

Answer 46

µ is the population mean and X ̅ is the sample mean

Answer 47

The probability of erroneously rejecting the null hypothesis

Answer 48

The experimental design when defining α A P-value that is equal to or smaller than α A z-score above the critical value (in a one sided test)

Answer 49

Degrees of freedom and α

Answer 50

Family wise error rate

Answer 51

(1−0.05)3 = 0.857, so P (reject at least one) = 1−0.847 = 0.143 - This is the family-wise error rate.

Answer 52

family wise error rate

Answer 53

The significance level defines the distance the sample mean must be from the null hypothesis to be considered statistically significant.

Answer 54

The confidence level defines the distance for how close the confidence limits are to sample mean.

Answer 55

1.If the P value is less than your significance (alpha) level, the hypothesis test is statistically significant. 2.If the confidence interval does not contain the null hypothesis value, the results are statistically significant. 3.If the P value is less than alpha, the confidence interval will not contain the null hypothesis value.

Answer 56

confidence intervals to assess the precision of the sample estimate. For a specific variable, a narrower confidence interval suggests a more precise estimate of the population parameter than a wider confidence interval

Answer 57

Test for overall significance using a technique called “Analysis of Variance” (ANOVA) Do post hoc comparison on individual groups

Answer 58

descriptive

Answer 59

inferential

Answer 60

inferential

Answer 61

correlation and regression for bivariate multiple regression for multivariate

Answer 62

The probability that the observed test statistic is equal to or more extreme, than the observed result when Ho is true

Answer 63

Probability of observed statistic is very low.

Answer 64

Null hypothesis has different arrangements. Cannot treat them as independent samples, have to compare 1, 2 and 3,not each thing individually, because end up acrewing random differences.

Introduction and Basics- Lecture 1 Flashcards

(97 cards)