Statistics Flashcards

Question

What is inferential statistics used for?

Answer 1

To draw inferences about the population from the sample

Answer 2

- Data can take on 1 of a number of categories
- Number of categories is small
- Use a frequency table

Answer 3

To see which category is most common, least common and which categories occur more frequently

Answer 4

Cannot immediately see what share of the sample is contained in each category

Answer 5

Percentages

Answer 6

- Bar charts
- Pie charts

Answer 7

Number of occurrences

Answer 8

Categorical data

Answer 9

Plot histograms

Answer 10

Density = proportion in bin/bin width
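The density formula above can be checked with a short sketch. The function name `histogram_density` and the example values are illustrative assumptions, not part of the original cards.

```python
def histogram_density(values, bin_edges):
    """Return the density of each bin: (proportion of values in bin) / (bin width)."""
    n = len(values)
    last = len(bin_edges) - 2
    densities = []
    for i in range(len(bin_edges) - 1):
        lo, hi = bin_edges[i], bin_edges[i + 1]
        # Bins are half-open [lo, hi), except the last one, which includes its upper edge.
        if i == last:
            count = sum(1 for v in values if lo <= v <= hi)
        else:
            count = sum(1 for v in values if lo <= v < hi)
        densities.append((count / n) / (hi - lo))
    return densities


# Hypothetical data with two bins of unequal width (4 and 6).
values = [1, 2, 2, 3, 7, 8]
edges = [0, 4, 10]
densities = histogram_density(values, edges)
```

Because each density is divided by its bin width, density times width summed over all bins equals 1, which is what makes bins of different widths comparable.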

Answer 11

Relative frequency of observations

Answer 12

Different bin widths

Answer 13

Defines where data are located in the range of possible values

Answer 14

- Mean
- Mode
- Median

Answer 15

Equal to the sum of the values divided by the number of values

Answer 16

- Rank data in order
- Median is the middle number
- If there is an even number of data points, there is no single middle point, so take the mean of the 2 middle values

Answer 17

Most commonly occurring value
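The mean, median and mode described in the preceding answers can be sketched as follows. The function names and the example data are assumptions made for illustration.

```python
from collections import Counter


def mean(values):
    # Sum of the values divided by the number of values.
    return sum(values) / len(values)


def median(values):
    # Rank the data, then take the middle value
    # (or the mean of the two middle values when the count is even).
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def mode(values):
    # Most commonly occurring value (the first one seen wins a tie).
    return Counter(values).most_common(1)[0][0]


data = [2, 3, 3, 5, 7, 10]  # hypothetical data
print(mean(data), median(data), mode(data))  # prints: 5.0 4.0 3
```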

Answer 18

Technical name for the spread or variability of the data

Answer 19

- Standard deviation
- Interquartile range
- Range

Answer 20

Equal to the square root of the mean of the squared differences between the values and the mean
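As a minimal sketch of this definition: the function below divides by the number of values, as the answer states. The `sample` flag, which divides by n - 1 instead (the usual choice when estimating from a sample), is an addition for illustration and is not mentioned on the card.

```python
import math


def standard_deviation(values, sample=False):
    m = sum(values) / len(values)
    squared_diffs = [(v - m) ** 2 for v in values]
    # Divide by n (as on the card), or by n - 1 for the sample version.
    divisor = len(values) - 1 if sample else len(values)
    return math.sqrt(sum(squared_diffs) / divisor)


data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data, mean 5
print(standard_deviation(data))  # prints: 2.0
```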

Answer 21

Range covering the middle of the data, with 25% of the data above it and 25% of the data below it

Answer 22

Simply the smallest and largest values

Answer 23

Box and whiskers plots

Answer 24

Can be easier to compare between groups

Answer 25

More than 1.5 IQRs above the upper quartile
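The rule above can be sketched with the standard library's `statistics.quantiles`. The function name `upper_outliers` and the data are hypothetical, and the exact quartile values depend on the quantile method used, so treat the threshold as approximate.

```python
import statistics


def upper_outliers(values):
    # quantiles(..., n=4) returns the three quartile cut points.
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    threshold = q3 + 1.5 * iqr
    # Flag values more than 1.5 IQRs above the upper quartile.
    return [v for v in values if v > threshold]


data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 50]  # hypothetical data
print(upper_outliers(data))  # prints: [50]
```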

Answer 26

- Blood pressure
- BMI
- Size of an orange

Answer 27

- Number of headaches
- Number of people with diabetes
- Number of oranges

Answer 28

Categorical

Answer 29

- Ethnicity
- Blood types
- Variety of orange

Answer 30

- Disease severity
- Satisfaction rating
- Orange quality rating

Answer 31

Equal to the standard deviation of the sample divided by the square root of the sample size
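A minimal sketch of this formula, using `statistics.stdev` for the sample standard deviation. The function name and data are illustrative assumptions.

```python
import math
import statistics


def standard_error(sample):
    # Sample standard deviation divided by the square root of the sample size.
    return statistics.stdev(sample) / math.sqrt(len(sample))


sample = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical sample
se = standard_error(sample)
```

With the standard deviation held fixed, quadrupling the sample size halves the standard error, since the denominator grows with the square root of n.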

Answer 32

With increasing standard deviation

Answer 33

The more variability there is in the population, the more uncertainty there is in our estimate

Answer 34

With increasing sample size

Answer 35

The bigger the sample size, the more information we have and the more precise our estimate

Answer 36

- T-tests are used to test whether the means in two groups are different from each other
- Continuous data
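To make the idea concrete, here is a sketch of the test statistic for the standard (equal-variance) two-sample t-test. It computes only the statistic and its degrees of freedom; turning these into a p-value needs the t distribution, which the standard library does not provide. The function name and data are hypothetical.

```python
import math
import statistics


def pooled_t_statistic(group_a, group_b):
    n_a, n_b = len(group_a), len(group_b)
    mean_a, mean_b = statistics.mean(group_a), statistics.mean(group_b)
    var_a, var_b = statistics.variance(group_a), statistics.variance(group_b)
    df = n_a + n_b - 2
    # Pool the two sample variances, weighting each by its degrees of freedom.
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    # Standard error of the difference between the two means.
    se_diff = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / se_diff, df


t, df = pooled_t_statistic([1, 2, 3, 4, 5], [2, 3, 4, 5, 6])
print(t, df)  # prints: -1.0 8
```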

Answer 37

Testing whether the difference in our sample reflects a difference in the population

Answer 38

As the standard deviation of the population increases, the precision of our estimate gets worse; as our sample sizes go up, the precision of our estimates gets better

Answer 39

- Data in each group are normally distributed in the population
- Variance (SD) is constant across groups
- Data points are independent of each other

Answer 40

- Before and after data on the same people
- Small sample from the same area
- Using the same piece of equipment when collecting subsets of our data

Answer 41

Version of the t-test that assumes unequal, rather than equal, variances
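The unequal-variance version is commonly known as Welch's t-test. The sketch below computes its statistic and the Welch-Satterthwaite approximation to the degrees of freedom; the function name and data are assumptions for illustration.

```python
import math
import statistics


def welch_t_statistic(group_a, group_b):
    n_a, n_b = len(group_a), len(group_b)
    # Squared standard error of each group mean, kept separate rather than pooled.
    se2_a = statistics.variance(group_a) / n_a
    se2_b = statistics.variance(group_b) / n_b
    t = (statistics.mean(group_a) - statistics.mean(group_b)) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df


# Hypothetical groups with clearly different spreads.
t_welch, df_welch = welch_t_statistic([4, 5, 6, 5, 5], [1, 9, 3, 12, 0])
```

When the two groups have equal sizes and equal sample variances, this gives the same statistic and degrees of freedom as the equal-variance t-test.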

Answer 42

Where standard deviations from two different groups are quite different

Answer 43

- Normally distributed data in each group
- Independent data points

Answer 44

- Normally distributed data in each group
- Constant variance across groups

Answer 45

- More than two groups

Answer 46

Look overall at the data and see if there are any differences by groups, rather than comparing individual groups to each other

Answer 47

Analysis of Variance
- Partitions the variance in the data into that which can be attributed to differences between groups and that which is left over
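The partitioning can be sketched as a one-way ANOVA F statistic: the between-group mean square divided by the left-over (within-group) mean square. The function name and data are hypothetical, and converting F to a p-value needs the F distribution, which is not in the standard library.

```python
def anova_f_statistic(groups):
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    group_means = [sum(g) / len(g) for g in groups]
    # Variability attributable to differences between group means.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means))
    # Variability left over within each group.
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, group_means) for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


f_stat = anova_f_statistic([[1, 2, 3], [2, 3, 4], [3, 4, 5]])
print(f_stat)  # prints: 3.0
```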

Answer 48

- p-value telling us how much evidence there is of some variability between groups

Answer 49

- Occur after an overall assessment
- Occur after ANOVA

Answer 50

- Data in each group are normally distributed in the population
- Variance (SD) is constant across groups
- Data points are independent of each other

Answer 51

Paired t-test

Answer 52

Mann-Whitney test

Answer 53

An alternative to a t-test when we have non-normally distributed data in each group. Compares continuous data between two groups.

Answer 54

Two groups have the same mean in the population

Answer 55

If we select one value at random from each group, the value from the first group will be larger than the value from the second group 50% of the time
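This 50% idea can be illustrated by comparing every value in one group with every value in the other and counting how often the first is larger. This is a sketch of the quantity the statement describes, not a full Mann-Whitney test; the function name, the half-credit for ties, and the data are assumptions.

```python
def probability_first_larger(group_a, group_b):
    wins = 0.0
    for x in group_a:
        for y in group_b:
            if x > y:
                wins += 1.0
            elif x == y:
                wins += 0.5  # assumption: a tie counts as half a win
    # Share of all (x, y) pairs in which the first group's value is larger.
    return wins / (len(group_a) * len(group_b))


p_larger = probability_first_larger([1, 3, 5], [2, 4, 6])  # hypothetical groups
```

A value near 0.5 is what the statement above expects; values far from 0.5 suggest one group tends to produce larger values.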

Answer 56

Make no assumptions about the form of the data

Answer 57

Wilcoxon test

Answer 58

Paired data

Answer 59

More than 2 groups

Answer 60

- Wilcoxon signed-rank test
- Kruskal-Wallis test

Answer 61

Chi-squared test

Answer 62

- Data points are independent
- Data are described by the binomial distribution
- Expected counts of at least 5
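The expected counts mentioned above come from the row and column totals of the table. A minimal sketch for a contingency table follows; the function name and the table are hypothetical, and the p-value step (which needs the chi-squared distribution) is left out.

```python
def chi_squared(table):
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    # Expected count for a cell = row total * column total / grand total.
    expected = [[r * c / total for c in col_totals] for r in row_totals]
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    return statistic, expected


statistic, expected = chi_squared([[10, 20], [20, 10]])  # hypothetical 2x2 table
# Check the "at least 5" condition on the expected counts.
counts_ok = all(e >= 5 for row in expected for e in row)
```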

Answer 63

Fisher's exact test

Answer 64

Pearson's correlation coefficient aka rho

Answer 65

```
1 = Perfectly correlated
0 = No correlation
-1 = Perfectly negatively correlated
```
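A short sketch of how the coefficient is computed shows where the values 1 and -1 come from. The function name `pearson_r` and the data are illustrative assumptions.

```python
import math


def pearson_r(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Sum of products of deviations from the means (unnormalised covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


xs = [1, 2, 3, 4]
r_up = pearson_r(xs, [2, 4, 6, 8])    # points on a rising line: r close to 1
r_down = pearson_r(xs, [8, 6, 4, 2])  # points on a falling line: r close to -1
```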

Answer 66

- Data points are independent
- One set of data is normally distributed for any given value of the other, with constant variance
- Relationship is linear

Answer 67

Similar to correlation: looks at the relationship between two continuous variables. Fits a functional form y = bx + c
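A sketch of fitting y = bx + c by ordinary least squares, which is the usual fitting method for simple linear regression (the card itself does not name one). The function name and data are hypothetical.

```python
def fit_line(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Slope: co-variation of x and y divided by the variation in x.
    b = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    # Intercept: the fitted line passes through the point of means.
    c = mean_y - b * mean_x
    return b, c


b, c = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(b, c)  # prints: 2.0 1.0
```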

Answer 68

- p-value
- R-squared value

Answer 69

Proportion of variance
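Assuming this answer refers to the R-squared value, it can be read as the proportion of variance in the outcome that the fitted line accounts for. The sketch below computes it from observed and predicted values; the function name, the data, and the line y = 2x + 1 used for the predictions are all hypothetical.

```python
def r_squared(observed, predicted):
    mean_obs = sum(observed) / len(observed)
    # Total variation of the outcome around its mean.
    ss_total = sum((y - mean_obs) ** 2 for y in observed)
    # Variation left unexplained by the predictions.
    ss_residual = sum((y - p) ** 2 for y, p in zip(observed, predicted))
    return 1 - ss_residual / ss_total


observed = [3, 5, 7, 10]
predicted = [2 * x + 1 for x in [1, 2, 3, 4]]  # hypothetical line y = 2x + 1
r2 = r_squared(observed, predicted)
```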

Answer 70

- Data points are independent
- Outcome data is normally distributed for any given value of the exposure
- Outcome data has a constant variance for all values of the exposure
- Relationship is linear

Answer 71

Spearman's correlation

Answer 72

Fisher's exact test

Answer 73

Mann-Whitney test

Answer 74

Kruskal-Wallis test

Answer 75

T-test for unequal variance

Answer 76

Paired t-test

Answer 77

Repeated measures ANOVA

Answer 78

Range of plausible values for the thing we are trying to estimate
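One common way to build such a range for a mean is the estimate plus or minus a multiple of its standard error. The sketch below uses a normal approximation (about 1.96 standard errors for 95%), which is an assumption; for small samples a t-based multiplier is usually preferred. The function name and data are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def mean_confidence_interval(sample, level=0.95):
    # Normal-approximation multiplier, roughly 1.96 for a 95% interval.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    standard_error = stdev(sample) / math.sqrt(len(sample))
    centre = mean(sample)
    return centre - z * standard_error, centre + z * standard_error


low, high = mean_confidence_interval([2, 4, 4, 4, 5, 5, 7, 9])
```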

Answer 79

- Standard errors
- Standard deviations
- Confidence intervals

Answer 80

- Meaning of the different symbols
- What the error bars represent
- Provide the p-value
- State what statistics are used
- Describe everything on the graphs
- Sample size

(120 cards)