Statistics Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Accuracy

A
  • how close the measurement is to the true value
  • compared to a gold standard
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

alpha error

A

probability of Type I error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

beta

A

type II error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

bias

A

outcome differs from the correct answer

in a systematic non-random way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

biased studies

A
  • subjects in group 1 differ from subjects in group 2
  • in a meaningful way that will affect the conclusions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

cohort

A

its a group with common characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

confidence interval

A
  • The 95% confidence interval defines a range of values that you can be 95% certain contains the population mean. With large samples, you know that mean with much more precision than you do with a small sample, so the confidence interval is quite narrow when computed from a large sample.
  • the true parameter (such as the mean) is expected to fall within this range
  • Most commonly, the 95% confidence level is used
  • Factors affecting the width of the confidence interval include
    • the size of the sample,
    • the confidence level,
    • the variability in the sample.
  • A larger sample size normally will lead to a better estimate of the population parameter.
  • is a range of likely values for the population parameter based on: the point estimate, e.g., the sample mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

counfounding variable or factor

A
  • when 2 variables are related to a 3rd variable
  • one might or might not know the factor is related to the 2 principle variables
    *
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Dependent variable

A
  • its the outcome or effect
    • for example visual acuity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

independent variable

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

null hypothesis

A

there is no significant difference between specified populations, any observed difference being due to sampling or experimental error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

normal

A

its a Gaussian or bell-shaped curve distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

power

A

probability of finding a true difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

regression

A

how much a dependent variable Y changes based on changes of the independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

type I error

A
  • if we reject the null hypothesis when in fact it is true
  • its alpha error
  • we reject the null hypothesis if p<0.05
    • so the groups are different
    • but in reality, they are not
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

type II error

A
  • its the beta error
  • when we accept the null hypothesis when in fact, it is FALSE
    *
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what are the 2 ways of missinterpreting p value?

and what to do to avoid missinterpretation?

A
  • a p value >0.05
    • thought to be non-significant
    • “no effect” or “no difference”
      • THE CORRECT interpretation should be:
        • The is no strong evidence that the intervention has an effect
        • no strong evidence that there is a difference
    • ALWAYS CHECK THE P VALUE WITH THE CI
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what to look for in the Confidence interval?

A
  • you want a NARROW RANGE
    • the wider the range, the worse
    • the RESULT falls within that range with X% of conficence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

give examples of categorical variables

A
  • male/female
  • republican/democrat
  • pass/fail
  • yes/no
  • etc…..
21
Q

what test to use to measure the association between categorical variables?

A

Chi square

22
Q

what test to use to the correlation between

continuous variables?

A

correlation

regression

23
Q

which is the most common measure of correlation?

how do you interpret?

A
  • the R value (Pearson coefficient
    • R ranges from -1 to +1
    • 0 means no correlation
    • the farther from 0 the better
24
Q

what test to use to measure association

between categorical variables

when to use each?

A
  • chi square and fisher exact test
  • if small numbers (small n) use FISHER
  • the chi squre is an aprox of the fisher so use chi2 when you have large numbers
25
Q

when to sue a t-test?

A
  • use t-test to determine if two different set are significantly different from each other
    *
26
Q

what are the 3 types of t-test?

A
  • one-sample t-test
    • compared the mean of the sample to a reference mean of a know population
  • independent samples t-test
    • compares 2 means from 2 independent samples
  • paired samples t-test
    • compares 2 means that are from repeated measurements of same participants
27
Q

what to do if you want to compare the means or the differences of more than 2 groups?

what if you do multiple t-test? what eror you incur in?

A
  • use ANOVA
    • doing multiple t-test comparisons would result in a type I error
28
Q

what number is the enemy of confidence interval?

A

1

29
Q

if you can not mask the doctor or the patient what can you MASK at least?

A

the person that is making the fiinal outcome

30
Q

how to calculate the degrees of freedom?

A

degrees of freedom = n-1

Degrees of freedom can be described as the number of scores that are free to vary. For example, suppose you tossed three dice. The total score adds up to 12. If you rolled a 3 on the first die and a 5 on the second, then you know that the third die must be a 4 (otherwise, the total would not add up to 12). In this example, 2 die are free to vary while the third is not. Therefore, there are 2 degrees of freedom.

In many situations, the degrees of freedom are equal to the number of observations minus one. Thus, if the sample size were 20, there would be 20 observations; and the degrees of freedom would be 20 minus 1 or 19.

31
Q

what is the difference between a continious variable and a discrete?

A
  • continuos has unlimited possibilities
    • weight for ex 120; 120.1; 120.12 etc
  • discrete is limited
    • children in the family 1, 2, 3, 4
      *
32
Q

what is the goal of research?

A

THE GOAL ID TO REJECT THE NULL HYPOTHESIS

thats it! (although the goal is to reject with what level of confidence)

For example:

null hypothesis: the vessel density is similar in RCD and CRD patients

33
Q

what characteristics are important in a hypothesis?

A
  • its an statement - not a question
  • based on the research question
  • is it testable?
  • is it informative?
  • s it relevant?
    • TRY to make it simple (a 1:1) hypothesis
    • the more variables the more difficult to reject
34
Q

what is a type I error?

A

when you reject the null hypothesis when in fact is true…

for example:

HO= there is not difference in vessel density between RCD and CRD

my results rejects and says THERE is a difference (when there is not)

35
Q

what is a type II error?

A

when you DONT reject the null hypothesis when in fact it is not true

for example:

HO = there is no difference in vessel density among RCD and CRD patients

Result: there is not difference

the real fact is that there is a difference.

36
Q

what is the difference between type 1 and type 2 error?

A

type I rejects HO

Type 2 accepts HO

type 2 es complaciente y la acepta

37
Q

when to use CI of 95% vs a higher CI?

A

use 95% for all

except in something super important like new medicines, then use a 99% CI

38
Q

what is parametric and non-parametric data?

A
  • parametric means that the data FOLLOWS PARAMETERS
    • follows a predictable parameter
    • in other words is normally distributed
  • non-parametric does not follow normal distribution
    • random values in the dispersion graph
39
Q

how to describe the following data:

  1. normally distributed data -
  2. non-parametric -
  3. categorical -
  4. over time -
  5. two related variables -
  6. non-parametric variables -
  7. categorical -
A
  1. normally distributed data - mean, SD, min, max
  2. non-parametric - median, histogram
  3. categorical - frequencies
  4. over time - line plot / Time series
  5. two related variables - Person correlation
  6. non-parametric variables - Spearman’s correlation
  7. categorical - crosstabulations
40
Q

how to describe categorical data?

A
  • normally distributed data - mean, SD, min, max
  • non-parametric - median, histogram
  • categorical - frequencies
  • over time - line plot / Time series
  • two related variables - Person correlation
  • non-parametric variables - Spearman’s correlation
  • categorical - crosstabulations
41
Q

how to describe the relation between non-parametric variables

A
  • normally distributed data - mean, SD, min, max
  • non-parametric - median, histogram
  • categorical - frequencies
  • over time - line plot / Time series
  • two related variables - Person correlation
  • non-parametric variables - Spearman’s correlation
  • categorical - crosstabulations
42
Q

how to describe non-parametric data?

A
  • normally distributed data - mean, SD, min, max
  • non-parametric - median, histogram
  • categorical - frequencies
  • over time - line plot / Time series
  • two related variables - Person correlation
  • non-parametric variables - Spearman’s correlation
  • categorical - crosstabulations
43
Q

what test to do in the following scenarios:

  1. comparing more than 2 groups in non-parametirc data
A
  1. comparing single sample to normative previously published sample - single t-test
  2. comparing 2 groups paired (normal, not normal and categorical
    1. normal - paired t-test
    2. not normal - Wilcoxon
    3. categorical - McNemar
  3. comparing 2 independent samples
    1. normal - independent t-test
    2. not notmal - Mann-whitney
    3. Categorical - ChiSquare
  4. comparing more than 2 groups - repeated measures
    1. normal - ANOVA
    2. not-normal FRIEDMAN ANOVA
    3. categorical - Cochran’s Q
  5. more than 2 groups - independent
    1. normal - one way ANOVA
    2. not normal - Kruskal Wallis
    3. categorical - chi Square
44
Q

what test to do in the following scenarios:

comparing 2 grousp paired non-parametric data

A

wilcoxon

45
Q

what test to do in the following scenarios:

comparing catagorical variables

A

chi square

46
Q

what is the normal distribution curve?

A

is the most important and most widely used distribution in statistics. It is sometimes called the “bell curve,” although the tonal qualities of such a bell would be less than pleasing. It is also called the “Gaussian curve” after the mathematician Karl Friedrich Gauss.

47
Q

in the normal gausian curve or bell curve, where do the majority of values fall inito?

A
  • the majority in the +/- 2SD
  • the center is the mean
  • 95% of values lie within +/- 1.96 SD from the mean
48
Q

in a normal distribution curve or bell curve or gausian curve, what is the value of 1 SD, 2 SD and 3 SD?

A
  • 1 SD = 68%
  • 2 SD = 95%
  • 3 SD = 99%