Midterm Flashcards

1
Q

Concerned with presentation, organization and summarization of data. Organizing and graphing the data to get an idea of what they show

A

Descriptive statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Allow us to generalize from sample of data to a larger group of subjects. Using results of a study to infer how everyone else will be affected

A

Inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Includes various methods of organizing and graphing the data to get an idea of what they show. Pictures, graphs, etc.

A

Descriptive statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Allow us, as researchers, to generalize from our sample of data to a larger group or population. Are used to determine the probability that a conclusion based on analysis of data from a sample is true
*Subject to random error

A

Inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Whatever is being observed or measured. May be dependent or independent

A

Variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

1) the outcome of interest and

2) changes in response to some intervention.

A

Dependent variable

dependent changes in response to the independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

1) the intervention

2) what is being manipulated by the researcher.

A

Independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Can have only one of a limited set of values. Have values that can assume only whole numbers.
Ex..number of kids (2, 3, or 4. can’t have 2.13 kids), hair color, gender. There is no inbetween.

A

Discrete variables/data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

May have any value, within a defined range.

Ex…blood pressure, weight, serum levels, time, height.

A

Continous variables/data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Central or typical value for a probability distribution or in simpler terms, averages.

A

Central tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Is the measure of central tendency for interval and ratio data

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Used as a measure of central tendency when the mean would be meaningless, as with ordinal data… the value such that half of the data points are above and half are below it.

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The only measure that may be used with nominal data and consists of the most frequently occurring category. Nominal data (ex: male vs. female) is derived from qualitative measures.

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

refers to how closely the data cluster around the measure of central tendency.

A

A measure of dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Used with ordinal data, is the difference between the highest and lowest values.

  • always a single number
  • is easy to calculate but is unstable and it isn’t of much use
A

Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

measures how far a set of numbers is spread out

*if small, indicates data points are very close to the mean

A

Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Is the square root of the variance of a random variable or statistical population.. is expressed in the same units as the original measurement.
*Smaller indicates closer to the mean

A

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Normal distributions are symmetrical around their…

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The mean, median, and mode in a normal distribution are…

A

Equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

The area under the normal curve is equal to….

A

1.0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Normal distributions are more dense in the ________ and less dense in the ______

A

More dense= CENTER
Less dense= TAILS

(Bell curve)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Normal distribution is defined by these 2 parameters…

A

Mean

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

_____% of the area of a normal distribution is within one standard deviation of the mean.

A

68

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Approx _____% of the area of a normal distribution is within two standard deviations of the mean.

A

95

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

3 different shaped lines can each be normal distributions with different shapes, what makes them have a different shape?

A

Different standard deviations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q
  • The mean, median and mode all have the same value
  • The curve is symmetric around the mean
  • The tails of the curve get closer and closer to the X-axis as you move away from the mean but they never quite reach it.
  • In mathematics the curve approaches the X-axis asymptotically
A

Properties of the normal curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Many statistical tests assume a….

A

Normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

The mean and the variance are…

A

Independent!

You can change one and the other will stay the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Many natural phenomena are in fact…

A

Normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Whatever the actual distribution of data, if we draw a large number of samples of reasonable size, the means of those samples will always be normally distributed. This fact arises from the…

A

Central limit theorem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

states that if we draw equally sized samples from a non-normal distribution, the distribution of the means of these samples will still be normal, as long as the samples are large enough.
How large is large enough? It depends on the shape of the distribution. Generally anything over 30 is enough

A

Central Limit Theorem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the application of Central Limit Theorem…

A

Allows us to assume normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

relative likelihood that a certain event will or will not occur, relative to some other events.

A

Probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

of an event is an “estimate” that the event will happen based on how often the event occurs after collecting data or running an experiment (in a large number of trials). It is based specifically on direct observations or experiences. All things must be equal. If circumstances change from testing scenario, the outcome probability will change

A

Empirical probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Most of medical probabilities are derived through..

A

Empiric means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

________ _______of an event is the number of ways that the event can occur, divided by the total number of outcomes. It is finding the probability of events that come from a sample space of known equally likely outcomes.
-Chance of winning on the roulette wheel

A

Theoretical probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Two events, X and Y , are ______ ________ if the occurrence of one precludes the occurrence of the other.

A

Mutually exclusive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Flipping a coin is an example of this because if the head side appears, the tails side does not.

A

Mutually exclusive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

the probability of X or Y is the probability of X plus the probability of Y
Pr (X or Y) =Pr (X) +Pr (Y)

A

Additive Law

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Two events, X and Y , are _______ dependent if the outcome of Y depends on X , or X depends on Y .
*Ex…life expectancy depends on gender, year born, access to health care, country of birth, etc.

A

Conditionally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Probability of being President is ______ on being a US citizen

A

Conditional

42
Q

Pr (X and Y)= Pr(X) x Pr(X|Y)

Pr (X|Y) means the probability of X occuring given that Y has occured.

A

Multiplicative Law

43
Q

The additive law is for…

A

Mutually exclusive

44
Q

The multiplicative law is for….

A

Conditionally dependent

45
Q

Shows the probabilities of different outcomes for a series of random events, each of which can have only one of two values.

A

Binomial distribution

46
Q

When you perform a hypothesis test in statistics, a _______ helps you determine the significance of your results.

A

P-value

47
Q

_______ tests are used to test the validity of a claim that is made about a population.

A

Hypothesis

48
Q

Usually refers to a general statement or default position that there is no relationship between two measured phenomena, or no difference among groups.

A

Null hypothesis

49
Q

A _____ p-value indicates strong evidence against the null hypothesis, so you reject the null hypothesis

A

Small

*(typically ≤ 0.05)

50
Q

A _____ p-value indicates weak evidence against the null hypothesis, so you fail to reject the null hypothesis.

A

Large

*typically > 0.05

51
Q

_______ _______ in experimental measurements are caused by unknown and unpredictable changes in the experiment. These changes may occur in the measuring instruments or in the environmental conditions

A

Random errors

52
Q

How do you minimize random error?

A

TAKE MORE DATA!

53
Q

_______ ________ can be evaluated through statistical analysis and can be reduced by averaging over a large number of observations

A

Random error

54
Q

includes all of the elements from a set of data. A measurable characteristic of a population such as mean, is called a parameter. Use N.

A

Population

55
Q

consists of one or more observations from the population. A measurable characteristic of a sample is called a statistic. Use n.

A

Sample

56
Q

is the purest form of probability sampling. Each member of the population has an equal and known chance of being selected. When there are very large populations, it is often difficult or impossible to identify every member of the population, so the pool of available subjects becomes biased

A

Random sampling

57
Q

is often used instead of random sampling. It is also called an Nth name selection technique. After the required sample size has been calculated, every Nth record is selected from a list of population members. As long as the list does not contain any hidden order, this sampling method is as good as the random sampling method. Its only advantage over the random sampling technique is simplicity. This is frequently used to select a specified number of records from a computer file.

A

Systemic sampling

58
Q

is commonly used probability method that is superior to random sampling because it reduces sampling error. A stratum is a subset of the population that share at least one common characteristic. Examples of stratums might be males and females, or managers and non-managers. The researcher first identifies the relevant strata and their actual representation in the population. Random sampling is then used to select a sufficient number of subjects from each stratum. “Sufficient” refers to a sample size large enough for us to be reasonably confident that the stratum represents the population.

A

Stratified sampling

*Stratified sampling is often used when one or more of the strata in the population have a low incidence relative to the other strata

59
Q

is used in exploratory research where the researcher is interested in getting an inexpensive approximation of the truth. As the name implies, the sample is selected because they are convenient. This nonprobability method is often used during preliminary research efforts to get a gross estimate of the results, without incurring the cost or time required to select a random sample.

A

Convenience sampling

60
Q

reflects that there will be no observed effect for our experiment. This is what we are attempting to overturn by our hypothesis test. We hope to obtain a small enough p-value that we are justified in rejecting this. WANT TO DISPROVE THIS!

A

Null hypothesis

61
Q

H0 states there is..

A

no difference between the interventions being studied (null hypothesis)

62
Q

Reflects that there will be an observed effect for our experiment.

*Denoted by Ha or by H1

A

Alternative or experimental hypothesis

63
Q

If the null hypothesis is rejected….

A

then we ACCEPT the alternative hypothesis!

there is a relationship! this is what we want!

64
Q

is inversely related to beta or the probability of making a Type II error. In short, power = 1 – β.

A

Statistical power

65
Q

is the likelihood that a study will detect an effect when there is an effect there to be detected.

A

Statistical power

66
Q

If statistical power is high, the probability of making a _____ error, or concluding there is no effect when, in fact, there is one, goes down.

A

Type II

67
Q

Statistical power is affected chiefly by the size of the effect and the ______________ used to detect it

A

Size of the sample

68
Q

occurs when the investigator determines there is a difference between the 2 groups (disproves H0) when in fact there is no difference, essentially, a false positive result

A

Type I Error

69
Q

The probability of a Type 1 Error is defined as the

A

Alpha level

70
Q

Occurs less commonly in ITT (intention to treat) due to the fact that ITT is a more cautious approach to data interpretation

A

Type I Error

71
Q

occurs when the investigator determines there is no difference between the 2 groups (accepts H0) when in fact there is a difference, essentially a false negative result

A

Type II Error

72
Q

The probability of a Type II Error is defined as the

A

Beta level

73
Q

This type of error may occur when analysis is too cautious

A

Type II

74
Q

A study in which people are allocated at random (by chance alone) to receive one of several clinical interventions. One of these interventions is the standard of comparison or control. The control may be a standard practice, a placebo (“sugar pill”), or no intervention at all.

A

Randomized Control Trial (RCT)

75
Q

Someone who takes part in a randomized controlled trial (RCT) is called a

A

Participant or subject

76
Q

seek to measure and compare the outcomes after the participants receive the interventions. Because the outcomes are measured, these are quantitative studies.

A

Randomized Control Trial (RCT)

77
Q

are quantitative, comparative, controlled experiments in which investigators study two or more interventions in a series of individuals who receive them in random order. One of the simplest and most powerful tools in clinical research. GOLD STANDARD***

A

Randomized Control Trial

78
Q

fall under the category of analytic study designs which aim to evaluate and identify causes or risk factors of diseases or health related events. The investigator does not intervene, yet “observes” and assess the strength and relationship between an exposure and a disease variable.

A

Observational studies

79
Q

Cohort studies, case-control studies, and cross-sectional studies are all examples of….

A

Observational studies

80
Q

group of people with defined characteristics who are followed up to determine incidence of, or mortality from, some specific disease, all causes of death, or some other outcome. In a cohort study, an outcome or disease-free study population is first identified by the exposure or event of interest and followed in time until the disease or outcome of interest occurs. Can be prospective or retrospective.

A

Cohort Study

81
Q

Similar to taking a history and physical; the diseased patient is questioned and examined, and elements from this history taking are knitted together to reveal characteristics or factors that predisposed the patient to the disease.

A

Case control studies

82
Q

Identify subjects by outcome status at the outset of the investigation. Outcomes of interest may be whether the subject has undergone a specific type of surgery, experienced a complication, or is diagnosed with a disease. Are quick, relatively inexpensive to implement, require comparatively fewer subjects, and allow for multiple exposures or risk factors to be assessed for one outcome

A

Case control study

83
Q

Data are collected on the whole study population at a single point in time to examine the relationship between disease (or other health related state) and other variables of interest

A

Cross-Sectional study

84
Q

provide a snapshot of the frequency of a disease or other health related characteristics in a population at a given point in time. Can assess the burden of disease in a population

A

Cross-Sectional Study

85
Q

answers a defined research question by collecting and summarizing all empirical evidence that fits pre-specific criteria. Ex: researching articles by other researchers to answer your question about a treatment, disease, etc.

A

Systematic review

86
Q

a subset of systematic reviews. A method that combines pertinent qualitative and quantitative study data from several studies to develop a single conclusion with a greater statistical power. Its conclusion is stronger than any single study due to more subjects, greater diversity of subjects, and more effects and results.

A

Meta-analysis

87
Q

recommendations to clinicians about the care of patients with specific conditions. These are based on the best available research evidence and practice experience. The guidelines include a systematic review of research evidence and a decision analysis (a set of recommendations involving both the evidence and value judgments regarding the benefits and harms of alternative care options)

A

Clinical practice guidelines

88
Q

percent of times a diagnostic exam identifies an abnormality **true positive is the numerator TP / TP + FN

A

Sensitivity

89
Q

percent of times a diagnostic exam suggests the right diagnosis **true negative is the numerator TN / FP + TN

A

Specificity

90
Q

A d-dimer test is _____ for blood clots but it is not _____ for PEs (the pt could have a DVT). To calculate the actual % we would have to be given a 2x2 table with true positive (TP), false positive (FP), false negative (FN) and false positive (FP) data.

A

it is SENSITIVE but NOT SPECIFIC

91
Q

When you have a positive screening (like a d-dimer) the next indicator you need to be aware of is the __________. This % is the probability that the initial positive test result will be a true positive and the person actually has the disease.

A

PPV (positive predictive value)

92
Q

The _________ is the % probability that your negative screening tests will actually mean you don’t have a disease (making sure the result is NOT a false negative).

A

NPV (negative predictive value)

93
Q

these #s are what we use to calculate sensitivity and specificity and relate closely to positive and negative predictive values.

A

False positives/False negatives

94
Q

For example, if the hospital you work in has a lot of false positive mammogram results it means the screening mammogram program has a low…….

A

Positive predicted value (PPV)

no equations for this, just collected data

95
Q

the risk/benefit ratio of being in one group in a study.

A

Absolute risk

96
Q

You need to calculate the _____ _____ of the control and the treatment groups before you can complete any other calculation. # outcomes (ex. survivors) / total #.

A

Absolute risk

97
Q

The difference (thus the equation is a subtraction problem) in the risk of receiving treatment versus not. AR (control) - AR(treatment).

A

Absolute risk reduction

98
Q

Clinically, _____ is important because you can express the risk/benefit ratio of trying a new drug to your patient in whole number forms

A

ARR (Absolute risk reduction)

99
Q

Pretend the BMJ just published an article saying a (hypothetical) new late-onset asthma prevention drug had an ARR of .30, or 30%. And pretend that normally the occurrence of late-onset asthma in adults is 1 in every 600 adults. This ARR means that the new drug does what to the absolute risk of developing late onset asthma?

A

Reduces ARR!

from the previous 1/600 to 1/2,000 adults. So, if one of your patients were at high risk for developing late-onset asthma, she may want to consider taking this new preventative medication.

100
Q

The ratio of risk between the two groups in the study. ARR / AR (control).

A

Relative risk reduction (RRR)