Biostatistics Flashcards

1
Q

What is Biostatistics defined as?

A

-the application of statistical theory in medicine, public health, or biology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Population

A
  • large group of all subjects interest in a study.
  • very large almost impossible to collect data from whole population
  • have to select a subset of subjects to get samples from a population, then generalize findings from samples to the target population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

conceptual population

A

-population of persons people/ identity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

population in clinical setting?

A

-usually talk about it as population of measurements such as weight, height, and blood pressure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Total population

A
  • target population
  • all of the subjects of interest in a study, about which the study wants to generalize the conclusion
  • ex: all 12 year olds in US
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Defined population

A
  • a subpopulation confined by certain characteristic(s) such as demographics and geographical areas in the total population
  • ex: 12 year olds in RUSD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Study population

A

The group of individuals in a study

  • In a clinical trial, all participants who followed criteria of inclusions and exclusions make up the study population
  • ex 12 year olds in certain school in RUSD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sample

A
  • a procedure to select samples from the study population, they are representative of the total population
  • validity based on how random and well rounded this sample group is
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do we estimate the required sample size?

A

1) pre-determined power (80/90%)
2) specific significance level
3) mean & variance of the primary outcome; can be approximated
4) the design of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is a statistic?

A
  • calculated from a sample for a specific characteristic of the sample
  • will be used to estimate the corresponding parameter of the study population & further generalized to the target population to find mean & standard deviation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why do we sampling to get samples?

A

Money, Time, Practicality, and Accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Probability sampling

A
  • each subject has a known probability of being selected
    1) simple random sampling
    2) stratified random sampling
    3) Systematic random sampling
    4) Clustered random sampling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

simple random sampling?

A

based on probability to take the samples

-each member of the subset has an equal probability of being chosen

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

stratified random sampling?

A
  • stratify the study population into subgroups, then take random samples from such subgroups
  • ex: names of 25 employees being chosen out of a hat from a company of 250
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Systematic random sampling

A
  • type of probability sampling
  • members from a larger population are selected according to a random starting point and a fixed, periodic interval
  • interval, is calculated by dividing the population size by the desired sample size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Clustered random sampling

A
  • the researcher divides population into separate groups (clusters)
  • a random sample of clusters is selected from the pop
  • researcher conducts his analysis on data from the sampled clusters
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

non-probability sampling

A
  • each subject doesn’t know probability of being selected, –stimations are biased
    1) voluntary samples
    2) convenience samples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

voluntary samples

A

-whoever is self selected into the samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

convenience samples

A

-whoever is convenient to be selected and/or investigated. -ex: staff members in med school recruited for some trials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Sampling errors

A
  • random errors
  • are unavoidable
  • the differences between the sample & population, due to sampling randomness
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

sampling errors effect on the data?

A
  • Random error does not have consistent effects across the entire sample
  • the sum would be zero if the sample size is large enough.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

what does sampling error add? what does it not affect?

A

-random error adds variability to the data but doesn’t affect average performance of the samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Non-sampling errors

A
  • more serious due to mistakes made in the acquisition of data, inappropriate sample selection, or response biases
  • can bias estimation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Random Samples vs. Randomization

A

Both involve the use of the probability sampling method.

25
Q

Random sampling

A
  • determines who should be included in the samples;
  • related to sampling procedure;
  • effect external validity (generalizability)
26
Q

Randomization or random assignment

A
  • determines if sampled subjects should be in treatment or control group
  • related to design operation
  • effects internal validity
27
Q

how we can conduct correct population inferences from the sample data we have?

A
  • mean and variance are the cornerstones for statistical inference
  • used to estimate the corresponding values of population.
28
Q

1) letters used for population?

2) letters used for sample estimation?

A

1) greek letters

2) sample estimation

29
Q

How use an unbiased estimation of population variance?

A
  • use n-1 instead of n in sample variance equation

- N (capital N) used for the population variance

30
Q

Standard deviation (SD)?

A
  • measure of the variability of a measurement among the subjects in a population or in a sample
  • can be estimated for both population & sample
  • often unknown for pop. but can be estimated from SD of its samples
31
Q

Standard Error (SE)?

A
  • measure of the precision of the sample mean
  • decreases w/ increasing sample size
  • estimated only for sample.
32
Q

In general what is reported when summarizing the sampling variation?

A
  • SD (standard deviation)

- SE is given for any statistical inference of mean

33
Q

What is Outcome?

A
  • a specific characteristic of interest being studied
  • could be more than one
  • also called endpoints
  • can be described or quantified by different measurement scales
34
Q

Measurement scales (4)?

A

1) Nominal
2) Ordinal
3) Interval
4) Ratio

35
Q

Nominal

A
  • Categorizing subjects by characteristics such as gender and ethnicity
  • the categories have neither order or ranking
  • neither logic nor mathematical operation
36
Q

Ordinal

A
  • ranking subjects into different orders
  • precise differences between ranks do not exist, and one rank is not better than other
  • pain severity
37
Q

Interval

A
  • quantitative measurement with equal units, every one unit difference is meaningful and constant,
  • doesn’t have absolute zero point
  • ex: temperature
38
Q

Ratio

A
  • uses zero to present the absence of value (absolute zero point)
  • ex: height, age, weight
39
Q

Qualitative data

A
  • describe subjects in qualities, cannot measure subjects in precise quantities
  • nominal scale and ordinal scale.
40
Q

Quantitative data

A
  • quantify the quantities of the subjects, can be measured by scales or counted by numbers.
  • can be discrete or continuous
41
Q

Discrete

A
  • Reflect a number during the counting process, no decimal
  • Zero is the minimum
  • ex: number of children
42
Q

Continuous

A

Reflect a measurement with decimal places, often depends on the precision of the measuring device

43
Q

What does probability provide?

A
  • a quantitative description of the chances (of successful treatment outcomes) or likelihoods (of disease) associated with various outcomes
  • provides a bridge between sample statistics and population parameters
44
Q

what three components do you need to estimate probability?

A

1) outcome
2) sample space
3) an event

45
Q

outcome?

A
  • a possible result from an experiment

- ex: toss coin get heads or tails

46
Q

Sample Space?

A
  • the set of all possible outcomes of an experiment

ex: flip coin can be head or tails

47
Q

An Event?

A
  • a subset of outcomes that are equally likely to take place, can be defined by one or more members of the sample space.
    ex: toss coin can be H/T; but if toss coin more than once can be HHT or TTH etc
48
Q

Empirical Probability ?

A
  • estimated as the proportion of: how many times the event of interest occurred / the total number of all the potential events that might be observed
  • epidemiology, is defined by empirical probability
49
Q

Marginal probability

A

the probability of an event occurring

50
Q

Conditional probability

A
  • measure of the probability of an event occurring given that another event has occurred.
  • important in medicine since disease is based on many factors
51
Q

Prevalence

A
  • expressed & reported as a percentage, per 1000, or per million
  • measure of disease burden in the population but not a measure of risk
  • is a snapshot in time, but can use different time scales
52
Q

time scales used in prevalence calculations?

A

1) point prevalence (at a certain time point, most common)`
2) period prevalence (during a certain time period)
3) cumulative incidence (before a time point)

53
Q

what does incidence rate measure?

A

1) new cases in a population over a given period
2) risk of developing a disease within a given period
3) how quickly new cases develop in the population
- per 100; 100 million, or 1000
- also called absolute risk

54
Q

how are prevalence and incidence rate related?

A

-Prevalence= Incidence Γ— Disease Duration

55
Q

how does a new treatment for lung cancer patients that prolongs survival effect the prevalence, incidence & disease duration?

A

1) prevalence increased
2) incidence unchanged
3) duration increased

56
Q

how does an effective AIDS vaccine created and approved

effect the prevalence, incidence & disease duration?

A

1) prevalence decreased
2) incidence decrease
3) duration unchanged

57
Q

how does a sensitive & early detection test is developed to diagnose cancer at earlier stages to make cancer relatively easy to control
effect the prevalence, incidence & disease duration?

A

1) prevalence increase
2) incidence increase
3) duration increase

58
Q

Mortality Rate

A

(π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘‘π‘’π‘Žπ‘‘β„Žπ‘  π‘‘π‘’π‘Ÿπ‘–π‘›π‘” π‘Ž 𝑠𝑝𝑒𝑐𝑖𝑓𝑖𝑐 π‘π‘’π‘Ÿπ‘–π‘œπ‘‘)/(π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘π‘’π‘Ÿπ‘ π‘œπ‘›π‘  𝑖𝑛 π‘‘β„Žπ‘’ π‘π‘œπ‘π‘’π‘™π‘Žπ‘‘π‘–π‘œπ‘› π‘‘π‘’π‘Ÿπ‘–π‘›π‘” π‘‘β„Žπ‘’ π‘π‘’π‘Ÿπ‘–π‘œπ‘‘

59
Q

Case fatality rate

A
  • is a %

- (π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘‘π‘’π‘Žπ‘‘β„Žπ‘  π‘Žπ‘“π‘‘π‘’π‘Ÿ π‘‘π‘–π‘ π‘’π‘Žπ‘ π‘’ π‘œπ‘›π‘ π‘’π‘‘ π‘œπ‘Ÿ π‘‘π‘–π‘Žπ‘”π‘›π‘œπ‘ π‘–π‘ )/(π‘π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘‘β„Žπ‘’ π‘π‘’π‘Ÿπ‘ π‘œπ‘›π‘  π‘‘π‘–π‘Žπ‘”π‘›π‘œπ‘ π‘’π‘‘ π‘€π‘–π‘‘β„Ž π‘‘β„Žπ‘’ π‘‘π‘–π‘ π‘’π‘Žπ‘ π‘’) x 100