Chapter 6 - Estimation Flashcards

1
Q

Two types of statistical inference

A
  • estimation

- hypothesis testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Estimation

A
  • concerned with estimating the values of specific population parameters (point estimates)
  • sometimes, interval estimation is carried out to specify a range
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Hypothesis testing

A
  • concerned with testing whether the value of a population parameter is equal to some specific value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Random sample

A
  • a selection of members of the population so that each member is independently chosen and has a known nonzero probability of being selected
  • a popular alternative to random sampling is cluster sampling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Simple random sample

A
  • a random sample where each group member has the same probability of being selected
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Study population

A
  • the group we want to study

the random sample is selected from the study population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Randomized clinical trials

A
  • optimal study design in clinical research
  • used for comparing different treatments, in which patients are assigned to a particular treatment by some random mechanism
  • randomization = the process of assigning treatments to patients
  • patients assigned to different treatment modalities will be smaller if the sample sizes are large
  • if sample sizes are small, then patient characteristics of treatment groups may not be comparable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Methods of randomization

A
  • random selection

- random assignment (block randomization)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Stratification

A
  • patients are subdivided into subgroups, or strata, according to characteristics thought important for patient outcome(s)
  • separate randomization lists are maintained for each stratum
  • typical characteristics = age, sex, overall clinical condition
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Blinding

A
  • double blind = neither the physician nor the patient knows what treatment the patient is getting
  • single blind = the patient is blinded as to treatment assignment but the physician is not
  • unblinded = both the physician and patient are aware of the treatment assignment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Design features of RCTs

A
  • gold standard = randomized double-blind study
  • this prevents biased reporting of outcome
  • however, it may not always be feasible
  • in some cases, the side effects may strongly indicate actual treatment received
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Estimation of the mean of a distribution

A
  • the minimum variance unbiased estimator of the population mean is the sample mean
  • population mean = expected value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Standard error of the mean

A
  • the standard error represents the estimated standard deviation obtained from a set of sample means from repeated samples of size n from a population with underlying variance
  • it is not the standard deviation of an individual observation
  • as sample size increases, the variability of the mean (standard error) decreases
  • variance can be affected by experimental technique
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Central-limit theorem

A
  • the skewness of the distribution can be reduced by transformation data using the log scale
  • the central-limit theorem can then be applicable for smaller sizes
  • as sample size increases, the distribution of the sample mean becomes approximately normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Interval estimation

A
  • interval estimation involves specifying a range within which parameter values are likely to fall
  • 95% of the Z values from the repeated samples of size n will fall between the interval of -1.96 and 1.96
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

t distribution

A
  • standard deviation is rarely known in practice, and n is often small
  • when n is small, it is not safe to assume that it is normally distributed
  • this problem was solved by William Gossett (Student)
  • the t distribution is not unique, but is a family of distributions with the parameter of degrees of freedom
  • t distribution becomes very similar to the normal distribution as the degrees of freedom increases
17
Q

Factors affecting the length of a CI

A
  • as the sample size increases, the length of the CI decreases
  • as the standard deviation increases, the length of the CI increases (cannot really be controlled)
  • as the confidence desired increases, the length of the CI increases
18
Q

Chi-square distribution

A
  • used to find the sampling distribution of the sample variance , in order to obtain an interval estimate of the population variance
  • only takes on positive values and is always skewed to the right
  • n is equal to or greater than 3, the distribution has a mode greater than 0
  • the skewness diminishes as n increaes