Sampling, data description & probability Flashcards

0
Q

Population (target population)

A

The totality of subjects about which we want to make inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

Biostatistics

A

Application of statistical methods to medical and biological problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample

A

The subset of the population on which data is actually collected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sampling error

A

The difference between the sample result and the true underlying population value. Error may be caused due to bias (sampled subjects are not representative of the population) or random variation (variation due strictly to chance, even with unbiased selection of subjects)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sample of convenience

A

Take who you can get. Easy to obtain and bias may be a problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Random sample

A

Every individual in the population has an equal chance of being in the sample. Used to ensure that uncontrolled factors do not bias results. May be difficult to obtain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Stratified sampling

A

Sample is drawn within each of two or more strata (groups with common characteristics). Used to improve accuracy in certain circumstances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Data variables

A

The measurement or observation made of the sampled subjects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Categorical

A

Values fit into natural categories. Ex. Gender, disease status, vital status

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Discrete variable

A

Ordered numerical data restricted to integer values (count data). Ex. Number of siblings, # of days hospitalized

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Continuous variable

A

Numerical data that can take on any value. Often limited by precision of measuring instrument. Ex. Age, height, weight, cholesterol

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Descriptive statistics

A

Part of statistical methods that deals with organizing and summarizing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Frequency distribution

A

A table of categories along with their observed frequencies. Categories may be natural (gender, race) or they may be crated from continuous variables by grouping values together (21-49 years old)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Histogram

A

Graphical representation of a frequency or relative frequency distribution. Used to determine shape of a distribution of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Mean

A

Sum of all observations divided by n, the number of subjects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Median

A

Half of the values are below the median and half are above. The middle most observation of ordered data

16
Q

Mode

A

The most frequently occurring observation in the sample

17
Q

Range

A

The difference between the highest and lowest observation

18
Q

Standard deviation

A

A measure of the average distance of each observation from the mean

19
Q

Normal distribution

A

Widely used distribution with a bell shape. Mean, median and mode are equal. 68% of its area lies within 1 std dev of mean. 95% of its area lies within 1.96 std dev of mean. 99% of its area lies within 2.58 std dev of mean

20
Q

Confidence interval

A

Also called confidence limits. An interval that describes where the population mean is likely to be with a certain level of confidence (usually 95%). Formula: mean +/- (1.96)(SEx)

21
Q

Standard error of the mean

A

Computed as SEx= s/n^1/2

22
Q

Set

A

Collection of distinct objects (ex. Sample of patients)

23
Q

Event

A

A characteristic defining a subset of our set (ex. Affliction with a disease)

24
Q

Pr(event)

A

Probability of the event, estimated as (# experiencing event)/(#in the set)

25
Q

Conditional probability

A

Let e1 and e2 be two events. The conditional probability of e1 given e2 is expressed as Pr(E1|E2) = probability of e1 and e2 occurring jointly/probability of e2 occurring

26
Q

General multiplication rule

A

Pr(E1 and E2) = Pr (E1|E2) x Pr(E2)