Lectures 1-4 Flashcards

Question

Ordinal or ordered scale

Answer 1

e.g., ratings, preferences

Answer 2

refers to the differences among a set of measurements

Answer 3

differences among persons (experimental units) in the “true” values of the variable of interest

Answer 4

differences between the measured and true values

Answer 5

difference between the average (expected) value of a measurement (variable) and the true value that it targets

Answer 6

variation among measurements about their average or mean value, even if that mean differs from the true targeted value

Answer 7

MSE= variance + bias^2

Answer 8

something that brings about an effect or result

Answer 9

``` another variable (𝑋𝑋2) that needs to be taken into account when assessing the true association between the risk factor 𝑋𝑋1and the outcome 𝑌𝑌 BMI ```

Answer 10

another variable (𝑋𝑋2) that identifies subgroups of individuals (units) across which the association between the risk factor 𝑋𝑋1and the outcome 𝑌𝑌will differ

Answer 11

Estimate the association between the outcome of mortality and treatment, and characterize the estimate’s uncertainty

Answer 12

Best predict the outcome of mortality on the basis of available data of treatment and other factors, and characterize the prediction’s accuracy

Answer 13

control allocation of “treatment” to subjects (experimental units)

Answer 14

control variation (e.g., effect of pesticide on rate of mutations in rat pups)

Answer 15

randomize to produce groups with similar observed and unobserved characteristics; average over rather than control variation (e.g., compare two treatments to reduce blood pressure)

Answer 16

do not control allocation of “treatment” to subjects (experimental units)

Answer 17

the count(frequency) of the number of individuals in a particular group

Answer 18

a frequency distribution which describes an observed set of values of a variable

Answer 19

``` the count (frequency) of the number of individuals in a particular age group or lower age group ►That is, the cumulative count ```

Answer 20

the proportion of individuals in a particular age group = the count (frequency) of the number of individuals in a particular age group divided by the overall total

Answer 21

the cumulative proportion of individuals in a particular age group or any lower age group

Answer 22

difference between largest and smallest values

Answer 23

“average” of the squared differences of observations from the sample mean 𝑠𝑠2=Σi=1n(xi−𝑥𝑥)2𝑛𝑛−1

Answer 24

𝑠𝑠=𝑠𝑠2. square root of variance

Answer 25

Upper hinge =𝑄𝑄3 ►Median=𝑄𝑄2 ►Lower hinge=𝑄𝑄1 ►Interquartile range (IQR)= 𝑄𝑄3−𝑄𝑄1 ●Contains the middle 50% of the observations ►Whiskers: lines drawn to the smallest and largest actual observations within the calculated fences

Answer 26

Fences are notobserved data points ►Fences are calculated to provide guidelines for identifying outliers ►𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑓𝑓𝑒 𝑒𝑒𝑒𝑒𝑒 =𝑢𝑢𝑝 𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝 ℎ𝑖𝑖𝑖 𝑖𝑖𝑖 +1.5∗𝐼𝐼𝐼 𝐼𝐼=𝑄𝑄3+1.5∗𝐼𝐼𝐼 𝐼𝐼 ►𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿𝐿 𝑓𝑓𝑒 𝑒𝑒𝑒𝑒𝑒 =𝑙𝑙𝑜 𝑜 ℎ𝑖𝑖𝑖 𝑖𝑖𝑖 −1.5∗𝐼𝐼𝐼 𝐼𝐼=𝑄𝑄1−1.5∗𝐼𝐼𝐼 𝐼𝐼

Answer 27

Outliers are actual observed data values falling beyond the calculated fences (higher or lower)

Answer 28

more lower values, sparse higher values ►Also: long “tail” of higher values ►Also: mean > median > mode

Answer 29

reverse of positively skewed

Answer 30

not skewed in either direction

Answer 31

Values that are “far” from most values | ►Importance: a few outlying values can strongly influence certain statistical summary measures and analyses

Answer 32

each increment represents change by a constant amount

Answer 33

each increment represents change by a constant multiplier

Answer 34

provides a measure of the uncertainty associated with the occurrence of events

Answer 35

exactly the experiment result

Answer 36

specific way(s) the experiment can turn out

Answer 37

Two events, A and B, are mutually exclusive if the events cannot occur together

Answer 38

Two events, A and B, are statistically independent if the probability of A occurring is not influenced by the presence or absence of B

Answer 39

𝑃𝑃(𝐴𝐴|𝐵𝐵)=𝑃𝑃𝐴𝐴and𝐵𝐵/𝑃𝑃𝐵𝐵, where 𝑃𝑃𝐵𝐵≠0 (Vertical bar | = “given”) 12

Answer 40

𝑃𝑃(𝐴𝐴|𝐵𝐵)=𝑃𝑃(𝐴𝐴) | ►That is, the probability of 𝐴𝐴occurring is not influenced by the presence or absence of 𝐵𝐵

Answer 41

“and”𝑃𝑃𝐴𝐴and𝐵𝐵

Answer 42

From conditional probability, we can write the joint probability as …

Answer 43

Two outcomes or events are mutually exclusive if and only if the probability of their joint outcome equals zero

Answer 44

Two outcomes or events are statistically independent if and only if the probability of their joint outcome equals the product of the probabilities of occurrence of each outcome

Answer 45

a complete listing of the probabilities for every possible value of a random variable

Answer 46

two possible outcomes ►Underlies much of statistical applications to epidemiology ►Basic model for logistic regression

Answer 47

uses counts of events or rates | ►Basis for log-linear and survival models

Answer 48

means are normally distributed or approximately normally distributed

Answer 49

useful in describing times to events and population growth

Answer 50

Factorial ►Permutations ►Combinations

Answer 51

𝑛𝑛factorial” = number of possible arrangements (orderings) of n objects ► Notation: “𝑛𝑛factorial” =𝑛𝑛!

Answer 52

ordered arrangement of 𝑛𝑛objects taken 𝑟𝑟at a time

Answer 53

a selection of 𝑛𝑛objects taken 𝑟𝑟at a time without regard to order

Answer 54

Describes the totally random (haphazard) occurrences of events in time or objects in space

Lectures 1-4 Flashcards

(78 cards)