Biostat Chapters 1-4 Flashcards

1
Q

3 Basic Measurement Scales

A
  1. Categorical (nominal)- represent unordered categories
  2. Ordinal- Categories that can be put in order/ranked
  3. Quantitative (scale, continuous, interval, ratio)- represent meaningful numerical values for which arithetic operations make sense
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Arithmetic Average

A

A distribution’s gravitational center. This is where the distribution would balance if placed on a scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Average

A

Refers to the center of distribution. Can be measured as arithmetic average and the median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Axis multiplier

A

included to show the total value of the stem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bias

A

An over or underestimation of something (value, factor)A systematic error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Biostatistics

A

Broad range of activities that help us improve the intellectual content of data from biological, biomedical, and public health realted studies. It is more than just a compilation of computational methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Blinding

A

When the study subject is kept in the dark about the explanatory variable being used.Double blind is when the study subject and investigator are both kept in the dark.Triple blind is when the study subject, the investigator and the statistician are all kept in the dark.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Cargo Cult Science

A

Appears to be scientific but does not follow the scientific method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Categorical Measurements

A

Places observations into classes or groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Census

A

A survey that attempts to collect information on all individuals in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Chebychev’s rule

A

Applies to all data sets, regardless of their shape. Says at least three-fourths of the data points will lie within two standard deviations of the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Cluster Samples

A

Randomly selected larger units (clusters) consisting of smaller subunits.

Ex. Households

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Comparative Studies

A

Designed to quantify the relationship between variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Complex Sampling Designs

A
  1. Cluster Sample
  2. Stratified Sample
  3. Multistage Sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Confounding

A

When lurking variables effect the explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Controlled Trial

A

A trial when there is one or more control group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Data tables

A

Form containing observations, variables and values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Degrees of freedom

A

The variance is the average of the sum of squares, with the sum of squares divided by “n-1” instead of “n”. The number “n-1” is the degrees of freedom of the variance. You lose one degree of freedom because knowing n-1 of the deviations determines the last deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Descriptive statistics

A

a set of observations that describe the characteristics of a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Deviation

A

Data point minus the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Distributional shape

A

Describes the symmetry, direcion of skew if asymmetric, modality (number of peaks), and kurtosis (steepness of peaks).Note: descriptors of shape are unreliable whn data sets are small and moderate in size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Equipose

A

Balance doubt between benefits and risks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Explanatory Variable (Independent Variable/Factor)

A

The exposure being investigated in a comparative study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Factors

A

Explanatory variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Frequency Distributions
Tells us how often various values occur in a batch of numbers
26
Frequency tables
List frequencies (counts), relative frequencies (proportions), and cumulative frequencies (proportion of values up to and including the current value). Quantativative data may first need to be grouped into class intervals before tallying frequencies.
27
GIGO
"Garbage In, Garbage Out"
28
How are categorical variables displayed?
Bar or Pie chart
29
Imprecision
Inability to get the same result upon repetition
30
Infer
Deduce or conclude based on evidence
31
Inferential statistics
consists of a set of statistical techniques that provide predictions about the population based on information in a sample from that population
32
Interaction
When multiple factors produce an effect that would otherwise not be predicted
33
Location
References the distribution's center. Measured by mean and median
34
Lower outside value
Values below the lower fence
35
Lurking variables
Extraneous factors that influence the explanatory variables' response variable
36
Measurement
Assigned numbers or codes according to prior-set rules
37
Multistage sampling
Large units are chosen at random. Subunits are sampled in successive stages.
38
Nonexperimental Studies
"observational study". The investigator merely classifies individuals as exposed or nonexposed without intervention
39
Nonresponse bias
Nonresponse bias is the bias that results when respondents differ in meaningful ways from nonrespondents. Nonresponse is often problem with mail surveys, where the response rate can be very low. Occurs when a large percentage of individuals refuse to participate in a survey or cannot otherwise be contacted.
40
Objectivity
The intent to measure things as they are without shaping them to conform to a preconceived worldview
41
Observation
A unit upon which measurements are made (e.g. individuals). Data from observations are stored in rows of data tables
42
Placebo
an inert or innocuous intervention
43
Population
A group of people
44
Precision
Ability to be replicated
45
Probablity Sample
A sample in which each member of the population has a known probablity of entering the sample
46
Resistance
Ability to remain relatively consistent when outliers are present
47
Response Variable
The outcome of a study
48
Sample
A subset of a population
49
Sampling frame
The actual set of units from which a sample has been drawn.
50
Sampling Independence
Independent samples are those samples selected from the same population, or different populations, which have no effect on one another. That is, no correlation exists between the samples.
51
Simple Random Sample (SRS)
A probability sample in which each population member has the same chance of entering the sample
52
Spread
Refers to the extent to which values are dispersed.
53
Stem-and-leaf Plot
The stem on a stemplot represents a number line with "bins". Leaves represent the following significant digits value. Excellent way to explore the shape, location and spread of distribution.
54
Stratified Random Sample
A sample that draws independent SRSs from within relatively homogenous groups or "strata". Ex. Population divided into 5 year age blocks (0-4, 5-9, 10-14, etc.)
55
Surveys
Designed to quantify population characteristics
56
The Normal Rule
When there is Normal distribution: 68% of the data points will lie within one SD of the mean. 95% of the data points will lie within two standard deviations of the mean. 99.7% of the data points lie within three standard deviations of the mean.
57
The three measures of central location
Mean, Median, Mode
58
Treatment
A specific set of factors applied to study subjects
59
Trials
Experiment involving human subjects
60
Types of Comparative Studies
Experimental studies permit the investigator to assign study exposures to study subjects Observational studies- do not permit the assignment of exposures to study subjects
61
Types of Statisical Studies
1. Surveys 2. Comparative Studies - Experimental - Observational
62
Undercoverage
When some groups in the source population are left out or are underrepresented. Can cause bias
63
Upper outside value
values above the upper fence
64
Valid
When somthing is unbiased. It is not over or under estimated when compared to the true value
65
Validity
Ability to objectivley identify the true nature of the observation
66
Values
Realized measurements. For example, the value for the variable "AGE" for observation #1 is "32". Values are stored in table cells
67
Variables
Characteristics which are measured such as age, gender or disease status. Data from variables form columns of data tables
68
Volunteer Bias
Can occur because self-selected participants of a survey tend to be atypical of the population