statistics part 1 Flashcards

1
Q

variability introduced due to either our measuring instruments not being precise enough or differences in how two different people read the measurement

A

observational or measurement variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

variability that accounts for the fact that members of populations are simply different

A

natural variability/inter individual variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

opposite of natural variability, occurs because we have assigned our population or sample to two or more treatment groups and then observe the variability between the groups

A

induced variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

occurs when we take multiple samples from a population randomly. these samples will be different due to the randomness of the sampling process

A

sample variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

observational studies, experimental studies,surveys

A

3 major types of ways to collect data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

collections of data from a population where variability is not induced by treatments but by the sample itself (sampling variability)

A

surveys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

collections of data from a population where assignment of individuals from the population into treatment groups is not under the control of those performing the study

A

observational studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

in experimental studies individuals are assigned randomly to treatment groups in order to determine the effect of the treatment on the variability of the data. In these cases, the assignment, although random, is under the control of those performing the study

A

experimental studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

is critical for being able to minimize variability due to sampling bias (error that results in a misrepresentation of a population)

A

why random sampling is important

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

mena, median, and mode in a set of data

A

measures of central tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

used to break up data into fourths, and each contains 25% of the data

first quartile is median for lower half of the data, second quartile is the median for the entire set of data, third quartile is the median for the upper half of the data (3/4s of the data lies below the third quartile and one fourth lies above)

A

quartiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

measures how the data is spread out & dispersed
look at the range, standard deviation (way to describe the difference between the mean and the values in the data set)

A

measure of dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

population form & sample form standard deviation

A

2 forms of standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

used when the data being analyzed includes the entire set of possible data

A

possible form standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

used when the data is a random sample taken from the entire set of data

A

sample form standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

1 var stats ORDER (mean, sum of all data values, sum of the squares of data values)
Sx- Sample standard deviation
ox- the population standard deviation
n- number of values in the data set
minX- lowest data value
Q1- quartile 1
med- median
Q3- 3rd quartile
maxX- highest data value

A

stats on calc (1-var stats)

17
Q

Sx

A

sample standard deviation, only use when asked

18
Q

ox

A

population standard deviation, use unless asked for sample standard deviation

19
Q

lowest, q1, median, q3, highest

A

box and whisker plot

20
Q

normal distributions are symmetrical with a single central peak at the mean (average) of the data. the shape of the curve is descrubed as bell shaped with the graph falling off evenly on either side of the mean.

50% of the distribution lies to the left of the mean and 50% lies to the right of the mean

A

normal distribution & normal curve

21
Q

for a normal distribution, nearly all of the data will fall within THREE standard deviations of the mean:

68% of the distribution lies within one standard deviation of the mean

95% of the distribution lies within two standard deviations of the mean

99.7% of the distribution lies within three standard deviations of the mean

A

empirical rule

22
Q

68% of the distribution lies within one standard deviation of the mean

95% of the distribution lies within two standard deviations of the mean

99.7% of the distribution lies within three standard deviations of the mean

A

percentages to know

23
Q

68% of the distribution lies within ONE standard deviation of the mean

A

68% of the distribution lies within ONE standard deviation of the mean

24
Q

95% of the distribution lies within TWO standard deviations of the mean

A

95% of the distribution lies within TWO standard deviations of the mean

25
Q

99.7% of the distribution lies within THREE standard deviations of the mean

A

99.7% of the distribution lies within THREE standard deviations of the mean

26
Q

** you can ONLY use the normal curve for a question if it specifically says that the data is NORMALLY DISTRIBUTED OR STANDARDIZED

A
27
Q
  • used to determine a particular data point’s percentile, which is the percent of the population less than the data point
  • when calculating a percentile, ALWAYS use a lower bound of -100,000

-2nd var stats, plug numbers into set and then solve but the equation MUST say normal distribution

A

calculating percentiles (normalcdf)

28
Q

the complete set of all subjects that share a common characteristic that is being studied

A

population