Physics #12 Flashcards

1
Q

measures of central tendency

A

those that describe the middle of a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

mean

A

adding up all of the values in the data set and dividing the result by the number of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

outlier

A

an extremely large or extremely small value compared to the other data values
can shift the mean in one way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

median position calculation

A

(n+1)/2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

when is using a mean not helpful

A

when there is an outlier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

when is using a median not helpful

A

when there is a large range or multiple modes (good for data sets with outliers though)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

if the median and mean are far from each other, this implies the presence of a ______

A

outlier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

if the mean and median are close, this implies a ______ distribution

A

symmetrical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

describe the values around the median

A

50% above and 50% below it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

mode

A

the number that appears most often in a set of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

can there be multiple modes or no modes?

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

the mean is best for data with a ____ distribution

A

normal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is unique about the normal distribution and central tendencies?

A

in the normal distribution, all of the measures of central tendency are the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the mean and standard deviation of a standard distrubtion?

A

mean is 0 and sd is 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

a ____ distribution is one that contains a tail on one side of the data set

A

skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

positive skew vs. negative skew

A

positive skew has tail to the right and mean greater than the median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

where is the tail for a right skew distribution

A

to the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

bimodal distribution

A

a distribution containing two peaks with a valley in between.
can be measured as two different distributions if there is so little data in the valley.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

can data that do not follow a normal distribution be analyzed with measures of central tendency and measures of distribution?

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

range of data set

A

distance between largest and smallest values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

when you do not have a complete data set how do you approximate the standard deviation of a normal distribution?

A

1/4 of the range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

interquartile range

A

related to the median, first, and third quartiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

what is the median in quartile speak

A

Q2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

quartiles

A

divide data in ascending order into groups that comprise 1/4 of the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

how to calculate a quartile

A

first quartile: multiply n by 1/4
if whole number, take mean between that whole number and the next and that is the quartile
if decimal, round up to next whole number and take that as quartile position
same for 3rd quartile except multiply by 3/4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

how to calculate interquartile range

A

Q3-Q1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

how to calculate outlier with interquartile range

A

an outlier is 1.5x the interquartile range above the 3rd quartile or below the 1st quartile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

how is standard deviation calculated

A

difference between each data point and the mean, squaring it, summing all of these. Then divide by the number of points in the data set minus 1 and then taking square root of everything.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

how to calculate an outlier using standard deviation

A

more than 3 standard deviations from the mean

30
Q

the average distance from the mean will always be ____

A

0

31
Q

independent events

A

have no effect on one another
can occur in any order without impacting each other
probabilities not expected to change

32
Q

dependent events

A

the probability of the second event is dependent on the first event

33
Q

mutually exclusive outcomes

A

cannot occur at the same time

one cannot flip both heads and tails in one throw

34
Q

what is the probability of two mutually exclusive outcomes occurring?

A

0%

35
Q

exhaustive

A

a group of outcomes is exhaustive if there are no other possible outcomes
Ex: flipping a coin be heads or tails. These are the only two possibilities.

36
Q

if events are mutually exclusive, the probability of both of them happening is gotten by ____

A

multiplying their probabilities together

37
Q

what is the probability of having at least one boy in 10 live births if boy or girl is 50/50

A

99.9%

38
Q

null hypothesis

A

hypothesis of equivalence

two groups are statistically equal

39
Q

directional vs. nondirectional hypothesis

A

nondirectional: the populations are not equal
directional: mean of population A is greater than mean of population B

40
Q

what type of distribution do t-tests rely on?

A

standard

41
Q

test statistic

A

calculated and compared to a table to determine the likelihood that that statistic was obtained by random chance. This likelihood is our p-value

42
Q

p-value

A

from a table, the likelihood that the statistic was obtained by random chance

43
Q

the p-value is compared to a ______

A

significance level (usually 0.05)

44
Q

if p-value is greater than alpha, then we _____

A

fail to reject the null hypothesis, not a statistically significant difference between the two populations.

45
Q

type I error

A

the likelihood that we report a difference between two populations when one does not actually exist

46
Q

type II error

A

when we incorrectly fail to reject the null hypothesis. The likelihood we report no difference between two populations when one actually exists

47
Q

power

A

the probability of correctly rejecting a false null hypothesis

48
Q

confidence

A

the probability of correctly failing to reject a true null hypothesis.

49
Q

how to produce confidence intervals

A

95% confidence level, find z- or t-score and multiply this by standard deviation before adding and subtracting from the mean to get a range of values

50
Q

what does a pie chart represent

A

relative amounts of entities

popular in demographics

51
Q

cons to pie chart

A

when there are so many categories the graph becomes incoherent

52
Q

bar charts

A

used for categorical data, which sort data points based on predetermined categories

53
Q

histograms

A

numerical data

good for mode, showing distribution of data

54
Q

what does a box plot show

A

range, median, quartiles, and outliers for a set of data

also called a box and whisker plot

55
Q

how. is a box and whisker plot laid out?

A

box is around Q1 to Q3, the median is a line in the middle. Whiskers extend to max and min values of the data set. Outliers can also be dots with the whiskers the closest point within 1.5 IQR

56
Q

maps organize data _____

A

geographically

57
Q

do linear graphs have to be linear?

A

no, could be linear, parabolic, exponential, or logarithmic

58
Q

linear graphs

A

show the relationship between two variables

axes will be consistent that each until will occupy the same amount of space

59
Q

semilog graphs

A

one axis is exponential (uses a ratio) and the other is linear

60
Q

log-log graphs

A

both axes use ratios

61
Q

what is difficult of map representations of data?

A

may only be able to look at 2 variables max

62
Q

positive vs. negative correlation

A

positive: as one goes up, the other goes up
negative: as one goes up, the other goes down

63
Q

correlation

A

connection between two variables

64
Q

correlation coefficient

A

indicates the strength of a relationship

65
Q

what does a correlational coefficient of 0 mean?

A

no apparent relationship

66
Q

what is the only one of Hill’s criteria uniformly required for causation?

A

temporality

67
Q

all variables that are causally related must be correlationally related. True or false?

A

true

68
Q

if all numbers are the same, what is the mode?

A

no mode

69
Q

the standard distribution is the normal distribution shifted so _____

A

the mean is 0 and standard deviation is 1

70
Q

just cause there is statistical significance, should there be clinical significance

A

not necessarily, if it does not benefit anyone.

71
Q

linear graph if ______ scale throughout

A

same (addition)

72
Q

if a study has low power what does this mean

A

it is more difficult to get results that are statistically significant