Ch 12 - Data Based and Statistical Reasoning Flashcards

1
Q

What does a measure of central tendency provide?

A

a single value that represents the middle of a group of data

2
Q

What is the arithmetic mean (average)?

A

a measure of central tendency that weights all values equally; because every value counts, it is strongly affected by outliers

3
Q

What is the median?

A

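the value that lies in the middle of the ordered data set
- 50% of data points lie above the median and 50% lie below it.
- position of the median = (n+1)/2, where n is the number of data values; if n is even, take the mean of the two middle values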

4
Q

What is the mode?

A

the data point that appears most often; there may be multiple (or 0) modes in a data set
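
The three measures of central tendency can be compared with a minimal sketch using Python's standard `statistics` module. The data set is hypothetical and chosen only for illustration; adding one extreme value shows how far the mean moves while the median and mode barely change.

```python
from statistics import mean, median, multimode

# Hypothetical data set, chosen only for illustration.
scores = [2, 3, 3, 4, 5]
with_outlier = scores + [100]  # same data plus one extreme value

# multimode returns a list, because a data set may have several modes.
print(mean(scores), median(scores), multimode(scores))
print(mean(with_outlier), median(with_outlier), multimode(with_outlier))
```
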

5
Q

What is a normal distribution?

A

a symmetrical, bell-shaped distribution in which the mean, median, and mode are all the same

6
Q

What is the standard distribution?

A

a normal distribution with a mean of 0 and a standard deviation of 1

  • used for most calculations
  • approximately 68% of data points fall within 1 standard deviation of the mean, 95% within 2, and 99.7% within 3
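
The percentages above can be checked with a short sketch that uses `statistics.NormalDist` from the Python standard library. The helper name `share_within` is an assumption made for this example.

```python
from statistics import NormalDist

# Standard distribution: mean 0, standard deviation 1.
standard = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Fraction of values within k standard deviations of the mean."""
    return standard.cdf(k) - standard.cdf(-k)

for k in (1, 2, 3):
    print(k, round(share_within(k), 4))
```
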
7
Q

What are skewed distributions?

A

distributions in which the mean, median, and mode differ

- the direction of the skew is the direction of the tail of the distribution

8
Q

What are bimodal distributions?

A

have multiple peaks, although not necessarily multiple modes
- it may be useful to analyze the 2 groups separately

9
Q

What is range?

A

the difference between the largest and smallest values in a data set

10
Q

What is interquartile range?

A

the difference between the values of the 3rd quartile and the 1st quartile (IQR = Q3 - Q1)
- can be used to determine outliers
- position of Q1 = n x 1/4
- position of Q3 = n x 3/4
- in the sorted data: if the position is a whole number, take the mean of the values at this position and the next; if it is a decimal, round up to the next whole number and take the value at that position as the quartile
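
The quartile position rule on this card can be sketched as follows. The function name `quartile` and the data are assumptions made for illustration; other textbooks and libraries use slightly different quartile conventions, so results may not match them exactly.

```python
import math

def quartile(sorted_values, fraction):
    """Quartile using the card's rule; fraction is 1/4 for Q1 and 3/4 for Q3."""
    n = len(sorted_values)
    position = n * fraction  # 1-based position in the sorted data
    if position == int(position):
        # Whole number: take the mean of this position and the next one.
        k = int(position)
        return (sorted_values[k - 1] + sorted_values[k]) / 2
    # Decimal: round up and take the value at that position.
    k = math.ceil(position)
    return sorted_values[k - 1]

# Hypothetical data set with n = 8.
data = sorted([1, 3, 4, 6, 7, 9, 12, 15])
q1 = quartile(data, 1 / 4)  # position 2 is whole: mean of 2nd and 3rd values
q3 = quartile(data, 3 / 4)  # position 6 is whole: mean of 6th and 7th values
iqr = q3 - q1
print(q1, q3, iqr)
```
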

11
Q

What is standard deviation?

A

a measurement of variability about the mean
- can also be used to determine outliers
- sigma = square root of [ sum of (value - mean)^2 / (n - 1) ]
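
As a minimal sketch, the formula can be worked through by hand and compared with `statistics.stdev`, which also divides by n - 1. The data set is hypothetical.

```python
import math
from statistics import stdev

# Hypothetical data set, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(values)

mean_value = sum(values) / n
# Sum of squared differences from the mean.
squared_differences = sum((v - mean_value) ** 2 for v in values)
# Divide by n - 1, then take the square root.
sample_sd = math.sqrt(squared_differences / (n - 1))

print(sample_sd, stdev(values))
```
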

12
Q

What are outliers?

A

values that lie far from the rest of the data; they may be a result of true population variability, measurement error, or a non-normal distribution

  • any value more than 1.5 x IQR below Q1 or more than 1.5 x IQR above Q3
  • any value that lies more than 3 standard deviations from the mean
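
The 1.5 x IQR rule can be sketched as below. The function name `iqr_outliers` and the data are assumptions for illustration, and the quartiles are passed in directly because they depend on which quartile convention is used.

```python
def iqr_outliers(values, q1, q3):
    """Return the values beyond 1.5 x IQR below Q1 or above Q3."""
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return [v for v in values if v < lower_fence or v > upper_fence]

# Hypothetical data; Q1 = 3.5 and Q3 = 10.5 are assumed to have been
# found beforehand with the n x 1/4 and n x 3/4 position rule.
data = [1, 3, 4, 6, 7, 9, 12, 40]
flagged = iqr_outliers(data, q1=3.5, q3=10.5)
print(flagged)
```
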
13
Q

What are independent and dependent events?

A
  • the probability of independent events does not change based on the outcome of other events
  • the probability of dependent events changes depending on the outcome of other events
14
Q

What are mutually exclusive outcomes?

A

cannot occur simultaneously

15
Q

What does it mean for a set of outcomes to be exhaustive?

A

there are no other possible outcomes

16
Q

How are hypothesis tests used?

A

use a known distribution to determine whether a hypothesis of no difference (the null hypothesis) can be rejected

17
Q

What does the p value determine?

A

the p value is compared with the selected significance level to determine whether a finding is statistically significant; the finding is significant when the p value is less than the significance level
- a significance level of 0.05 is commonly used

18
Q

How do the mean, median, and mode compare for a right-skewed distribution?

A

the mean of a right (positively) skewed distribution is to the right of the median, which is to the right of the mode
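
A small hypothetical data set with a long right tail illustrates this ordering. It is a sketch of one case, not a proof that the ordering holds for every skewed data set.

```python
from statistics import mean, median, mode

# Hypothetical right-skewed data: most values are small, a few are large.
right_skewed = [1, 2, 2, 2, 3, 3, 4, 5, 9, 15]

m_mode = mode(right_skewed)
m_median = median(right_skewed)
m_mean = mean(right_skewed)

# For this data set the mode is smallest and the mean is largest.
print(m_mode, m_median, m_mean)
```
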

19
Q

How do range and standard deviation generally relate to one another mathematically?

A

when the data themselves are not available, the range can be approximated as about 4 times the standard deviation

20
Q

Why would the average difference from the mean be an inappropriate measure of distribution?

A
  • the average difference from the mean is always 0, because the positive and negative differences cancel out
  • to avoid this, the standard deviation squares each difference from the mean, which forces all of the values to be positive so they cannot cancel out to 0, and then takes the square root at the end
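
The cancellation can be seen in a short sketch with hypothetical data: the signed differences average to 0, while the squared differences do not.

```python
# Hypothetical data set, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]
mean_value = sum(values) / len(values)

# Signed differences from the mean: positives and negatives cancel.
differences = [v - mean_value for v in values]
average_difference = sum(differences) / len(values)

# Squared differences are never negative, so they cannot cancel.
average_squared_difference = sum(d ** 2 for d in differences) / len(values)

print(average_difference, average_squared_difference)
```
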
21
Q

What are the probability rules?

A
  • and: multiply the probabilities (for independent events)
  • or: add the probabilities, then subtract the probability of both happening together
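
Both rules can be sketched with exact fractions. The two events, a fair coin landing heads and a fair die showing a six, are hypothetical and assumed to be independent.

```python
from fractions import Fraction

# Hypothetical independent events.
p_heads = Fraction(1, 2)  # a fair coin lands heads
p_six = Fraction(1, 6)    # a fair die shows a six

# "and": multiply the probabilities (valid because the events are independent).
p_heads_and_six = p_heads * p_six

# "or": add the probabilities, then subtract the probability of both.
p_heads_or_six = p_heads + p_six - p_heads_and_six

print(p_heads_and_six, p_heads_or_six)
```
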

22
Q

What are confidence intervals?

A

used to determine a potential range of values for the true mean of a population

23
Q

How is the p value calculated during a hypothesis test?

A

after the test statistic is calculated, a computer program or table is consulted to determine the p value of the statistic
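
As one possible illustration of the lookup step, the sketch below converts a z test statistic into a two-sided p value with `statistics.NormalDist`. It assumes the test statistic follows the standard distribution; the function name `two_sided_p_value` is hypothetical, and other tests (such as t-tests) use different distributions.

```python
from statistics import NormalDist

def two_sided_p_value(z):
    """Probability of a result at least as extreme as z in either tail."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

# A z statistic of 1.96 sits right at the common 0.05 significance level.
p = two_sided_p_value(1.96)
print(round(p, 3))
```
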

24
Q

How is power related to probability?

A

power is the probability of correctly rejecting the null hypothesis when the alternative hypothesis is true for the population

25
Q

How do exponential and parabolic curves differ in shape?

A

both have a steep component; however, exponential curves have a horizontal asymptote and become flat on one side, while parabolic curves are symmetrical and have steep components on both sides of a center point

26
Q

What is correlation?

A

a connection between data: a direct relationship, an inverse relationship, or otherwise

27
Q

What is causation?

A

a relationship in which a change in one variable directly produces a change in another; correlation does not necessarily imply causation, but causation does imply correlation

28
Q

What is required in order for a conclusion to be useful?

A

there must be practical (clinical) and statistical significance

29
Q

What is a type 1 error?

A

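mistakenly rejecting a null hypothesis that is actually true

  • its probability is set by selecting a significance level
  • occurs when the p value is less than the significance level (so the null hypothesis is rejected) even though the null hypothesis is true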
30
Q

How is the confidence level increased?

A

to increase the confidence level, one must increase the size of the confidence interval to make it more likely that the true value of the mean is within the range
- thus making the confidence interval wider
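
The widening can be seen in a minimal sketch. It assumes a z-based interval (sample mean plus or minus z x standard deviation / sqrt(n)), which is an assumption of this example and not stated on the card; the function name and the numbers are hypothetical.

```python
import math
from statistics import NormalDist

def confidence_interval(sample_mean, sd, n, level):
    """z-based interval for the mean at the given confidence level."""
    # z value that leaves (1 - level) / 2 in each tail.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    margin = z * sd / math.sqrt(n)
    return sample_mean - margin, sample_mean + margin

# Hypothetical sample: mean 100, standard deviation 15, n = 36.
low95, high95 = confidence_interval(100, 15, 36, 0.95)
low99, high99 = confidence_interval(100, 15, 36, 0.99)

# The 99% interval is wider than the 95% interval.
print(high95 - low95, high99 - low99)
```
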

31
Q

What is the most common measure of distribution?

A

standard deviation; it is most closely linked to the mean of a distribution and can be used to calculate p values, which are probabilities