Chapter 2 Flashcards

1
Q

What are the two types of variables?

A

1) Categorical. 2) Quantitative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Categorical Variable

A

These variables that sort and organize the data into different categories or classifications are called categorical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

T/F One categorical variable will have multiple categories

A

True.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name the variable and the categories for the following:

sex(male or female.)

A

Sex is the categorical variables. Categories are female and male.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Quantitative variables

A

Numerical data that gives us some quantity is called quantitative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

T/F All quantitative variables are numeric, but not all numeric variables are quantitative.

A

True. If you can take an average, its probably quantitative. If not, its likely categorical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the two types of quantitative variables?

A

1) Continuous

2) Discrete.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Continuous variables

A

Can take on any value within an interval, and there is no theoretical distance between two observable values. Ex: Height, Weight, Temperature.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Discrete Variable

A

Has a minimum distance between its measurements. 4 births, 5 births, 6 births. Number of students in a class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Frequency

A

Simply means the number of occurrences of a categorical variable. 12 students had red hair.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Bar Graph

A

Displays categorical varibles. Categories on horizontal axis. Frequencies on the vertical axis.. Height of rectangles are equal to the category’s frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Pie Chart

A

Displays categorical data in terms of percentages of a circle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the weaknesses of pie charts.

A

If a subject can fall into more than one category, the chart can sum up to more than 100 percent. If there are too many categories, it can be difficult to determine which slices are relatively larger.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Contingency Table

A

Shows how two categorical variables relate to each other. Country and regional food preferences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Conditional Distribution

A

Restricts categorical variables to show distribution for just the cases that satisfy a certain condition. For instance murdered and not murdered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Dot Plot

A

Most simplistic way to display quantitative data. Displays dots which represent values and stack up on values on the x axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Histogram

A

A graph that uses bars to portray the frequencies of the possible outcomes of a quantitative variable. Y is observations within a certain range. X axis ranges of values a variable can take. IQ frequencies for 7th graders.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Distribution

A

The layout of a quantitative datasets. The curve of distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Modalitity

A

Mode is the high point(s) or most frequent observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Unimodal

A

Has only one high point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Bimodal

A

Two high points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Uniform

A

Relatively flat data with no peaks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Symmetric

A

Quantitative distributions are close to mirror images of each other

24
Q

Skewed Left

A

Distribution has a long left tail. Many high observations but few low observations.

25
Q

Skewed right

A

Long right tail. Many low observations, but few high observations.

26
Q

Mean

A

Sum of the observations divided by the number of observations.

27
Q

Median

A

The point that splits the data in two half of the observations are below it, half of the observations are above it.

28
Q

T/F The mean is resistant to extreme values of skewness, while the median is sensitive to extreme values in the data

A

False, the MEDIAN is generally resistant to extreme values in the data. The MEAN is sensitive to extreme values in the data set.

29
Q

Range

A

The difference between the smallest and largest observations. No resistant to extreme values by definition.

30
Q

Standard Deviation

A

How far observations in the data set typically fall from the mean. Distance of a data point from the mean.

31
Q

Variance

A

Square of the standard deviation.

32
Q

T/F Standard deviation and variance are not resistance to outliers

A

True.

33
Q

Standard deviation pample

A

s

34
Q

Standard deviation population

A

o

35
Q

Variance Sample

A

s^2

36
Q

Variance Population

A

o^2

37
Q

Sample Size

A

n

38
Q

Population Size

A

N

39
Q

Empirical Rule

A

If a distribution is b ell shaped then approximately:

1) 68 percent of the observations fall within one standard deviation of the mean.
2) 95 Percent of observations fall within 2 standard deviations of the mean.
3) Nearly all observations fall within 3 standard deviations of the mean.

40
Q

The pth percentile

A

The percentage of observations such that p percent fall at or below that value. 90th percentile SAT.

41
Q

Quartiles

A

25 percent is the first quartile.
50 percent second quartile (median
75 percent is the third quartile.

42
Q

Interquartile Range

A

Distance between Q1 and Q3

43
Q

T/F The interquartile Range is not resistance to skewness and extreme values

A

False, it is resistant to skewness and extreme values because it only looks at values surrounding the median.

44
Q

Outleier

A

Is an unusually small or unusually large observation.

45
Q

How do you flag a observation as being an outlier.

A

An observation is an outlier if the following are true. Q1-1.5 less than or Q32+1.5IQR greater than.

46
Q

Outlier Check 2

A

Any observation that falls 3 standard deviations above or below the mean.

47
Q

Five number summary of a data set.

A

1) minimum value.
2) First quartile,
3) Median
4) 3rd quartile
5) Maximum value.

48
Q

Box Plot

A

Graphical depiction of the 5 number summary.

49
Q

T/F Use mean and standard deviation only for reasonably symmetric distribution

A

True. Otherwise use five-number summary.

50
Q

Match the Following:
Standard Deviation Mean
IQR Median

A

Use mean and stanard deviation together since they are not resistant to skewness and extreme values. Use only for unimodal and symmetric data sets. Use median and IQR for skewed or non unimodal data. Resistant to skewness and extreme values.

51
Q

Sid by SIde Boxplots

A

Compare two or more groups on a quantitative variable.

52
Q

Z Score

A

The score of an observation tells us how many standard deviations the observation falls from the mean.
z= observation-mean/standard deviation- observation-x/s
Neg if below mean pos it above mean. If greater than 3 or less than -3 then it is an outlier.

53
Q

Variable

A

any characteristics observed in a study.

54
Q

Pareto Chart

A

Bar graph from highest to lowest frequency.

55
Q

Pareto Principle

A

That a small subset of categories often contain the most observations.

56
Q

Time Series

A

Data set collected over time

57
Q

Time Plot

A

Charts each observation on the vertical scale against the time it was measured on the horizontal scale. Ex Population over time.