Unit 1 Flashcards

1
Q

variable (def)

A

a characteristic that changes from one individual to another

collect data on variables from individuals

2
Q

categorical variable (def)

A

takes on values that are category names or group labels

ex. age group, dominant hand, gender, grade level, zip code

3
Q

quantitative variable (def)

A

takes on numerical values for a measured or counted quantity

ex. height, salt level in water, number of candies in a bag, age of building

4
Q

relative frequency is…

A

the proportion of the whole (count in a category divided by the total count)
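
A minimal sketch of turning counts into relative frequencies, using Python's standard library. The data set (dominant hand for 10 students) is made up for illustration.

```python
from collections import Counter

# Hypothetical sample: dominant hand for 10 students (made-up data).
hands = ["right", "right", "left", "right", "right",
         "left", "right", "right", "right", "ambidextrous"]

counts = Counter(hands)   # frequency (count) of each category
total = len(hands)        # size of the whole

# Relative frequency = count in the category / total count.
relative_freq = {category: count / total for category, count in counts.items()}

for category, proportion in relative_freq.items():
    print(f"{category}: {counts[category]} of {total} = {proportion:.0%}")
```

The relative frequencies of all categories always add up to 1 (100%).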

5
Q

what graphs are used for categorical data

A
  • bar charts (aka bar graphs)
  • pie charts
6
Q

Bar charts NEVER ____

A

touch!

7
Q

What are the categories in bar charts called (labels on the bottom)

A

bins

8
Q

pie charts only display

A

relative frequencies

9
Q

when comparing groups of different sizes, do you use counts or proportions

A

proportions
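
A quick sketch of why proportions are the fair comparison. The group names and counts below are made up for illustration.

```python
# Hypothetical survey (made-up numbers): students who walk to school.
groups = {
    "Grade 9": {"walkers": 30, "size": 200},
    "Grade 12": {"walkers": 24, "size": 80},
}

# Proportion = count in the group / size of the group.
proportions = {name: g["walkers"] / g["size"] for name, g in groups.items()}

for name, g in groups.items():
    print(f"{name}: count = {g['walkers']}, proportion = {proportions[name]:.0%}")
```

Grade 9 has the larger count, but Grade 12 has the larger proportion, so counts alone would give the wrong impression.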

10
Q

what are the 2 types of quantitative variables

A
  1. Discrete
  2. Continuous
11
Q

Discrete vs. continuous variable

A

Discrete: countable number of values (counting)
- ex. number of wins, number of Skittles, number of eruptions in a year
Continuous: can take any value in an interval, so the possible values are theoretically infinite (measuring)
- ex. weight of frog, speed of a car

12
Q

What graphs are for quantitative data

A
  • dot plot
  • stem-and-leaf plot
  • histogram (most common)
  • cumulative graph
13
Q

histogram vs bar chart

A

histogram
- bars touch
- labels/numbers on line
- quantitative

Bar chart
- space between bars
- labels in the bins
- categorical

14
Q

What does the slope mean in a cumulative graph

A

the steeper the slope, the more data in the range

no slope = no data
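
A rough sketch of the idea, assuming the cumulative graph plots cumulative relative frequency at the end of each interval. The intervals and counts are made up for illustration.

```python
from itertools import accumulate

# Hypothetical test-score counts per interval (made-up data).
intervals = ["50-60", "60-70", "70-80", "80-90", "90-100"]
counts = [2, 0, 10, 6, 2]

total = sum(counts)
# Cumulative relative frequency at the right edge of each interval.
cumulative = [running / total for running in accumulate(counts)]

previous = 0.0
for interval, cum in zip(intervals, cumulative):
    rise = cum - previous  # how much the graph climbs over this interval
    print(f"{interval}: cumulative = {cum:.2f}, rise = {rise:.2f}")
    previous = cum
```

The graph is flat over 60-70 (no data there) and climbs fastest over 70-80, the interval holding the most data.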

15
Q

What things do you need to include when describing the distribution of a quantitative variable/graph

A

Shape - unimodal, bimodal, symmetric, skewed
Outliers
Center - one value that summarizes data
Spread - how the data varies

16
Q

The mean is easily influenced by…

A

outliers (non-resistant)

best for symmetric data

17
Q

Is the median resistant or non-resistant

A

resistant because it is not influenced by outliers
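
A small sketch comparing how the mean and the median react to one outlier, using Python's `statistics` module. The building ages are made up for illustration.

```python
from statistics import mean, median

# Hypothetical building ages in years (made-up data).
ages = [10, 12, 13, 15, 20]
ages_with_outlier = ages + [150]  # add one very old building

print("without outlier:", mean(ages), median(ages))
print("with outlier:   ", round(mean(ages_with_outlier), 1), median(ages_with_outlier))
```

The single outlier pulls the mean from 14 to about 36.7, while the median only moves from 13 to 14.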

18
Q

when the data is roughly symmetric, the mean and median are ____

A

the same (or pretty close together)

19
Q

When the data is skewed left, the mean is ____ than the median

A

smaller

20
Q

When the data is skewed right, the mean is ____ than the median

A

larger
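
A short check of the skew rules, using two made-up data sets: the mean gets pulled toward the long tail while the median stays put.

```python
from statistics import mean, median

# Hypothetical data (made-up): one long tail on each side.
skewed_right = [30, 32, 35, 35, 40, 45, 133]  # tail of large values
skewed_left = [20, 85, 88, 90, 90, 92, 95]    # tail of small values

for name, data in [("skewed right", skewed_right), ("skewed left", skewed_left)]:
    print(f"{name}: mean = {mean(data)}, median = {median(data)}")
```

For the right-skewed set the mean (50) is larger than the median (35); for the left-skewed set the mean (80) is smaller than the median (90).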

21
Q

Percentile

A

the value such that P% of the data is less than or equal to it
- ex. If you take the SAT and land in the 95th percentile, it means that 95% of the test takers got your score or lower
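
A minimal sketch of that definition. The helper `percent_at_or_below` and the 20 scores are hypothetical, made up for illustration.

```python
def percent_at_or_below(data, value):
    """Percent of the data that is less than or equal to `value`."""
    at_or_below = sum(1 for x in data if x <= value)
    return 100 * at_or_below / len(data)

# Hypothetical scores for 20 test takers (made-up data).
scores = [880, 920, 950, 980, 1000, 1010, 1030, 1050, 1060, 1080,
          1100, 1120, 1150, 1180, 1200, 1250, 1300, 1350, 1450, 1550]

print(percent_at_or_below(scores, 1450))
```

Here 19 of the 20 scores are at or below 1450, so 1450 sits at the 95th percentile of this group.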

22
Q

Main points in percentiles

A
  • Q1: first quartile is the 25th percentile or the median of the bottom half of data
  • Median (overall): 50th percentile
  • Q3: 75th percentile or the median of the top half of data
23
Q

How to transform the data

A
  1. add/subtract a value
  2. multiply all the values by a constant
24
Q

how does adding/subtracting (transformations) affect the data

A
  • measures of center
  • measures of position (percentiles/quartiles)

Measures of spread are not affected; center and position just shift by the added value
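
A quick sketch of a shift, adding 10 to every value of a made-up data set. Range and standard deviation are used here as the measures of spread.

```python
from statistics import mean, median, pstdev

# Hypothetical data set (made-up values).
data = [2, 4, 4, 6, 9]
shifted = [x + 10 for x in data]  # add 10 to every value

def data_range(values):
    return max(values) - min(values)

print("mean:  ", mean(data), "->", mean(shifted))
print("median:", median(data), "->", median(shifted))
print("range: ", data_range(data), "->", data_range(shifted))
print("stdev: ", round(pstdev(data), 3), "->", round(pstdev(shifted), 3))
```

The mean and median both go up by 10, while the range and standard deviation stay the same.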

25
Q

how does multiplying (transformations) affect the data

A

affects all summary statistics (center, position, and spread)
- mean, median, IQR, etc.
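
A matching sketch for scaling, multiplying every value of a made-up data set by 3.

```python
from statistics import mean, median

# Hypothetical data set (made-up values).
data = [2, 4, 4, 6, 9]
scaled = [3 * x for x in data]  # multiply every value by 3

def data_range(values):
    return max(values) - min(values)

print("mean:  ", mean(data), "->", mean(scaled))
print("median:", median(data), "->", median(scaled))
print("range: ", data_range(data), "->", data_range(scaled))
```

Center and spread are both multiplied by 3 here: the mean goes from 5 to 15, the median from 4 to 12, and the range from 7 to 21.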

26
Q

elements of the 5 number summary

A
  1. min
  2. Q1
  3. median
  4. Q3
  5. Maximum

these five values are what a boxplot displays
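
A minimal sketch of computing the five values, assuming the "median of each half" convention for Q1 and Q3 (the overall median is left out of both halves when the number of values is odd). Other software may use slightly different quartile rules. The function name and the candy counts are made up for illustration.

```python
from statistics import median

def five_number_summary(data):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    values = sorted(data)
    n = len(values)
    half = n // 2
    lower = values[:half]                                  # bottom half
    upper = values[half + 1:] if n % 2 else values[half:]  # top half
    return (values[0], median(lower), median(values), median(upper), values[-1])

# Hypothetical number of candies in 9 bags (made-up data).
candies = [12, 15, 15, 16, 18, 19, 20, 22, 25]
print(five_number_summary(candies))
```

For this data the summary is min 12, Q1 15, median 18, Q3 21, max 25.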
