chapter 1: picturing distributions with graphs Flashcards

1
Q

Individuals

A

Particular things

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Population

A

Set of individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Variable

A

an attribute of an individual.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

value (of a variable)

A

any way that that variable could be exhibited by an individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

True or False: every individual must have only a single value for any given variable

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data

A

numbers with a context, or, values of variables for the individuals in a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Dataset

A

The particular data that we are presented with

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

observation

A

a member of our dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

sample

A

That part of the population from which our observations come

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The size of a dataset (or sample)

A

the number of observations in it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

in order to clearly define a statistical problem, we must…

A

we must first clearly state the population and the variables that it concerns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

exploratory data analysis

A

when you seek simply to describe datasets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

distribution

A

a description of the values a variable takes and how

often it takes them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

graph

A

a visual representation of a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

count

A

a category’s size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If we want to know the proportion of observations in a dataset that are in a given category…

A

we divide the count of that category by the sample size

17
Q

If we want to know the percentage of observations in a dataset that are in a given category…

A

we multiply the proportion of observations in the dataset that are in that category by 100%

18
Q

two ways we can look at the size of a given category

A

we might be interested in how it compares to either the sizes of the other categories, or in how it compares to the size of the population

19
Q

To compare the sizes of the categories with each other

A

we use a bar graph

20
Q

One drawback of bar graphs is

A

it is difficult to look at proportions of observations using them

21
Q

To compare the sizes of the categories with the size of the dataset

A

we calculate the percentage of the dataset that fall into each category

22
Q

to compare the percentage of any given category not only to each other, but… to the dataset as a whole as well

A

we use a pie chart

23
Q

roundoff error

A

when the error in percentage totals reflect the accumulated errors in rounding

24
Q

when dealing with quantitative data, we must have [blank]

A

a unit of measurement

25
Q

If we allow the value of a variable for a single individual to change, we must also allow [blank] for it to do so

A

time, on a time plot

26
Q

the long-term upward or downward movement

over time on a time plot

A

trend

27
Q

The amount of change over time in a time plot can be deceiving, so make sure that you [blank]

A

check the scale of the variable and where its values begin on the 𝑦-axis

28
Q

For small data sets, if we want to compare the values that a quantitative variable takes among different
individuals in a population, we can use a [blank]

A

stem plot (numbers in a column (“stem”) at left, other numbers stretching in rows to the right. you lop off the latter number on the stem and the former on the leaf

29
Q

In a stem plot, we reduce the digits in each observation to only the first digit, which we call a…

A

leaf (get it?). Ex. a score of 107 is represented by the 7 to the right of the 10 in the stem

30
Q

True or false: we NEVER include commas or decimal points in a stem plot

A

True

31
Q

For larger quantitative datasets, ranges into which the values of the data fall are called [blank]

A

bins

32
Q

When you make quantitative data categorical in bins, you can then make a bar graph called a [blank]

A

histogram

33
Q

we define statistics as…

A

the science of learning from data

34
Q

right-skewed or left-skewed

A

when data is further away from the center on one side than the other