chapter 1 - Statistics, Data, and Statistical Thinking Flashcards

1
Q

Statistics

A

science of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Descriptive Statistics

A

Describing sets of data using numerical and graphical methods to explore data patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Inferential Statistics

A

Drawing conclusions about sets of data based on sampling (need to distinguish between population and sample)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

units

A

rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

variables

A

columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Parameter of interest

A

what your data is focused on

numerical value that you want to draw conclusions about from a sample of data — typically characteristic of a population

ex: average income

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

experimental (observational) unit

A

object (person, thing, transaction, event) upon which we collect data

ex: individual students/health workers whom we collect data from

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Variable

A

numbers or characteristics that can be counted or measured

characteristic of the experimental units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Population

A

A set of experimental units that we are interested in studying

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sample

A

a subset of the units of the relevant population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Statistical Inference

A

estimation, prediction, or other generalization about a population based on information contained in a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Parameter

A

number that describes the whole population

any quantity computed from the observations in the population

ex: population mean (average) µ (mu)

average length of a butterfly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Statistic

A

any quantity computed from the observations in the sample

ex: ex: sample mean x̄ (x bar)

ex: the average income for a sample drawn from the U.S. is a sample statistic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Measure of reliability (measure of uncertainty)

A

statement (usually quantified) about the degree of uncertainty associated with a statistical inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Four Elements of Descriptive Statistical Problems

A
  1. The population or sample of interest
  2. One or more variables that are to be investigated
  3. Tables, graphs, or numerical summary tools
  4. Identification of patterns in the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Five Elemental of Inferential Statistical Problems

A
  1. The population of interest
  2. One or more variables that are to be investigated
  3. A sample
  4. The inference about the population based on information contained in the sample
  5. A measure of reliability for the inference
17
Q

Quantitative data (numerical data)

A

data that can be counted and measured in numeric values

ex: temperature
unemployment rate
test scores
number of female executives

18
Q

Qualitative data

A

measurements that cannot be recorded on a numerical scale; can only be classified into one of a group of categories

ex: zip codes
“yes/no”

19
Q

Representative sample

A

exhibits characteristics typical of those possessed by the population of interest

20
Q

Stratification

A

splitting population into strata and then a random subsample is taken from each stratum

SUB-SAMPLE EACH GROUP

21
Q

cluster sample

A

a sample in which each population unit belongs to a cluster, and the clusters are sampled

SAMPLE GROUPS

22
Q

Systematic sample

A

every kth unit in the population

selected according to a random starting point but with a fixed, periodic interval

divide population size by desired sample size

23
Q

Randomization

A
  • selection is fair
  • protects against biased samples
  • help represent all the features of the population
24
Q

Sampling variability (sampling error but no error has taken place)

A

sample-to-sample differences in the values of the variables