1.1-1.5 Flashcards

1
Q

Define statistics.

A

Statistics is the science of data. More specifically, it is the science of the collection, classification,
analysis, and interpretation of information/data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is it important to ensure statistics are being applied properly? Provide an example.

A

Must ensure statistics are being applied properly, otherwise, there can be catastrophic
effects
o Example: Sally Clark was falsely convicted of the murder of her own child due to
the improper use of statistical findings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the two processes involved in statistics?

A

o Describing data sets
o Drawing conclusions from that data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define measurement in statistics.

A

Measurement is the process we use to assign numbers to variables of individual
population units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some ways measurements can be made>

A

▪ Using instruments
▪ Asking questions
▪ Ratings using scales

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is it important to know whether the data you are dealing with is qualitative or quantitative?

A

It is essential that you understand whether the data you are interested in are
quantitative or qualitative, since the statistical method appropriate for describing,
reporting, and analyzing the data depends on the data type (quantitative or qualitative).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the three ways of obtaining data?

A

o Published source
o Designed experiment
o Observational study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the two classes of variables?

A

Quantitative and qualitative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two subclasses of quantitative variables?

A

discrete and continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two subclasses of qualitative variables?

A

Ordinal and nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the four elements of a descriptive statistics problem?

A
  1. The population or sample of interest
  2. One or more variables (characteristics of the population or sample units) that are to be
    investigated
  3. Tables, graphs, or numerical summary tools
  4. Identification of patterns in the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the five elements of an inferential statistics problem?

A
  1. The population of interest
  2. One or more variables (characteristics of the population units) that are to be investigated
  3. The sample of population units
  4. The inference about the population based on information contained in the sample
  5. A measure of the reliability of the inference
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a response value?

A

What we are interested in finding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a sample size?

A

Number of participants in the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a population?

A

Set of all experimental units we are interested in studying

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a sample?

A

Subset of population of which we have actual observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Why do we use samples?

A

Use samples because it is often impossible to gather data on all experimental
units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Define descriptive statistics.

A

Utilizes numerical and graphical methods to look for patterns in a
data set, to summarize the information revealed in a data set, and to present that
information in a convenient form.

19
Q

WHat is inferential statistics?

A

Utilizes sample data to make estimates, decisions, predictions, or
other generalizations about a larger set of data.

20
Q

What is a variable?

A

A characteristic or property of an individual experimental (or observational)
unit in the population.

21
Q

What is a census?

A

A study that includes a measurement of every experimental unit.

22
Q

What is a statisitical inference?

A

An estimate, prediction, or some other generalization about a
population based on information contained in a sample

23
Q

What is a measure of reliability?

A

A statement (usually quantitative) about the degree of
uncertainty associated with a statistical inference.

24
Q

Define qualntitative data

A

Data that are measured on a naturally occurring numerical scale.

25
Q

Differentiate between interval and ratio data.

A

Can be Interval Data – The origin has no meaning
or
o Ratio Data – Origin is a meaningful number

26
Q

Define qualitative data.

A

Measurements that cannot be measured on a natural
numerical scale; they can only be classified into one of a group of categories

27
Q

Differentiate between ordinal and nominal data.

A

Can be Nominal – Unable to be ranked
or
o Ordinal – Can be ranked/ordered

28
Q

What is a published source?

A

The data set of interest has already been collected.

29
Q

What are designed experiments?

A
  • Conducted by the researcher and the units in the study have
    strict controls.
30
Q

WHat is an observational study?

A

Researchers observe experimental units in their natural settings
for an observational study and records the variables of interest. In this case, there are no
controls placed on the units

31
Q

What is a representative sample?

A
  • Exhibits characteristics typical of those possessed by the target
    population.
32
Q

What is a simple random sample?

A

Simple Random Sample - of n experimental units is a sample selected from the
population in such a way that every different sample of size n has an equal chance of
selection.

33
Q

What is selection bias?

A

Selection bias occurs when some experimental units in the population have less chance
of being included in the sample than others

34
Q

What is nonresponse bias? When is it likely to happen?

A

a type of selection bias that results when data on all experimental
units in a sample are not obtained. In surveys where people choose whether or not they will respond.

35
Q

What is measurement error? Give an exmaple.

A
  • inaccuracies in the values of the data collected. In surveys, the
    error may be due to ambiguous or leading questions and the interviewer’s effect on the
    respondent.
36
Q

What is a parameter?

A

A numerical summary of the population

37
Q

What are some drawbacks of random sampling>

A

Not everyone wants to participate.
Those selected may not be qualified to participate.
The random sample might not have enough range.

38
Q

Explain stratified random sampling.

A

The population is divided into non-overlapping subpopulations (strata).
Characteristics of cases are more similar within strata than across strata.
* Then take a simple random sample in each stratum.
* Combine the random selection from each stratum to make overall sample.

39
Q

Explain cluster sampling. When is this method commonly used?

A

The population is divided into large number of (small) clusters.
* We randomly sample a few clusters.
* We then choose all cases within the sampled clusters.
* This technique is usually used when the population is dispersed, e.g. across a wide geographic region.

40
Q

What is systematic sampling?

A

Cases are selected in an ordered sampling scheme.
* First case is selected at random, subsequent elements
follow a predetermined pattern.
* Selecting kth case from the list of all cases.

41
Q

What is the most common type of nonprobability sampling? Explain what it is.

A

Most common nonprobability sampling is volunteer sampling.
* when surveying (e.g., sending questionnaire) to a selection of people is not at random and receive volunteer
response.

42
Q

Describe the difference between a statistic and a parameter.

A

A parameter is a number describing a whole population (e.g., population mean), while a statistic is a number describing a sample (e.g., sample mean).

43
Q

Explain the difference between an explanatory variable and a response variable.

A

An explanatory variable is the expected cause, and it explains the results. A response variable is the expected effect, and it responds to other variables.