222) 3) Chapter 3: Descriptive Statistics Flashcards

1
Q

What two key features quantitative variables describe numerically according to descriptive statistics?

A

The center of the data: typical observation. The variability of the data: the spread around the center.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Comparing qualitative categories using relative frequencies?

A

Relative frequency is the proportion or percentage of the observations that fall in that category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Histograms?

A

A graph of a relative frequency distribution for a quantitative variable is called a histogram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The Mean?

A

AVERAGE. The best known and most commonly used measure of the center is the mean. Mean is the sum of the observations divided by the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What denotes this symbol?

A

Sample Size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is denoted by this symbol?

A

The variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is denoted by this symbols? y1 y2 ….

A

Observations.

The n sample observations on a variable y are denoted by y1 for the first observation, y2 for the second and so forth.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is denoted by this symbol?

A

Read as y-bar.

Denotes sample mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What says the definition of sample mean?

Sample mean is y-bar.

A

It says that:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Shortened expression for the samle mean of n observations?

A

Sample mean is equal to sum of all variables divided by number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is outlier for the mean?

A

When one observation is much larger or smaller that the others, such as in highly skewed distrbutions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Properties of the mean?

A
  1. Formula for the mean uses numerical values for the observations so it is appropriate only for quantitative variables.
  2. The mean can be higly influenced by an observation that falls well above or well below the bulk of the data, called an outlier.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is weighted average for combined set of observations (2 sets of data with sample sizes n1;n2)

A

Expressed by sample mean multiplied by sample size of first data set + sample mean multiplied by sample size of second data set divided by sum of sample sizes of both data sets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is median?

A

The median is the observations that falls in the middle of the ordered sample. When the sample size n is odd, a single observation occurs in the middle. When the sample size is even two middle observations occur, and the median is the midpoint between two.

Index of middle observation?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Properties of the median?

A
  1. Median like a mean is appropriate for quantitative variables.

Also valid for ordinal-scale (requires oredered observations).

Not appropriate for nominal-scale data.

  1. For symmetric distributions median and the mean are identical
  2. For skewed distributions mean lies toward the direction of skew (longer tail) relative to the median.
  3. The median is insensitive to the distance of the observations (variable values).
  4. Is not affected by outliers.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In which cases mean or median are mostly preferred?

A

If a distribution is higly skewed the median is usually preferred because it better represents what is typical

If the distribution is close to symmetric or only mildly skewed or of it is discrete with few distinct values, the mean is usually prefferred, because it uses the numerical values of all observations.

17
Q

What is Mode?

A

The mode is value that occurs most frequently.

18
Q

Properties of mode?

A
  1. Apporpriate for all types of data.
  2. Can get bimodal value. If answers are polarized there ir 2 almost equal often answers.
  3. Mean, median, and mode are identical for unimodal, symmetric distribution.
19
Q

Variability of the data?

What is range?

A

When the relative frequency for one observation is higher that same relative frequency for another observation.

Range is difference between the largest and smallest observation.

20
Q

Describe range?

A

When one observations has more and greater variability than another observation.

21
Q

Deviation from the sample mean y-bar?

A

Deviation of an observation yi from the sample mean y-bar is: image 1

22
Q

When the deviation is negative or positive?

A

If the observation falls below the mean deviation is negative.

23
Q

Standard deviation expression?

A