Applied Economics & Statistics 1: An Introduction to Statistics, Measurement, and Presentation of Data* Flashcards

1
Q

Explain why Statistics is important

A
  • Statistics is not just used in economics.
  • Required for many business subjects, social sciences, and hard
    sciences.
  • Needed to process and analyse data, and data available increases
    constantly.
  • Google tracks internet usage, supermarkets purchases, police crime
    data…
  • Statistical techniques required to make sense of the data and enable
    personal, business, and scientific decision making.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What’s a ‘statistic’?

A

A statistic is a number used to communicate a piece of information

The inflation rate is 5.2%.
The average mark on a module is 60%.
The price of a new Toyota Supra is £50,545.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What’s the name for ‘a number used to communicate a piece of information’?

A

A statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define ‘statistics’

A

The science of collecting, organizing, presenting, analysing, and interpreting data to assist in making more effective decisions.

- How does this year’s rate of inflation compare with last year’s? Is
there a trend of increasing or decreasing inflation? Is there a
relationship between inflation and interest rates?
- How does the module mark vary compared to previous years, and
other modules? Does changing the lecturer teaching the module
affect average marks?
- The price of a new Toyota Supra is £50,545. The Mazda MX-5 is
cheaper, costing £25,825. What are the differences in the cars’ specs,
and how are they related to price? What other information would you
need for a purchase decision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the different types of statistics?

A

Descriptive and Inferential Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What’s ‘Descriptive Statistics’?

A
  • Methods of organizing, summarizing, and presenting data in an
    informative way
  • Organize and summarize data with graphs and tables.
  • Statistical measures describe the characteristics of a distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What’s the name for ‘methods of organizing, summarizing, and presenting data in an
informative way’?

A

Descriptive Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define ‘Inferential Statistics’

A

The methods used to estimate a property of a population on the basis of a
sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What’s the name for ‘the methods used to estimate a property of a population on the basis of a
sample’?

A

Inferential Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define ‘population’ in statistics

A

The entire set of individuals / objects of interest or measurements
obtained from all individuals / objects of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define ‘sample’ in statistics

A

A portion, or part, of the population of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe & explain the types of statistical variables

A
  • Qualitative variable - The characteristic being studied is non-numeric. E.g.: gender, religion, eye
    colour
  • Quantitative variable - The characteristic is numerical and the numbers have a meaning. E.g.
    number of children in a family, hourly wage, minutes remaining in the
    lecture.
    . It can be further divided into:
    1. Discrete - These can assume only certain discrete values. There are usually “gaps” in
    between the values. E.g. children in a family —this variable can only take
    on a discrete set of values, 0, 1, 2, 3 etc.
    2. Continuous - These can assume any value within a range and can be measured to any
    required degree of precision. E.g.: weights, heights and time. The time it
    takes for each student to finish an exam can be measured anywhere along
    the real positive line, i.e. from 0 to infinity and could be measured to the
    millisecond.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the levels of measurement?

Why are they different?

A
  • Data can be further classified into 4 levels of measurement:
    1. Nominal data.
    2. Ordinal data.
    3. Interval data.
    4. Ratio data.
    Each require different methods for summarizing and presenting, and a
    different type of statistical analysis.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe & explain ‘nominal data’

A
  • Nominal Data - Data represented as labels
    or names. They have no order. They can only be classified and counted.
    E.g., hair colour, religion, sexual orientation, gender.
  • No other mathematical operations permitted.
    E.g., even if we assign numerical values like heterosexual=1, gay=2,
    bisexual=3, it makes no sense to say 1+2=3, therefore
    heterosexual+gay=bisexual.
  • Labels are mutually exclusive, e.g., can’t be both Christian and Muslim.
  • Labels are exhaustive: every individual observation must belong to a
    category (even if it’s ‘Other!’)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What’s the name for ‘data recorded at the nominal level of measurement represented as labels
or names; they have no order; they can only be classified and counted’?

A

Nominal Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe & explain ‘ordinal data’

A
  • Ordinal Data - Data based on a relative
    ranking or rating of items based on a defined attribute or qualitative
    variable. Variables based on this level of measurement are only ranked or
    counted.
    E.g., university league table position, educational attainment, level of
    satisfaction with your lecturer.
  • Differences between data values are meaningless: university A being
    above B in the leave table only says A is better than B, not by how much.
  • Can be compared and ranked as labels have relative values.
  • Can also
    find absolute and relative size of each label
17
Q

What’s the name for ‘data recorded that’s based on a relative ranking or rating of items based on a defined attribute or qualitative
variable; variables based on this level of measurement are only ranked or
counted’?

A

Ordinal Data

18
Q

Describe & explain ‘interval data’

A
  • Data where the interval or the
    distance between values is meaningful. The interval level of measurement is based on a scale with a known unit of measurement
  • Equal differences in the values are represented by equal differences in
    the measurements.
  • Known units of measurement e.g. degrees Celcius, or shoe size. Difference between 10◦C and 15◦C is the same as that between 20◦C
    and 25◦C.
  • Zero is a point on the scale, not the absence of a condition.
  • Ratios don’t make sense with interval data: a size 28 dress is not
    twice as large as a size 14
19
Q

What’s the name for ‘data recorded where the interval or them distance between values is meaningful and based on a scale with a known unit of measurement’

A

Interval data

20
Q

Describe & explain ‘ratio data’

A
  • Data that’s based on a scale with
    a known unit of measurement and a meaningful interpretation of zero on
    the scale
  • Zero means an absence of the characteristic.
  • Most quantitative data is recorded at this level.
    E.g.: weight, age, number of family members, income, population,
    investment, distance travelled, etc.
    A pint takes twice as much beer as a half pint.
21
Q

What’s the name for ‘Data recorded that are based on a scale with a known unit of measurement and a meaningful interpretation of zero on
the scale’?

A

Ratio data

22
Q

State the ways data can be presented

A
  • Frequency Distribution Table.
  • Histogram, Frequency Polygon and Cumulative Frequency Distribution
  • Bar Chart, Line Chart, Pie Chart, Scatter Plot
23
Q

Describe a frequency distribution and what the point of it is

Include definition

A
  • Suppose we had a set of data and the data are organised in tabular form. The table doesn’t give us much of an idea of how the sales are
    distributed.
  • That’s because it only presents the raw data.
  • A table that’s organized in some way would be much more useful.
  • A way of achieving this is with a frequency distribution, or frequency
    table
  • Frequency Distribution - Grouping of qualitative data into mutually exclusive and collectively
    exhaustive classes showing the number of observations in each class.
24
Q

What’s the name for ‘grouping of qualitative data into mutually exclusive and collectively
exhaustive classes showing the number of observations in each class’?

A

Frequency Distribution

25
Q

State the key components of a frequency distribution

A
  • Class Interval
  • Class Midpoint
  • Class Frequency
  • Relative Frequency
  • Cumulative Frequency
26
Q

How do you obtain the ‘class interval’ in a frequency distribution

A

Obtained by subtracting the lower limit of a class
from the lower limit of the next class.

27
Q

Which component of a frequency distribution is ‘obtained by subtracting the lower limit of a class
from the lower limit of the next class’?

A

Class Interval

28
Q

What’s the ‘class midpoint’ in a frequency distribution

A

A point that divides a class into two equal parts; the average of the upper and lower class limits

29
Q

What’s the ‘class frequency’ in a frequency distribution

A

The number of observations in each class

30
Q

In a frequency distribution, what’s ‘a point that divides a class into two equal parts; the average of the upper and lower class limits’?

A

Class Midpoint

31
Q

In a frequency distribution, what’s ‘the number of observations in each class’?

A

Class Frequency

32
Q

Describe & explain how a frequency distribution is constructed

A

Step 1: Decide on the number of classes. A useful rule to determine the
number of classes (k) is the “2 to the k rule.” This is where you choose the smallest number for k such that 2^k > n, where n is the total
number of observations.
For example, let’s say n here is 30, so:
24 =16 < 30
25 =32 > 30
26 =64 > 30.
Hence, the smallest k that satisfies the condition is 5 and so 5 classes
should be enough

Step 2: Determine the class interval. Class intervals should be the same
for all classes. First calculate the range of the dataset, the difference
between the highest and lowest value. Then divide this by the number of classes which was obtained in step 1. If this new number is a decimal, round up.
Step 3: Set the individual class limits. The lower band of the first class needs to be lower than the lowest value in the dataset. Then it should be easy to know the class limits of each class from knowing the class interval from step 2.
Step 4: Determine the frequency for each class interval by categorising each data point into the appropriate class. There is some loss of detail from grouping the data like this. We can’t tell
what the lowest data point is, or which data point is which.
Step 5: Add a column for relative frequency (the fraction, or proportion, of
observations in each class) by dividing the frequency for that class by total frequency. This allows us to understand the relative size of each class.
Step 6: Add a column for cumulative frequency. These accumulate the frequencies as
we move up the class intervals, allowing us to quickly see important points about the data like the distribution

33
Q

How is qualitative data usually presented?

A

Via a bar chart

34
Q

What type of data do bar charts show?

A

Qualitative data

35
Q

What’s a bar chart?

A
  • A graph that shows qualitative classes on the horizontal axis and the class frequencies on the vertical axis. The class frequencies are proportional to the heights of the bars.
  • Horizontal axis shows the variable of interest.
  • The vertical axis shows the frequency of each possible outcome.
  • There is a gap between the bars.
36
Q

What’s the name for ‘a graph that shows qualitative classes on the horizontal axis and the classfrequencies on the vertical axis. The class frequencies are proportional tothe heights of the bars’?

A

Bar Chart

37
Q

Describe pie charts’ use

A

Pie charts can also be useful to present qualitative data. They are especially useful when comparing relative frequencies.