Lecture 11: Statistics Theory Flashcards

1
Q

What does Statistic Mean?

A

It is the branch of mathematics associated with:

  • collecting and analysing data to describe things
  • interpret things
  • predict things
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Who uses statistics and why??

A

People who purposefully collect data, and want to understand the information they have collected.

  1. Government: Collect for population info.
    1. size, structure and distribution
    2. changes in these values over time (birth, migration, age)
  2. Scientists: related to experiments and use to try to understand situations
    1. ​relationships between treatments and outcomes
    2. correlation between causes and effects
  3. Engineers: regarding performance of a material under certain conditions
    1. mean time to failure
    2. stress testing/ load rating
    3. reliability
  4. Managers: to make informed decisions
    1. return on investments
    2. rostering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the two main branch of statistics?

A
  1. Inferential
  2. Descriptive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Explain Descriptive Statistics

A
  • Statistics used to summarise or describe the main characteristics or a set of data
  • e.g. average daily temperatures (max/min)
  • useful for analysing sets of data to understand what the data shows or to enable comparisons
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Inferential statistic

A
  • useful for decision making when a data for the entire population is unavailable
  • make conclusions about characteristics of a population based on characteristics of a “random sample” selected
  • e.g. aim: quality assurance - proportion of defective items in a batch estimated based on results of a selected sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the term Variable mean?

A

A characteristic of interest that can take different values. Can be either Discrete or continuous.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Discrete variable?

A

the set of possible outcomes is countable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is continuous variable?

A

the outcomes can take values on a continuous scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the term population mean?

A

all members of the group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the term parameter mean?

A

a numerical characteristic of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the term sample mean?

A

a subset of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the term subsample mean?

A

a subset of the sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the term statistic mean?

A

a characteristic of a sample or subsample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain Descriptive Statistic

A

How do you organise large data?

  1. Organise Data
    1. a frequency distribution table
    2. high and low value are easy to distinguish
    3. common values are obvious
    4. easy to perform calculations
    5. frequency values might indicate the shape of the distribution
  2. Graphic representation
    1. bar chart
    2. table
    3. frequency polygon
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the formula for probability distribution?

A

Relative frequency = score frequency / total number of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describing Distributions. What are some terms to consider?

A
  • Measures of central tendancy
    • arithmetic mean (average)
    • median
    • mode
  • measures of dispersion
    • range
    • variance
    • standard deviation
  • common shapes of distribution
    • symmetric
    • skewed
17
Q

Explain all three Central Tendencies.

A
  1. Mean: add all elements/ total of elements
  2. Median: middle number
    1. if there are two: add and divide by 2
  3. Mode: Most occurring
    1. bimodal: two number occur same amount
18
Q

Explain Dispersion terms:

A

Dispersion: how dispersed data is

  • range: Large - smallest. the difference
  • variance: calculates how much each value deviates from mean
  • deviation: difference between a score X and mean
  • standard deviation: the square root of a variance
19
Q

What is symmetric? List the three examples of symmetric shapes.

A

Symmetric: shape can be folded along a vertical axis so the two sides coincide

  • rectangular
  • uniform
  • bell-shaped
20
Q

What is skewed? give examples.

A

Skewed: a non symmetric shape

  • J curve
  • Gamma distribution

If positive skewed. Mean is greater then Median

if Negative skewed. Mean is Less then Median

21
Q

List two main common distributions. Give a brief explanation.

A
  1. Binomial - applies to discrete variables
    1. family distribution where:
      1. n = number of samples
      2. p = probability of success
  2. Normal - applies to continuous random variables
    1. family distribution
      1. mean
      2. standard deviation
22
Q

Principles of inferential statistics.

A

we begin with a sample of data points and try to determine the distribution of the rest of the population.

23
Q

what does estimation mean

A

is the process of summarising a sample to only describe it wih a characteristic

24
Q

confidence intervals:

A

are used to quantify how confident we are about our estimations

25
Q

hypothesis testing

A

uses a sample data to evaluate the credibility of a claim about the population

  • is an assumption about the population parameters
  • it might not be true - we use experimental data to decide.