365DataScience - Intro Flashcards

1
Q

What is a Population?

A

A collection of all items of interest in a study.

  • uppercase N in the formula
  • Hard to observe
  • Hard to contact
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do you call the numbers obtained when working with a Population?

A

Parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an example of a Population?

A

The number of employees working at Anthem

- This is difficult to define because people work on site, at home, offshore, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Sample?

A

A subset of the population

  • lowercase n in the formula
  • Easy to observe
  • Easy to contact
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What do you call the numbers obtained when working with a Sample?

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an example of a Sample?

A

Interviewing 50 Anthem employees who eat their lunch in the Norfolk site cafeteria.

  • Not a good example since this isn’t a random or representative sample.
  • Most Anthem employees don’t work at the Norfolk building so we won’t be able to get a truly random sample.
  • Also since most don’t work here, it’s not true representation of the entire population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In Statistics, do you work more with Population data or Sample data?

A

Almost always work with Sample data and make decisions based on these samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A Sample must have what two traits for an insight to be precise?

A
  1. Randomness
    a. Members collected for the sample are by chance, i.e. random
  2. Representativeness
    a. Subset of population that accurately reflects the members of the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What’s the best way to get randomness and representativeness?

A

Getting access to a database of information that holds the necessary data for the population, i.e. Anthem employees, in order to create a true sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly