Foundation of Data Analytics Flashcards

1
Q

Parameter Def.

A

Value of a variable that characterises a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Statistics Def.

A

Value of a variable that characterises a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Population Parameter Examples

A

Mean (mew) and standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sample Parameter Examples

A

Mean (x bar) and standard error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Probability Sampling Def.

A

Every member of population has a known chance of being selected. Warrants better representation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sampling Bias

A

Some members of a population are systemically more/less likely to be chosen. Forms a biased sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Uniform Occurrence

A

All variables are equally likely to be chosen

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sampling Distribution Def.

A

Compilations of means of different samples across a population. The mean of a sampling distribution curve should be close to population mean. Always normally distributed. Gives support for inferences from sample to population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Statistical Dispersion Def.

A

Fluctuations in means in samples of the same population. Caused by sample size (bigger N = lower fluctuation) and original population variation/outliers (smaller sd = smaller variations)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

CLT States

A

Sample mean should be similar to pop mean, standard error should be smaller then standard deviation and sample means will alwys be normally distributed (and the mean of the sampling distribution should be equal to pop mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Standard Error Relationship To Accuracy of sample to pop representation

A

Large standard error = sample mean is inaccurate representation of true pop parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Random Variation

A

How unexpalanable variation can be accounted for and modeled. May be produced via observation through random sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is sampling error based on sampling distribution

A

Estimates random sampling error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Interval Estimates

A

Select a range of values that could hypthetically be the pop mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Point Estimate

A

Selecting 1 value to hypothetically be pop mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

95% confidence intervals

A

Assuming that 95% of samples include pop mean (through sample mean or standard error). Gained by cutting 2.5% off your data at either end (remove outliers)

17
Q

Normal Curve Def

A

Graph form that helps describe errors in measurement

18
Q

How is probability of occurrence measured

A

Area under curve

19
Q

Z scores

A

A form of standardising scales. Standardising standard deviation