Definitions Flashcards

Question 1

Q

DATA

Answer

A

Raw information from which statistics are created

Question 2

Q

POPULATION

Answer

A

The pool from which a statistical sample is drawn. eg. total number of tech start ups in Asia

Question 3

Q

SAMPLES

Answer

A

Samples are units collected from the statistical population

Question 4

Q

SYSTEMATIC SAMPLING

Answer

A

Systematic sampling is where units are collected at regular intervals eg. every 10th person.

Question 5

Q

STRATIFIED SAMPLING

Answer

A

Dividing population into strata (SUB GROUPS) and then selecting units from each strata. Random samples are then taken from each strata, normally in proportion to the actual percentage of occurrence of the strata in the population.

Question 6

Q

CLUSTER SAMPLING

Answer

A

Cluster sampling begins by dividing population into clusters. eg suburbs. Then randomly select clusters. Every unit in the clusters selected are included.

Question 7

Q

CATEGORICAL DATA

Answer

A

Categorical variables are variables that put them into categories, eg. male/female, black/white, age group.

Question 8

Q

NUMERICAL DATA

Answer

A

Numerical data is data that can me measured such as time, height, weight or amount.

Question 9

Q

DISCRETE DATA

Answer

A

A discrete variable is one where data is counted eg. How many eggs a hen lays each day. The variable can never be negative, and there will never be half an egg. All numbers can be written down, and are whole numbers. Can be qualitative or quantitative.

Question 10

Q

CONTINUOUS VARIABLE

Answer

A

A continuous variable is where data is measured. How many litres of milk will a cow give daily.

Question 11

Q

ORDINAL DATA

Answer

A

Ordinal measure of data is where data is arranged in order, however differences between data have no meaning. eg on a scale of 1-10 how happy are you.

Question 12

Q

QUANTATTIVE

Answer

A

Quantitative variable has a value or numerical measurement.

Question 13

Q

QUALITATIVE

Answer

A

Qualitative variable describes an individual by placing it into a category or group, eg male or female.

Question 14

Q

SIMPLE RANDOM SAMPLE

Answer

A

Sample taken from a population randomly where each unit has the same chance of being selected.

Question 15

Q

REPRESENTATIVE

Answer

A

A representative sample is a sample that represents the population.

Question 16

Q

BIAS (Statistics)

Answer

A

The opposite of representative, this is where there is bias in a sample.

Question 17

Q

Co-efficient of variation.

Answer

A

CV= Sample mean / sample standard deviation X 100%. Used to compare the spread of two different data types. eg. pounds to rupees.

Question 18

Q

Variance in regards to standard deviation.

Answer

A

The variance tells us the square of standard deviation.

Question 19

Q

Descriptive statistics.

Answer

A

The explanation of data from a sample through the use of graphs and other descriptive tools. eg averages, modes, etc

Question 20

Q

Statistics

Answer

A

Collection
Organisation
Analysis
Interpretation of
DATA

Question 21

Q

Inferential statistics

Answer

A

Using the data from a sample to infer information about a population.

Question 22

Q

Sampling frame

Answer

A

List of individuals that make up the sample.

Question 23

Q

Sampling error vs non-sampling error

Answer

A

Sampling error is the difference between the measurements from the sample and population. Non-sampling error is from poor sample design, sloppy data collection or faulty measuring equipment etc.

Question 24

Q

Observational study vs Experiment.

Answer

A

Observational study is where observations and measurements are taken in a way that doesn’t change the response of the variable. Experiment is where a treatment is deliberately imposed on the individuals in order to observe a possible change.

Question 25

Q

Control group

Answer

A

This is the group that receives a dummy treatment to compare against the test group.

Question 26

Q

Lurking variable

Answer

A

Will generally have an effect on both the explanatory and response, will generally be difficult to measure.

Question 27

Q

Confounding variable

Answer

A

A variable that cannot be controlled but will have an effect on what is being measured and is taken into account when conducting an experiment. A variable that can produce effects that are confused of confounded with the effects of the independent variable

Question 28

Q

Discrete probability distribution

Answer

A

A discrete probability distribution is a distribution where the possible outcomes are discrete ie. roll of the dice or a toss of the coin.

Question 29

Q

How do you know that a probability distribution is valid

Answer

A

It will add up to 1.

Question 30

Q

≤

Answer

A

Less than and equal to

Question 31

Q

How to write “Probability of between 1 and 3 happening?”

Answer

A

P(1≤X≤3)

Question 32

Q

µ or x bar.

Answer

A

Mu. In statistics represents the population mean. Xbar represents the sample mean.

Question 33

Q

Σ

Answer

A

The sum of

Question 34

Q

σ

Answer

A

Population Standard deviation

Question 35

Q

E(X) statistics

Answer

A

Expected value of X

Question 36

Q

What is a probability distribution

Answer

A

Describes the values that could occur and the probability that each value might occur.

Question 37

Q

X~Bin (n,p)

5 properties?

Answer

A

Binomial distribution.

Must have set number (n) trials
Each trial has only two possible outcomes, “success” or “failure”.
Results of each trial are independent of other trials.
Fixed probability (p) “success” in each trial.
(x) is defined as a number of successes in (n) trials.

Question 38

Q

At most

At least

Answer

A

At most is up to and including the number ≤.

At least is greater than and including ≥.

Question 39

Q

Short cut formulas for µ and σ of binomial distributions.

Answer

A

Mean µ = (np)

STDEV σ = √np(1-p)

Question 40

Q

X~N(µ,σ)

Answer

A

Formula for normal distribution

Question 41

Q

Standardise formula (z score)

Answer

A

x - µ
_____
σ

Question 42

Q

What is the standard deviation of the SAMPLING DISTRIBUTION OF THE MEAN called?

Answer

A

Standard error.

Question 43

Q

n=

Answer

A

sample size

Question 44

Q

SAMPLING DISTRIBUTION OF THE MEAN formula for changing standard deviation to sample error.

Answer

A

σ / √n

Question 45

Q

e

Answer

A

e is the error amount.

Question 46

Q

Rules for CLT?

Answer

A

Sample must be large enough.

Must be random sample. (30)

Question 47

Q

k

Answer

A

Critical value

Question 48

Q

Is it a random sample?

Answer

A

Not sure, read the question, ask the question.

Question 49

Q

What is the rule for T distribution use?

Answer

A

If n> 30, use normal distribution. If n< 30, use T-distribution.
T-distribution must come from a normally distibuted population.

Question 50

Q

Quartile
Decile
Percentile

Answer

A

Quartile distribution divided by 4 0.25
Decile distribution divided by 10 0.1
Percentile distribution divided by 100 0.01

Question 51

Q

Difference between x-bar and p-hat

Answer

A

You need to be a little cautious about assuming that particular symbols like xbar and phat will always have the same meaning, as they are just symbols. However, those two are quite common and consistent. The first is a mean which is the sum of the observations divided by the number of observations. The second is a proportion, the number of ‘successes’ divided by the number of ‘attempts’.

Question 52

Q

p

Answer

A

p is considered to be the exact probability of an event happening on a given trial.

Question 53

Q

Conditional probability

Answer

A

Where one variable effect the next ie if you have a bag of red and blue marbles, pulling one out changes the probability of the colour of the next one

Question 54

Q

Contingency table

Answer

A

Table where frequency proportions of events can be plotted and then cross calculated

Question 55

Q

Statistical independence

Answer

A

When one outcome does not effect another outcome or event.