Statistics Flashcards

Question 1

Q

What is a Sample Statistic?

Answer

A

Is a quantity that describes some characteristic of a sample with respect to a specific variable.

Question 2

Q

Pro and Con of Median

Answer

A

Pro: Is insensitive to extreme scores in the data set

Con: Doesn’t reflect the shape of the scores – i.e. doesn’t care how far away extreme scores are

Question 3

Q

Pro and Con of Mode

Answer

A

Pro: Easy to calculate from a histogram and easy to understand – the most common value.

Con: Data set might have more than 1 mode or no mode at all

Question 4

Q

2 Types of Non-Normally Distributed Data

Answer

A

Skewed

Bimodal

Question 5

Q

Define Conditional Probability

Answer

A

Probability of an event given that something else is known/assumed, i.e. when given/assuming some other additional information.

Question 6

Q

What does the z-score measure?

Answer

A

z measures how far away your sample is from the population mean in multiples of the standard deviation (how many standard deviations away is your sample from the mean)

Question 7

Q

Define sampling Error.

Answer

A

The error associated with examining statistics calculated from a sample rather than the population

Occurs because in our sample we do not have all the members of the population

Pop. Parameters and sample statistics differ due to sampling error.

Question 8

Q

How does sample size effect magnitude of sample error?

Answer

A

BIGGER SAMPLE = BIG SAMPLING ERROR LESS LIKELY

SMALLER SAMPLE = BIG SAMPLING ERROR MORE LIKELY

Question 9

Q

Define Sampling Distribution.

Answer

A

A distribution of a sample statistic (e.g. mean, s.d., median, etc…) obtained by repeatedly sampling from a population.

Tells us important information about how a statistic changes from sample to sample

Question 10

Q

Define Sampling Distribution of the Mean (SDM)

Answer

A

The sampling distribution of the mean describes a distribution ofsample means derived from samples of size N from a parent population.
The standard deviation of this distribution is commonly referred to as the standard error of the mean or standard error for short.

Question 11

Q

Define the Central Limit Theorem.

Answer

A

The sampling distribution of the mean approaches a normal distribution, as the sample size increases.

As a sample size increases, the sample mean and standard deviation will be closer in value to the population mean μ and standard deviation σ.

A sufficiently large sample can predict the parameters of a population such as the mean and standard deviation.

Question 12

Q

What is a Confidence interval?

Answer

A

A confidence interval (CI) describes an interval (i.e. a range) of values for our population parameter, together with a specified level of confidence that the parameter is in that range

It is simply a way to measure how well your sample represents the population you are studying.

Question 13

Q

What is Type 1 Error

Answer

A

When we reject H0 when it is in fact true. (H1 is False)

Type 1 errors will occur naturally (i.e. just due to random sampling error) with probability p = a (i.e. 0.05)

Question 14

Q

What is Type 2 Error?

Answer

A

We Fail to reject H0 when it is in fact incorrect. (H1 is True)

Question 15

Q

Why does Type 1 Error Occur?

Answer

A

Type I errors occur because even if your p-value is small there is still a (small) chance that your data was unusually extreme (and so you rejected the NULL) just due to sampling error.

Question 16

Q

Why does Type 2 Error Occur?

Answer

A

Type II errors often arise because of a problem with your study:

- Perhaps your sample was biased 
- Perhaps there was an error in your experimental task 
- Perhaps your sample size was too small

Question 17

Q

What is Continous Variable?

Answer

A

Continuous variables can take on absolutely any value within a given range.

Question 18

Q

What is a discrete Variable?

Answer

A

Discrete variables can only take on certain discrete values in a range.

Question 19

Q

What is a Categorical Variable?

Answer

A

Categorical variables are those in which we simply allocate people to categories.