Exam 1 Flashcards

1
Q

Population

A

All subjects being studied in a given area (ex. all people with schizophrenia in the United States)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample

A

Data from a subset of the population (sample size = n)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Biased Sample

A

The biased sample systematically overestimates or systematically underestimates a characteristic of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Random Sample

A

Every member of the population has the same chance of being included in the sample and the members of the sample are chosen independently from one another (meaning that the chance of a given number of the population being chosen does not depend on which other members are chosen

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sampling Error

A

Chance effects causing discrepancy between sample and population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sampling Bias

A

Systematic tendency for some individuals of the population to be selected more readily than others

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Nonresponse bias

A

Bias caused by persons not responding to some of the questions in a survey or not returning a written survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Systematic Sampling

A

Samples next to one another will never be picked (every nth term will be included in sample)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Stratified Random

A

Each subpopulation (stratum) is made up of a more homogenous collection of subjects (i.e. sample of state of New York- out of a larger whole)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Subsample size

A

Number of population size of each stratum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Statistical Inference

A

Making conclusions about a population based on a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Dotplot

A

A simple graph that can be used to show the distribution of a numeric variable when the sample size is small

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Histogram

A

A bar graph version of a dot plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skewed to the right

A

Values are more concentrated to the left side in the distribution, so the right side has a longer tail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Population Mean

Population Standard Deviation

A
mu = population mean
sigma = population standard deviation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Response vs. Explanatory Variable

A

Response Variable: focus of a question in a study or experiment
Explanatory Variable: Explains changes in that variable

17
Q

Qualitative vs. Quantitative Variables

A

Quantitative: measures that can be written by numbers, discrete and continuous
Qualitative: exploratory research on underlying opinions, motivations. Nominal and Ordinal

18
Q

What is the difference between a statistic and a parameter?

A

Statistic: a numerical measure that describes a sample
Parameter: a numerical measure that describes a population

19
Q

What is noise in statistical terms?

A

A term for recognized amounts of unexplained variation in a sample.

20
Q

Efficiency

A

Takes full advantage of all the information in the data, so fewer observations or data points are needed to get the same performance

21
Q

What are 4 ways to measure the spread of your data?

A

Range, Average Deviation, Variance, and Standard Deviation

22
Q

What is a significant result?

A

The likelihood that a relationship between two or more variables is caused by something other than random chance. Statistical hypothesis testing is used to determine whether the result of a data set is statistically significant.

23
Q

(A) U (B) - Union or intersection?

A
Union = one or the other or both events occur
Intersection = event that both occurred
24
Q

Type I Error

A

We reject the null hypothesis when HA is true. Claim that data provide evidence that significantly supports HA when H0 is true.

25
Q

Type I Error

A

We reject the null hypothesis when HA is true (finding significant evidence for HA when H0 is true). If HA is true but we do not observe sufficient evidence to support HA.

26
Q

Type II Error

A

The null is true when its actually not true (not finding significant evidence for HA when HA is true)

27
Q

P-Value of a hypothesis test

A

Probability, under the condition that H0 is true, of the test statistic being at least as extreme as the of the observed value. (measure of compatability between the data and H0)

28
Q

Random variable

A

Takes on numerical values that depend on the outcome of a chance operation

29
Q

QQ Plot

A

We are comparing our sample values to theoretical values (aka comparing our distribution to the normal distribution to find the relationship) to see how the values fit to see how normal our data is

30
Q

Sampling Variability

A

Variability among random samples from the same population

31
Q

Sampling Distribution

A

A probability distribution that characterizes some aspect of sampling variability

32
Q

Central Limit Theorem

A

No matter what distribution Y may have in the population, if the sample size is large enough, then the sampling distribution of Y will be approximately a normal distribution

33
Q

Beta

A

The probability of committing a Type II Error

34
Q

Power

A

1 - Beta
Power = 1 - Pr(accepting H0 when it is false)
A test is more powerful when it correctly rejects the null when there is an alternate hypothesis available

35
Q

ts (test statistic)

A

It is a measure of how far the difference between the sample means (y bar) is from the difference we would expect to see if H0 were true expressed in relation to the SE of the difference - the amount of variation we expect to see in differences of means from random samples.

36
Q

P-value

A

is the area under students t curve in the double tails beyond -ts and +ts

37
Q

Why do we divide the variance by n-1 instead of n?

A

We want average of the sample variances for all possible sets of samples to equal the population variance
Using n – 1 as denominator produces an unbiased estimate of σ2