Data And Sampling Distributions Flashcards

1
Q

Bias

A

Measurement or sampling errors that are systematic and produced by the measurement or sampling process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

X-bar and mu

A

X-bar = the mean of a sample from a population, whereas μ

is used to represent the mean of a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Selection bias

A

Practice of selectively choosing data in a way that leads to a conclusion that is misleading or ephemeral

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data snooping

A

Extensive hunting through data until something interesting emerges

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Vast search effect

A

If you repeatedly run different models and ask different questions with a large data set, you are bound to find something interesting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Target reshuffling

A

A permutation test to test to validity of predictive associations that a data mining model suggests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Typical forms of selection bias

A
  • no random sampling
  • cherry-picking data
  • selection of time intervals that accentuate a particular statistical effect
  • stopping an experiment when the results look interesting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Regression to the mean

A

Phenomenon involving successive measurements on a given variable: extreme observations tend to be followed by more central ones
(Not linear regression)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly