Probability Flashcards

1
Q

normal distribution

A

continuous data with a symmetric distribution which is represented by a bell-shape
characterised by two parameters: μ (mean) and σ2 (variance),
where the mean corresponds to the centre and the variance corresponds to the spread/width of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

check the normality of data by

A

histogram
normal probability plot (Q-Q plot)
box plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how to examine a histogram to check distribution

A

It is bell shaped and symmetrical if the data are normally distributed.
also check the symmetry by comparing the mean and the median.
If they are approximately equal, then the data are symmetrical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

how to use Q-Q plot to check distribution

A

If the data are normally distributed, all the data points will lie on a straight line.
check that assumption.
It’s just a visual check, not an air-tight proof, so it is somewhat subjective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how to use box plot to check distribution

A

checking symmetry and outliers by the quartiles (vertical lines) of data from any distribution, because it is non- parametric. The thick vertical line representing the 2nd quartile is actually the median and the circles are outliers.
If the data are symmetrical, the lengths of the two dashed horizontal lines are equal and the thick vertical line should split the box into two equal parts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is a Q-Q plot

A

scatterplot created by plotting two sets of quantiles against one another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Two of the main distributions associated with discrete data

A

the binomial and the Poisson distributions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

binomial distribution

A

special distribution which can be thought of as the probability associated with a number, say n, of binary identically distributed events (with the same probability) occurring.
Therefore the maximum number of events is n and the minimum is 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The Poisson distribution

A

Unlike the binomial distribution, there is no upper limit on the value that the variable can take

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is a quantile

A

a value in the data set such that the specified percentage of data lie below that value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Bootstrapping

A

is any test or metric that uses random sampling with replacement, and falls under the broader class of resampling methods. Bootstrapping assigns measures of accuracy to sample estimates. This technique allows the estimation of the sampling distribution of almost any statistic using random sampling methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

a confidence interval

A

is a range of estimates for an unknown parameter. A confidence interval is computed at a designated confidence level; the 95% confidence level is most common

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is distrubtion

A

The pattern by which a
measurement or frequency
varies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly