Descriptive Statistics Flashcards

1
Q

Define central tedency

A

the tendency for the values of a random variable to cluster round its mean, mode, or median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define mean, median and mode

A

mean - average
median - middle value of data set
mode - most common number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the 4 measures of variability

A
  • Standard deviation
    • Interquartile range
    • Confidence intervals
    • Z - scores
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define standard deviation

A

The dispersion of values around the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define interquartile range

A

which is the difference between the first and third quartiles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define confidence intervals

A

a range of values so defined that there is a specified probability that the value of a parameter lies within it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define Z - scores

A

A z-score describes the position of a raw score in terms of its distance from the mean when measured in standard deviation units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a high and low standard deviation

A

Low standard deviation means data are clustered around the mean, and

high standard deviation indicates data are more spread out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define correlation

A

Correlation is a statistical measure that expresses the extent to which two variables are linearly related

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define regression

A

a measure of the relation between the mean value of one variable (e.g. output) and corresponding values of other variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Defije multiple regression

A

explains the relationship between multiple independent or predictor variables and one dependent or criterion variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define p value

A
  • P-value is the probability that a random chance generated the data or something else that is equal or rarer
  • P value is a number between 0 and 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the P value threshold for statistical significance

A
  • Threshold for statistical significance is most commonly <0.05
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Lower the P value = what

A

Greater amount of statistical significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a P value of 0.05 denote

A

5% probability that the results happened by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define linear regression

A

Linear regression expresses the relationship of two variables by fitting a linear equation to observed data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Explain what a linear regression graph will look like for each value

R = 0
R = -1.0
R = +1.0
R = +0.06
A

R = 0 will result in a circle on data plot

R = -1.0 will result in a diagonal line from top left to bottom right with dots along line

R = +1.0 will result in a diagonal line from bottom left to top right with dots along line

R = +0.6 will result in diagonal line from bottom left to top right but dots are spread a bit away from line (Same for -0.6 but different direction)

18
Q

Define pearsons correlation

A
  • Pearson’s correlation coefficient is the test statistics that measures the statistical relationship, or association, between two continuous variables.
19
Q

Define high, medium and low degree of correlation for r

A
  • High degree: If the coefficient value lies between ± 0.50 and ± 1, then it is said to be a strong correlation.
    • Moderate degree: If the value lies between ± 0.30 and ± 0.49, then it is said to be a medium correlation.
    • Low degree: When the value lies below + . 29, then it is said to be a small correlation.
20
Q

What does correlation not tell u

A

does not tell you whether one variable causes the other

21
Q

Define and explain regression equation

A
  • Y = bX + C
    • Y is the dependent variable
    • X is the independent variable
    • B is the slope or regression coefficient
    • C is the intercept of the Y axis
22
Q

Define forced entry regression

A

Produce one R value

23
Q

Define stepwise regression

A

Produce one or more R values for variables that explain variance

24
Q

Define hierarchical regression

A

Produces R values at each step

25
Q

Define bivariate regression

A

analysing two variables to establish the strength of the relationship between them.

26
Q

Define degrees of freedom

A
  • Number of individual scores that are free to vary without changing the means
27
Q

Define homogeneity of variance

A
  • The spread of scores around each mean is approximately equal
28
Q

What is used to determine where the difference are

A

Post Hoc

29
Q

A significant what will tell you there is a difference between groups

A

F-ratio

30
Q

What is Benferroni correction and when to use it

A

The Bonferroni correction adjusts probability (p) values because of the increased risk of a type I error when making multiple statistical tests

  • The p value is divided by the number of tests
31
Q

Define outliers

A

Observations that are distant or distinct from the other observations in a dataset

32
Q

Define skewness

A

Measure of the asymmetry of a distribution

33
Q

Describe positive and negative skew

A
  • Positive skew, more data on the left side

- Negative skew, more data on the right side

34
Q

Define leptokurtic

A

When scores either side are very close together

- Resulting in sharp thin bell curve

35
Q

Define platykurtic

A
  • Flat wide bell curve

- Data are far either side

36
Q

what range should skewness and kurtosis values fall

A

-1.0 to +1.0

37
Q

Define Mesokurtic

A

Distributions that are moderate in breadth and curves with a medium peaked height.

38
Q

What are the three types of kurtosis

A

leptokurtic, mesokurtic, platykurtic

39
Q

Define kurtosis

A

the sharpness of the peak of a frequency-distribution curve.

40
Q

What is done to a variable during bivariate regressio

A

Squared

41
Q

Bivariate regression can be used for what two things

A
  1. Assess the shared variance between two variables

2. Predicted a value on one variable, using the value of another variable

42
Q

Multiple regression equation

A
  • Y = bX1 + bX2 + bX3…… + c