Correlation Flashcards
What is correlation
a statistical technique that is used to measure
and describe the relationship between two variables that are
simply observed as they exist naturally in the
environment—there is no attempt to control or manipulate
the variables.
Correlation analysis
studies the closeness of the relationship
between two or more variables
- It is often used to describe the linear relationship between two
continuous variables
Positive correlation
means that as one variable increases,
the other does too.
Negative correlation
means that when one variable
increases, the other one decreases.
The calculation of correlation between two variables is a
descriptive measure
Inferential procedure
testing the correlation for
significance. It tells us the probability of finding that level
of ‘togetherness’ between our samples if there is no actual togetherness in the
population.
Strength of correlation
the degree to which one
variable does tend to vary with the other.
The size of the covariance is influenced by
the scale of the data elements, and so in
order to eliminate the scale factor, the correlation coefficient is used as a scale-free
metric of the linear relationship.
Correlation coefficient
The figure arrived at to express the relationshipGood
Goodness of fit
quality of parameter estimation (sometime called model optimization) depends
on
Parameters
help to define the properties and behaviour of the model.
Least square method
finds best fit values of the parameters that make the model do as
good a job as possible at predicting Y (dependent variable) from X (independent
variable).
Error
the difference between actual and predicted value of the outcome of the
model.