correlation Flashcards
bivariate data
pairs of values as variables
independent variable
x axis ( explanatory variable)
dependent
y axis (response variable )
types of corelation
strong negative, weak negative strogn positive, weak positve
casual relationship
if one variable causes a change in the other !
comment on the claim that hotter coutnries have less rainfall
the graph does not support the statement that hotter coutnries have less rainfall !
“describe and interpret the corellation between 2 variables “
there is a positive/negative corelation since as ……. increases/decrerases, ??????? increases/decreases
there is a weak negative corelation between internet speed and house value. danyal concludes this ; suggest why he may be wrong
there may be 3rd variable that influences house value and internet connection - eg distance from built up areas
outlier formula
upper: q3 + 1.5(IQR) Lower: q1` - 1.5 (IQR)
give a reason why you might exclude an anomaly give a reason for including an anomally ?
exclude: anomally is an outlier and not representative
include: “anomally” part of distribution data so include it
what kind of corellation is this ?
and what does it show
weak negative ( overall downward trend )
a casual relationship between two variables
type of line of best fit
least squares regression line
regression line that minimises sum of squares of distrances of each data point
D=point on graph
minimses value of… D1 ^2 + D2^2 + D3^2etc.. .
(x,y)
formula for regression line
y= a +bx
order of variabels = importnat
regression line of y on x is different from x on y
coefficent (B) = changei n y for each unit of change in x, example: if b is negative…then data negatively correlated
vice versa
w= windspeed ( knots)
g= gust ( knots)
give an interpretation of the value of the gradient of this regression line?
just say what the gradient does…
if the valeu of windspeed is 10 knots (Exmaple ) , the daily maximum ust increases by 18 knots