IA2 - Exam Flashcards

Question 1

Q

Bivariate Data

define explanatory variable

Answer

A

also known as independent variable
used to explain or predict value of response variable

Question 2

Q

Bivariate Data

define response variable

Answer

A

also called dependent variable
changes in response to the explanatory variable

Question 3

Q

Bivariate Data

P(event) = ?

Answer

A

P(event) = (number of successful outcomes) / (total number of outcomes)

Question 4

Q

Bivariate Data

how can we tell if there is an association based on percentages?

Answer

A

if the percentages are very different, there IS an association
if they are similar there is NO association
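
The percentage comparison above can be sketched in a few lines of Python. The table, its categories, and the helper name `row_percentages` are all made up for illustration; the card does not say how big a difference counts as "very different", so the code only computes the percentages to compare.

```python
# Hypothetical two-way table of counts: "plays sport" (yes/no) by year level.
counts = {
    "Year 11": {"yes": 30, "no": 20},
    "Year 12": {"yes": 12, "no": 28},
}

def row_percentages(table):
    """Convert each row of counts into percentages of that row's total."""
    result = {}
    for group, row in table.items():
        total = sum(row.values())
        result[group] = {k: 100 * v / total for k, v in row.items()}
    return result

percentages = row_percentages(counts)
for group, row in percentages.items():
    print(group, row)
```

Here 60% of Year 11 but only 30% of Year 12 answer "yes", so the percentages are quite different, which suggests an association between year level and playing sport.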

Question 5

Q

Bivariate Data

What are the 6 features of a scatterplot?

Answer

A

Explanatory variable: x - axis
response variable: y - axis
title, axis label (units)
Arrows
use ‘lightning bolt’ to show not starting at 0
use an appropriate scale

Question 6

Q

Bivariate Data

what are the 2 types of Form (type) used to describe patterns/associations?

Answer

A

linear
non-linear

Question 7

Q

Bivariate Data

what are the 2 types of direction used to describe patterns/associations?

Answer

A

positive
negative

Question 8

Q

Bivariate Data

what are the 5 types of strength used to describe patterns/associations?

Answer

A

no correlation
weak
moderate
strong
perfect

Question 9

Q

Bivariate Data

define Pearson’s correlation coefficient

Answer

A

does not tell us whether there is an association
instead, assumes there is a linear association
gives a measure of its strength and direction

Question 10

Q

Bivariate Data

how can you tell direction and strength from correlation coefficient?

Answer

A

direction = sign (positive or negative)
strength = size of the value (closer to 1 or -1 = stronger, closer to 0 = weaker)
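
As a rough illustration, Pearson's r can be computed by hand and then read for sign and size. This is a minimal sketch: the function name `pearson_r` and the data (hours studied vs. marks) are invented for the example, and in the exam a calculator would normally do this step.

```python
import math

def pearson_r(xs, ys):
    """Pearson's correlation coefficient for two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: hours studied (explanatory) and marks (response).
hours = [1, 2, 3, 4, 5]
marks = [52, 55, 61, 64, 70]

r = pearson_r(hours, marks)
direction = "positive" if r > 0 else "negative" if r < 0 else "none"
print(round(r, 3), direction)
```

For this data r is close to 1, so the sign gives a positive direction and the size of the value indicates a strong linear association.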

Question 12

Q

Bivariate Data

define coefficient of determination (r squared)

Answer

A

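r^2 tells us the proportion of the variation in the response variable that is explained by the variation in the explanatory variable
- e.g. if r^2 = 0.82, then 82% of the variation in the response variable is explained by the explanatory variable. The other 18% is due to other factors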
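
A small sketch of the arithmetic, assuming a made-up correlation coefficient of r = -0.9 (the variable names are illustrative only):

```python
# Hypothetical correlation coefficient from a calculator.
r = -0.9

# Squaring removes the sign, so r^2 is always between 0 and 1.
r_squared = r ** 2

percent_explained = round(100 * r_squared)
percent_other = 100 - percent_explained
print(percent_explained, percent_other)
```

Note that squaring loses the direction: r = 0.9 and r = -0.9 give the same coefficient of determination.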

Question 13

Q

Bivariate Data

define least squares regression line

Answer

A

line of best fit that makes the sum of the squared residuals as small as possible
- a residual tells us how far a data point is (vertically) from the line of best fit

Question 14

Q

Bivariate Data

how do you know if your residual is + or -?

Answer

A

data points above the line of best fit have a positive residual
data points below the line of best fit have a negative residual
sum of residuals = 0 in a least squares line of best fit
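
The sign rule and the zero-sum property can be checked numerically. The sketch below fits a least squares line by hand to made-up data; `least_squares_line` is a hypothetical helper, not a calculator or library function.

```python
def least_squares_line(xs, ys):
    """Return (intercept a, slope b) of the least squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b

# Hypothetical data points.
xs = [1, 2, 3, 4, 5]
ys = [52, 55, 61, 64, 70]

a, b = least_squares_line(xs, ys)

# residual = actual y - predicted y
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]

for x, res in zip(xs, residuals):
    side = "above" if res > 0 else "below" if res < 0 else "on"
    print(x, round(res, 2), side)

print(round(sum(residuals), 6))
```

Points with a positive residual sit above the line, points with a negative residual sit below it, and the residuals add to zero (up to floating-point rounding).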

Question 15

Q

Bivariate Data

what are the assumptions of using a LSRL?

Answer

A

numerical data
linear association
No clear outliers

Question 16

Q

Bivariate Data

what is the equation of LSRL?

Answer

A

refer to photo
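
The photo is not reproduced in this deck. For reference, a commonly used textbook form of the least squares regression line is given below; the symbols (a for the y-intercept, b for the slope, r for the correlation coefficient, s_x and s_y for the standard deviations) are an assumption and may differ from the notation in the original photo.

```latex
\hat{y} = a + bx, \qquad b = r\,\frac{s_y}{s_x}, \qquad a = \bar{y} - b\,\bar{x}
```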

Question 17

Q

Bivariate Data

how do you find LSRL using calculator?

Answer

A

refer to photo

Question 18

Q

Bivariate Data

what is the formula for calculating residual values?

Answer

A

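residual = actual y value - predicted y value (the predicted value comes from the LSRL)
- a residual plot graphs these residuals, so it is built from the same line as the LSRL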

Question 19

Q

Bivariate Data

how do you know if residual plots are linear or non-linear?

Answer

A

points scattered randomly, roughly evenly above and below the zero line (residual = 0) = linear
if there is some sort of pattern (e.g. a curve) = non-linear
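
To see what a "pattern" looks like, the sketch below fits a straight line to deliberately curved, made-up data (y = x squared) and prints the sign of each residual. The helper `least_squares_line` is hypothetical and written out so the example is self-contained.

```python
def least_squares_line(xs, ys):
    """Return (intercept a, slope b) of the least squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b

# Hypothetical non-linear data: y = x^2.
xs = [1, 2, 3, 4, 5]
ys = [x ** 2 for x in xs]

a, b = least_squares_line(xs, ys)
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]

# Record whether each point is above (+) or below (-) the line.
signs = "".join("+" if res > 0 else "-" if res < 0 else "0" for res in residuals)
print(residuals)
print(signs)
```

The residuals come out positive at both ends and negative in the middle, a curved pattern rather than a random scatter, which points to a non-linear association.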

Question 20

Q

Bivariate Data

Recall and explain three reasons why causation may not be present?

Answer

A

common response
- when 2 variables are associated because they are both strongly associated with a common third variable
confounding variables
- when there are at least two possible causal explanations for the observed association, but we have no way of knowing their separate effects. The effects of the two possible explanatory variables are said to be confounded because there is no way of knowing which is the actual cause of the association
coincidence
- when it is impossible to identify any feasible confounding variable to explain a particular association
- i.e. the association happens by chance

Question 21

Q

how do we describe trends in time series plots?

Answer

A

Ignore fluctuations and describe the overall trend of the plot
- positive (upward)
- negative (downward)
- constant
can have multiple trends in the one plot

Question 22

Q

features of cycles

Answer

A

repeated patterns
period is usually longer than a year

Question 23

Q

describe seasonal fluctuations

Answer

A

seasonal factors (time of day, day of week, month of year, quarter of year, season of year (winter etc.))
- quarters = Jan-Mar, Apr-Jun, Jul-Sep, Oct-Dec
- peaks and troughs consistently occur after the same time interval; e.g. ice-cream sales peak in warmer months and drop away in cooler months

Question 24

Q

describe outliers

Answer

A

one-off unanticipated events; can be difficult to recognise, especially if data is irregular or seasonal
- including an outlier may be detrimental for forecasting (predicting)
- possible outliers should be investigated before being ‘eliminated’ from data

(24 cards)