Topic 1 - data relationships Flashcards

1
Q

What are cases?

A

Cases are the objects described by a set of data; these may be
customers, companies, subjects in a study, units in an experiment or other objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a variable?

A

is a characteristic of a case:
different cases can have different values of the variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 3 types of data with the 2 sub categories of on of them?
and what they mean.

A

Categorial (gender, race..)
Ordial (level of education, how much you agree..)
Numerical - discrete (number of) or continuous (weight, temperature of)
- can compute an average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do we know if two variables measured on the same cases are associated?

A

Knowing the values of one of the variables tells us something about the values of the other variable
that we would not know without this information
e.g. number of books in the household is associated with higher children’s grades

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When needing to see if there is any causation between 2 variables we need decide which is the response and explanatory variable?

A

Response variable (dependent) - this is the measure of the outcome of a study (e.g. grades)
Explanatory variable (independent) - which explains or causes changes in the response variable (e.g. books)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Whats a scatterplot?
and what’s on the vertical and horizontal axis?

A

A scatterplot is a graph showing the relationship between two quantitative variables measured on the same cases
vertical - y - dependent
horizontal - x - independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What to use when interpreting scatterplots?

A

Form - linear or non-linear
Direction - positive or negative
Strength of the relationship - strong or weak
and striking deviations from the pattern - an outlier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Positively associated mean?

A

When the 2 variables accompany each in a positive relationship?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Negatively associated mean?

A

When the 2 variables accompany each in a negative relationship?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What correlation measures?

A

Measures the direction and strength of the linear relationships
between two quantitative variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Whats a regression line?

A

Is a straight line that describes how a response/dependent variable y changes as an explanatory variable x changes. (The regression line is the one that best approximates the points in the scatterplot)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Whats is the use of extrapolation?
though…

A

Extrapolation is the use of a regression line for prediction far outside the range of values of the explanatory variable x used to obtain the line
- Such predictions are often not accurate and should be avoided

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Whats an outlier?

A

An outlier is an observation that lies outside the overall pattern of the other observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What happens with points that ra outliers in the y direction?

A

Points that are outliers in the y direction of a scatterplot have large regression residuals,
but other outliers need not have large residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Whats a lurking variable?

A

A lurking variable is a variable that is not among the explanatory nor response variables in a study
and yet may influence the interpretation of relationships among these variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
A