Terms and Linear Reggresion Flashcards

1
Q

Response / dependent variable

A

The variable of interest, dependent on other explanatory variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explanatory / independent variables.

A

Other variables that are used to explain the behaviour of the response variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Factor

A

Is a discrete explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are covariates

A

Continuous explanatory variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Continuous variable

A

Things that can be counted infinite number of times such as age, measurement, temperature ect (although age can be converted to discrete if you seclude it to years).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Discrete variable

A

A variable you can count a finite amount of time. Counting the change in your pocket.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When a researcher manipulates the explanatory variables (treatments) while holding the other variables constant and notice the consequences of the response variable.

A

Designed experiments

Cause and effect relationships can be concluded

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Researchers observe the differences in explanatory variables and see if these are related to differences in the response variable.

A

Observational studies

Cannot be certain on cause and effect relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Confounding factors

A

Factors that affect both the dependent and independent variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Correlation coefficient

r=

A

-1 to 1 score of linear relationships
0 = no linear relationship
1 = high positive relationship
-1 = high negative relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Beta 0 =

A

The intercept

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Beta 1 =

A

The slope (change in Y when x is increased by 1 unit).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Residual

A

Vertical distance between a data point and the regression line. Often named errors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Response Variable/s:

A

response variable is also referred to as a dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Explanatory Variable/s:

A

These are variables that are used to explain the effect

that we see in the response variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

independent variables.

A

physical elements that
reflect the design; for example, plots, animals, pens, test tubes or plants. There will also be the treatment elements. These could be diets, varieties, pasture types
etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Predictor Variable (x)

A

Independent or explanatory variables which seek to predict or ‘explain’ the response variable.

18
Q

Dependent Variable (Y ):

A

Response variable of interest to investigators that is (hopefully) dependent on the explanatory variable/s. This is always a random variable.

19
Q

dependent variable goes on the __ axis

A

Y

20
Q

independent variable always sits on the __ axis

A

X

21
Q

calculate rXY (rcmdr)

A

Statistics / Summaries / Correlation Matrix , then press “OK”, then select the variables whose correlation coefficient(s) you want to find

22
Q

Yi = β0 + β1xi + εi

A

simple linear reggression model.

23
Q

The random error term is assumed to:

A

follow a normal distribution,
have a mean of zero: E(εi) = 0,
have a constant variance: Var(εi) =σ2
be uncorrelated with the other error terms.

24
Q

a simple linear regression

model (Rcmdr)

A

Statistics / Fit Models / Linear

Regression

25
Q

correlation coefficient is between;

A

-1 and +1

26
Q

B1 = slope, the change in;

A

y when x is increased by 1.

27
Q

smaller sum of squares indicates;

A

better fitting line.

28
Q

The coefficient of determination

A

a measure of the variability in the response variable that is explained by the predictor variable. r2

29
Q

hypothesis testing for B1 =

A

H0: B1 = 0
H1: B1 not=0

30
Q

The coefficient of determination (R2)

A

a measure of the variability in the response variable that is explained by the predictor variable x2

31
Q

curvilinear

A

contained by or consisting of a curved line or lines.

32
Q

when to use a polynomial reggression model?

A

You would use a polynomial regression model if the aim was to define the relationship between two continuous variables, as you would do for simple linear regression.
The difference is that the relationship between the response variable and the predictor variable is suspected to be curved, rather than being linear.

33
Q

process of fitting a linear reggression model 5 steps:

A
  1. Graph the data to determine the type of relationship
  2. Fit an appropriate model
  3. Test the assumptions
  4. Check the fit of the model
  5. Draw conclusions
34
Q

polynomial function: parabola

A

a polynomial function where the highest power is a squared (2) term (for the parabola)

35
Q

polynomial function: cubic

A

a polynomial function where the highest power is a cubed (3) term (for the cubic)

36
Q

B0 (beta zero) =

A

Value of y when x is equal to 0.

37
Q

Residual is

A

Calculation of distance between line of best fit and points.

38
Q

A smaller sim of squares indicates what?

A

A better fitting line.

39
Q

Best graph to use for 2 continuous variables?

A

Scatterplot

40
Q

Simple linear regression model analysis has 3 major purposes.

A
  1. To describe the linear relationship between X and Y
  2. To determine how much of the variation in Y can be explained by the linear relationship.
  3. To predict new values of Y from given values of X.
41
Q

Coefficient of determination

A

A statistical measurement that examines how differences in one variable can be explained by differences in another when predicting the outcome of an active vent.