week 4 Flashcards
what are trends?
they are part of predictive analysis, used to understand any relationships between one or more variables
- Functional relationships can be converted into analytical models
what are the three types of functions used in predictive analysis?
- Linear function: y = a + bx . Linear functions show steady increases or decreases over the range of x . This is the simplest type of function used in predictive models. It is easy to understand, and over small ranges of values, can approximate behaviour rather well.
- Logarithmic function: y = ln 1 x 2 . Logarithmic functions are used when the rate of change in a variable increases or decreases quickly and then levels out, such as with diminishing returns to scale. often used in marketing models where constant percentage increases in advertising, 3, Polynomial function: y = ax 2 + bx + c (second order— quadratic function), y = ax 3 + bx 2 + dx + e (third order— cubic function), and so on. A second order polynomial is parabolic in nature and has only one hill or valley; a third order polynomial has one or two hills or valleys. Revenue models that incorporate price elasticity are often polynomial functions.
what are the other two functions in predicative analysis?
- Power function: y = ax b . Power functions define phenomena that increase at a specific rate. Learning curves that express improving times in performing a task are often modeled with power functions having a 7 0 and b 6 0. Exponential function: y = ab x .
- Exponential functions have the property that y rises or falls at constantly increasing rates. For example, the perceived brightness of a lightbulb grows at a decreasing rate as the wattage increases. In this case, a would be a positive number and b would be between 0 and 1. The exponential function is often defined as y = ae x , where b = e , the base of natural logarithms (approximately 2.71828).
what is regression analysis?
Regression analysis is a tool for building mathematical and statistical models that characterize relationships between a dependent variable (which must be a ratio variable and not categorical) and one or more independent, or explanatory, variables, all of which are numerical (but may be either ratio or categorical).
what are the two types of regression model?
(1) regression models of cross-sectional data and
(2) regression models of time-series data, in which the independent variables are time or some function of time and the focus is on predicting the future.
what is simple linear regression?
involves finding a linear relationship between one independent variable X and one dependent variable Y.
what is the problem with errors in simple linear regression?
Simply summing up the errors may result in some
cancelling each other out
- We instead sum up either the absolute values or squares:
least squares regression
- done in excel
what is a multiple linear regression?
a linear regression model with more than one independent variable
- we estimate partial regression coefficients
what are the assumptions about the data in regression analysis?
- Normality of errors . Regression analysis assumes that the errors for each individual value of X are normally distributed, with a mean of zero. This can be verified either by examining a histogram of the standard residuals and inspecting for a bell-shaped distribution or by using more formal goodness-of- fit tests. It is usually difficult to evaluate normality with small sample sizes.
what is homoscedasticity?
another assumption of the data in regression analysis which means that the variation about the regression line is constant for all values of the independent variable.
- points should be randomly scattered no pattern
what is independence of errors?
another assumption of data in regression analysis, which means one observation is independent on one another which gives correlation which we dont want in regression analysis. each observation should be as uncorrelated to each other as possible
what is autocorrelation?
its the correlation among successive observations over time and can be found with residual plots having clusters of residuals with the same signs
what does residual mean?
The difference between an observed value of the response variable and the value of the response variable predicted from the regression line.
what is r2 (r-squared)?
r-squared shows how well the data fit the regression model (the goodness of fit)
value is between 0 and 1.
the larger the R2 value the better the fit
what is historical analogy?
its a judgemental technique in which a forecast is obtained through a comparative analysis with a previous situation