Needed Review Areas Flashcards
R-squared
range from 0 to 1
CLOSE TO ZERO - data not aligned with regression line (error is HIGH), graph spread out
CLOSE TO 1 - error is SMALL, data closely aligned to regression line
one independent (I) and one dependent (D) variable: R-squared = square of the correlation coefficient (r)
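A minimal sketch of the R-squared card, assuming a simple least-squares line with one independent and one dependent variable. The function names and sample data are made up for illustration.

```python
import math

def fit_line(xs, ys):
    """Least-squares slope and intercept for one independent variable."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def r_squared(xs, ys):
    """1 - (unexplained error / total variation around the mean)."""
    slope, intercept = fit_line(xs, ys)
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot

def correlation(xs, ys):
    """Pearson correlation coefficient r."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)

# Hypothetical sample data (not from the source)
xs = [1, 2, 3, 4, 5]
tight = [2.1, 3.9, 6.2, 7.8, 10.1]   # hugs a line: R-squared close to 1
scattered = [5, 1, 6, 2, 4]          # spread out: R-squared close to 0

print(r_squared(xs, tight), r_squared(xs, scattered))
```

With a single independent variable, r_squared(xs, ys) should match correlation(xs, ys) ** 2 up to rounding.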
Time-Series
Time is the INDEPENDENT variable, used to assess the influence it may have on the output
More than ONE independent variable
multiple regression
5 Data PATTERNS
Trend, Cyclicality, Seasonality, Irregularity, Random Variation
General slope Upward or Downward over a LONG period of time
Trend
Repetition of up (peaks) or down movements (troughs) that follow or counteract a business cycle that can last several years.
Cyclicality
Regular pattern of volatility, usually within a single year.
Seasonality
ONE-TIME deviation caused by unforeseen circumstances (war, pandemic, etc.)
Irregularity
variability of a process caused by irregular chance fluctuations that cannot be anticipated, detected, or eliminated.
random variation
can be applied when the dependent variable is a categorical, binary variable, such as male/female, dead/living, gas/electric, etc.
logistic regression
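A rough sketch of how a logistic regression turns an input into a binary outcome, assuming a single predictor and already-fitted coefficients. The coefficient values, the 0.5 threshold, and the function names are hypothetical.

```python
import math

def sigmoid(z):
    """Squash any real number into a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def predict_probability(x, intercept, slope):
    """Probability that the binary dependent variable equals 1."""
    return sigmoid(intercept + slope * x)

def predict_class(x, intercept, slope, threshold=0.5):
    """Turn the probability into one of the two categories (0 or 1)."""
    return 1 if predict_probability(x, intercept, slope) >= threshold else 0

# Hypothetical fitted coefficients (not from the source)
INTERCEPT = -4.0
SLOPE = 2.0

for x in (1.0, 2.0, 3.0):
    print(x, predict_probability(x, INTERCEPT, SLOPE), predict_class(x, INTERCEPT, SLOPE))
```

The output is always a probability, which is why this model suits a two-category dependent variable better than a straight regression line does.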
occurs when a given data point in a time series is affected by a previous data point in that same series
autocorrelation
(Durbin-Watson statistic)
check for this when working with Time Series
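A short sketch of the Durbin-Watson statistic computed on regression residuals. The statistic runs from 0 to 4: values near 2 suggest no first-order autocorrelation, values toward 0 suggest positive autocorrelation, and values toward 4 suggest negative autocorrelation. The residual lists below are made-up examples.

```python
def durbin_watson(residuals):
    """Sum of squared successive differences divided by sum of squared residuals."""
    numerator = sum(
        (residuals[t] - residuals[t - 1]) ** 2 for t in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator

# Hypothetical residuals (not from the source)
persistent = [1.0, 0.9, 0.8, 0.9, 1.0]   # each error resembles the previous one
alternating = [1.0, -1.0, 1.0, -1.0]     # each error flips sign

print(durbin_watson(persistent))   # well below 2: positive autocorrelation
print(durbin_watson(alternating))  # above 2: negative autocorrelation
```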
occurs when all of the random variables have the same finite variance: data points in a scatterplot stay approximately the same distance away from the regression line throughout the entire dataset.
Homoscedasticity
OLS regression (ordinary least squares)
Useful in Marketing, Medicine, Social Media, Education
Cluster Analysis
is designed to establish a logical sequence for decisions, to consider the decision alternatives available, and to evaluate the results they will produce; only a limited number of potential outcomes is possible
Decision Tree
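One common way to evaluate the alternatives in a decision tree is to compare expected values. The sketch below assumes that approach; the card itself does not name a specific evaluation method. The alternatives, probabilities, and payoffs are hypothetical.

```python
def expected_value(outcomes):
    """outcomes: list of (probability, payoff) pairs for one alternative."""
    total_probability = sum(p for p, _ in outcomes)
    if abs(total_probability - 1.0) > 1e-9:
        raise ValueError("probabilities for an alternative must sum to 1")
    return sum(p * payoff for p, payoff in outcomes)

def best_alternative(tree):
    """Pick the alternative with the highest expected value."""
    return max(tree, key=lambda name: expected_value(tree[name]))

# Hypothetical decision with a limited number of outcomes per alternative
tree = {
    "expand": [(0.6, 100), (0.4, -50)],
    "hold": [(0.5, 30), (0.5, 10)],
}

for name, outcomes in tree.items():
    print(name, expected_value(outcomes))
print("choose:", best_alternative(tree))
```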
Focus on customers
Lean processes - eliminate anything not adding value to customers
PDCA Cycle - work with customer to create best possible product