Midterm Flashcards
Ch 1-4
Analytics uses..
- Data
- Information Technology
- Statistical Analysis
- Quantitative Methods
- Mathematical or Computer based models
Analytics Purpose
to help managers gain improved insight about their business operations and make better, fact- based decisions
Analytics Applications
- Pricing
- Customer Segmentation
- Merchandising
- Location
- Social Media
BA has strong relationship with…
- profitability
- revenue
- shareholder return
3 Kinds of Analytics
- Descriptive Analytics
- Predictive Analytics
- Prescriptive Analytics
Descriptive Analytics
- Uses data to understand past and present
- Metrics and measures
Predictive Analytics
- Analyzes past performance in an effort to predict the future
- Forecasting, Simulation
Prescriptive Analytics
- Uses optimization techniques to identify the best alternative to maximize or minimize some objective
- operations research or management science.
Data
collected fact and figures
Database
- collection of computer files containing data
- data are interlinked by fields or attributes that are common across files
Information
comes from analyzing, organizing, and transforming data
Big Data
Massive amounts of business data from a wide variety of sources
- most available in real time
- can be uncertain or unpredictable
Big Data datasets generated by
web applications, social networks, click streams, sensors, and cards
Big Data helps
organizations better understand and predict customer behavior and improve customer service
Big Data Characteristics
- Volume
- Variety
- Velocity
- Veracity
Volume
- amount of data increases
- Big today, Bigger tomorrow
Variety
- Many sources
- Unstructured and messy
Velocity
Captured in real time and quickly incorporated into decisions
Veracity
- How reliable is the data?
- Need high-quality data and understanding
Metrics
are used to quantify performance
Measures
are numerical value of metrics
2 types of Descriptive Analytics Metrics
- Discrete Metrics
2. Continuous Metrics
Discrete Metrics
- Involves counting
- Ex) Number or proportion of on time deliveries, incorrect or incomplete orders, # of errors in an invoice
Continuous Metrics
- measured on a contunuum
- ex) Delivery time, package weight, purchase price
Metrics data can be one of 4 types
- Categorical (nominal) data
- Ordinal data
- Interval data
- Ratio data
Categorical (Nominal) Data
categorized according to a specified characteristic that bear no quantitative relationship
ex) location, employee job
Ordinal Data
Data that is ranked or ordered according to some relationship with one another, no fixed units of measurements
ex) college rankings, survey responses
Interval Data
- Ordinal data but with constant differences between observations
- No true zero point
- Ratios are not meaningful
Ex)Temperature readings, SAT scores, Time
Ratio Data
- Continuous values and have a natural zero point
- Ratios are meaningful
- Decimals
Ex) Monthly sales
Models
An abstraction or representation of a real system, idea, or object
-Captures the most important features
Forms of Models
- Written or verbal description
- Visual display
- Mathematical formula
- Spreadsheet representation
Decision Models
A decision model is a model used to understand, analyze, or facilitate decision making
Types of Model Input & Output
Inputs: 1. Data 2. Uncontrollable variables 3. Decision variables (controllable) Outputs: 1. Performance measures 2. Behavioral measures
Types of Model Input & Output
Inputs: 1. Data 2. Uncontrollable variables 3. Decision variables (controllable) Outputs: 1. Performance measures 2. Behavioral measures
Influence Diagrams
Influence diagrams visually show how various model elements relate to each other
(circles, squares, directed arc)
Circles
Variables that cannot be controlled
Squares
Variables that can be controlled, or decisions
Directed arc
One node influences the other
Predictive Decision Models
- often incorporate uncertainty to help managers analyze risk
- Aim to predict what will happen in the future
- Use data from the past to define a potential future relationship (cause/ effect)
- Usually a mathematical model but can also be in written, visual, or spreadsheet forms
Prescriptive Decision Models
Prescriptive decision models help decision makers identify the best solution
- optimization
- Objective Function
- Constraints
- Optimal Solution
Optimization
Finding values of decision variables that minimize (or maximize) something such as cost (or profit)
Objective function
The equation that minimizes (or maximizes) the quantity of interest
Constraints
Limitations or restrictions
Optimal solution
Values of the decision variables at the minimum (or maximum) point
Optimal solution
Values of the decision variables at the minimum (or maximum) point
Six steps in the problem solving process
- Recognizing the problem
- Defining the problem
- Structuring the problem
- Analyzing the problem
- Interpreting results and making a decision
- Implementing the solution
Recognizing the Problem
- Realize the problem exists!
- Problems exist when there is a gap between what is happening and what we think should be happening
Ex) Costs are too high compared with competitors
Defining the Problem
- Clearly defining the problem is not a trivial task!
- Must separate the problem from the symptom
- Complexity increases when the following occur
(Large number of courses of action, Several competing objectives, External groups are affected, Problem owner and problem solver are not the same person, Time constraints exist, Problem belongs to a group vs. an individual
Structuring the Problem
- Stating goals and objectives
- Characterizing the possible decisions
- Identifying any constraints or restrictions
- Develop a formal model
Analyzing the Problem
- Identifying and applying appropriate BA techniques
- Typically involves experimentation, statistical analysis, or a solution process
- Evaluate scenarios, analyze risks associated with alternatives, meet goals, or final optimal solution
Focus for this class!
Interpreting Results and Making a Decision
- Managers interpret the results from the analysis phase
- Incorporate subjective judgment as needed
- Understand limitations and model assumptions
- Make a decision utilizing the above information
Interpreting Results and Making a Decision
- Managers interpret the results from the analysis phase
- Incorporate subjective judgment as needed
- Understand limitations and model assumptions
- Make a decision utilizing the above information
Implementing the Solution
- Translate the results of the model back to the real world
- Make the solution work in the organization by providing adequate training and resources
- Manage change, build trust, etc.
- Inform and balance needs of the stakeholders (Customers, suppliers employees)
- Implementation could be a project!
Data Visualization
is the process of displaying data (often in large quantities) in a meaningful fashion to provide insights to support better decisions
(tables vs graphs)
Data Visualization allows us to
analyze current and past data, which may reveal patterns and relationships
Tables and graphs
improve communication and understanding so Senior management is able to quickly understand the message and impact
Dashboards
is a visual representation of a set of key business measures
- Term related to an automobile’s control panel
Dashboards provide
important summaries of key business information
Dashboards help
manage a business process or function
Graphs and Charts
- Take raw data and display it in a picture
- Map out data associated with variables to determine patterns and relationships
- Communication tool for both model results and data exploration
- One of the most common descriptive analytics
6 Types of Charts
- Column/bar
- Line
- Pie
- Area
- Scatter
- Bubble
Column/ Bar Charts
show current and past performance to compare categorical data
Column/ Bar Chart Forms
- Clustered: compares values across categories
- Stacked: displays contribution of each variable towards the overall total
- 100% Stacked: compares contribution of each variable towards the total as a percent
Line Charts
show trends in data over time
Ex) us exports to china
Pie Charts
- display the relative proportion of each data source to the total
- Works well for a few variables
Ex) Marital status
Area Charts
- combine pie and line charts
- Shows trends as well as the percentage of the total for each variable
- few variables
Ex) energy consumptions
Scatter Charts
- charts show the relationship between two variables
- Need data in pairs to construct
Ex) house size vs market value
Bubble Charts
- are a form of scatter charts
- Size of data marker corresponds to the value of a third variable
- Plotting three variables in two dimensions
Ex) Price, price/earnings ratio, market cap
Pareto Analysis
- involves sorting data and calculating cumulative proportions
- Isolates variables and determines the main drivers
- Provides focus for continuous improvement initiatives
Statistic
is a summary measure of data
Descriptive statistics
are methods that describe and summarize data
Frequency Distributions
Often displayed as a table that shows the number of observations in each group
- groups are non-overlapping
- “Mutually exclusive and collectively exhaustive”
- usually categorical
Relative Frequency Distribution
are frequencies expressed as a fraction (proportion) of the total
Cumulative Relative Frequency
The proportion of the total number of observations that fall at or below the upper limit of each group
Histograms
is a graphical depiction of a frequency distribution for numerical data
Percentiles
Percentiles specify the percent of data entries that are at or below the value of a specific or chosen record
Quartiles
Quartiles break the data into four parts
- 25th
- 50th
- 75th
- 100th
Cross-Tabulations
- displays the number of observations in a data set for different subcategories of two categorical variables
Ex) gender, edu level, martial status
PivotTables
- Used to create cross-tabulations
- Organize and summarize complex data sets into meaningful data
- Drill down into data sets
- Make summary tables and charts
Trendlines
are useful for studying the relationship between variables (for example: price vs. demand, bond rate vs. business cycle, intent to purchase product vs. brand attitude, etc.)
Types of Trendlines
- linear
- polynomial
- exponential
- power
Regression Analysis
a tool for building mathematical and statistical models that characterize relationships between a dependent (ratio) variable and one or more independent, or explanatory variables (ratio or categorical), all of which are numerical
Simple linear regression
involves a single independent variable
Multiple regression
involves two or more independent variables
Two categories of regression models
- Cross-sectional data
2. Time series data (forecasting)
Least Squares Regression
- Goal is to minimize residual error
- The best fit line minimizes the sum of the squares of the residuals
- Errors can be positive or negative, so we square the values to eliminate any negative signs
Residuals
are the observed errors associated with estimating the value of the dependent variable using the regression line
Regression Significance
The t-statistic will also test the significance of the model coefficients
(Slope and y-intercept)
Residual Analysis
- Use the residual plot from the regression tool for analysis
- Residuals in plot should look random, with no clear shape or trend
- Shape or trend means residuals may not be random and could indicate issues with variance, missing terms, outliers, etc.
(Shape => A problem with the regression model)
Best Practices Approach
- Construct a model with all available independent variables. Check for significance of the independent variables by examining the p-values.
- Identify the independent variable that has the largest p-value that exceeds the level of significance.
- Remove the single variable identified in step 2 from the model, rerun the model, and evaluate adjusted R2.
- Repeat steps 2 and 3 until all variables are significant.
Forecasting
-Process of predicting a future event
-Underlying basis of all business decisions
(Production, inventory, personnel, facilities)
Forecasting Time Horizons
- Short-range forecast
- Medium-range forecast
- Long-range forecast
Short-range forecast
- Up to 1 year, generally less than 3 months
Ex) Purchasing, job scheduling, workforce levels, job assignments, production levels
Medium-range forecast
- 3 months to 3 years
Ex) Sales and production planning, budgeting
Long-range forecast
- 3+ years
Ex) New product planning, facility location, research and development
Types of Forecasts
- Economic forecasts
- Technological forecasts
- Demand forecasts
Economic forecasts
Address business cycle – inflation rate, money supply, housing starts, etc
Technological forecasts
- Predict rate of technological progress
- Impacts development of new products
Demand forecasts
- Predict sales of existing products and services
- Payoffs in reduced inventory and obsolescence
Strategic Importance
- Supply chain management
- Human Resources
- Capacity
Supply chain management
Good supplier relations, advantages in product innovation, cost and speed to market
Human Resources
Hiring, training, laying off workers
Capacity
Shortages can result in undependable delivery, loss of customers, loss of market share
7 Forecasting Steps
- Determine the use of the forecast
- Select the items to be forecasted
- Determine the time horizon of the forecast
- Select the forecasting model(s)
- Gather the data needed to make the forecast
- Make the forecast
- Validate and implement the results
Qualitative method of forecasting
- Used when situation is vague and little data exist
- New products, new technology
- Involves intuition, experience
Quantitative methods of forecasting
- Used when situation is stable and historical data exist
- Existing products, current technology
- Involves mathematical techniques
Types of Qualitative Methods
- Jury of executive opinion
- Delphi method
- Sales force composite
- Market survey
Jury of Executive Opinion
- Pool opinions of high-level experts and managers
- Small group
- Group estimates demand by working together
- Combines managerial experience with statistical models
- Relatively quick
- Disadvantage: Group-think
Delphi Method
Panel of experts, queried individually
Iterative process that continues until consensus reached
3 participants: staff, respondents, decision makers
Sales Force Composite
- Estimates from individual salespersons are reviewed for reasonableness, then aggregated
- Combined at district and national levels
- Assumed that sales reps know and understand customer wants
- May be overly optimistic
Market Survey
- Ask customers about purchasing plans
- Useful for demand and product design and planning
- What consumers say and what they actually do may be different
- May be overly optimistic
Quantitative Approaches of forecasting
- Time-series models
2. Associative model
Time-series models
- Naïve approach
- Moving averages
- Exponential smoothing
- Trend projection (trendlines)
Associative model
Linear regression
Naïve approach
- Assumes demand in next period is the same as demand in most recent period
- Can be cost effective and efficient
- good starting point
Moving Averages
- Series of arithmetic means
- Subset of historical actual values to generate forecast
- Little or no trend
Exponential Smoothing
- Form of weighted moving average
Most recent data has stronger influence
Forecast Accuracy
- We need to obtain the most accurate forecast, no matter the technique
- To find the most accurate forecast, select the model that gives us the lowest forecast error
Three measures for error
- Mean absolute deviation (MAD)
- Mean squared error (MSE)
- Mean absolute percent error (MAPE)
Mean absolute deviation (MAD)
How much the forecast missed the target
Mean squared error (MSE)
Square of how much the forecast missed the target
Mean absolute percent error (MAPE)
Average percent error
Seasonal Variations in Data
model adjusts trend data for seasonal variations in demand
Steps for monthly seasons:
- Find average historical demand for each month
- Compute average demand over all months
- Compute seasonal index for each month
- Estimate next year’s total demand
- Divide this estimate of the total demand by the number of months, then multiply by the seasonal index for that month
Tracking Signal
- Measures how well the forecast is predicting actual values
- Ratio of cumulative forecast errors to MAD