Domain 4 - Methodology Selection Flashcards

1
Q

Three categories of analytical models

A

Descriptive
Predictive
Prescriptive

2
Q

Prescriptive Techniques

A

Optimization
Simulation-Optimization
Stochastic Optimization

3
Q

Types of optimization models

A

Linear programming, Integer programming, Nonlinear programming, Mixed integer programming, Network optimization, Dynamic Programming, Metaheuristics

4
Q

Prescriptive models are used to…

A

provide new ways to improve certain types of performance as agreed upon with the customer and documented in the business and analytics problem statements.
Ask: “what is the best action/outcome?”

5
Q

Predictive models are used to…

A

predict future trends and possibilities and explain past relationships. Ask: “what could happen?”

6
Q

Predictive techniques include…

A
Simulation
Regression
Statistical Inferences
Classification
Clustering
Artificial Intelligence
Game Theory
7
Q

Simulation methods include…

A

(predictive analysis)
Discrete event simulation
Monte Carlo
Agent-based modeling

8
Q

Regression methods include…

A

(predictive analysis)
Logistic
Linear
Step-wise

9
Q

Statistical Inferences include…

A
(predictive analysis)
Confidence intervals
Hypothesis testing
Analysis of variance
Design of experiments
10
Q

Descriptive analysis is used to…

A

answer the question “What happened?”
It describes the problem situation for further analysis. It can be based on descriptive statistics that are conveyed through:
(1) charts and graphs such as histograms, scatter plots, etc., and/or
(2) numerical presentations such as the mean, median, mode, variance, and standard deviation of distributions of data, and cross tabulations.
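
As a minimal sketch of the numerical presentations listed above, the summary below uses Python's standard `statistics` module. The `orders` sample is hypothetical data invented purely for illustration.

```python
import statistics

# Hypothetical sample: daily order counts (illustrative data only).
orders = [12, 15, 15, 18, 20, 22, 24]

summary = {
    "mean": statistics.mean(orders),
    "median": statistics.median(orders),
    "mode": statistics.mode(orders),
    "variance": statistics.variance(orders),  # sample variance (divides by n - 1)
    "std_dev": statistics.stdev(orders),      # sample standard deviation
}

for name, value in summary.items():
    print(f"{name}: {value}")
```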

11
Q

Primary factors that an analyst generally considers when selecting an appropriate methodology (7 things)

A
  1. Time to complete project
  2. Accuracy of the model needed
  3. Relevance of the methodology and scope of the project
  4. Accuracy of the data
  5. Data availability and readiness
  6. Staff and resource availability
  7. Methodology popularity (go with the best approach not the most popular)
12
Q

Discrete event simulation (what is it and why use it)

A

A simulation methodology that is often used to understand bottlenecks in systems.
Handles cases that cannot be handled by queuing theory.
Often used to model multistage processes that have variation in their arrivals and service times and that use shared resources to perform multiple operations.
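
The sketch below illustrates the event-driven idea on the simplest possible case: one stage, one server, random arrivals and service times. It is an assumption-laden toy, not a multistage model; the function name, parameters, and the exponential distributions are all hypothetical choices made for illustration.

```python
import heapq
import random
from collections import deque

def simulate_single_server(n_customers=1000, mean_interarrival=1.0,
                           mean_service=0.8, seed=1):
    """Return the time each customer waited in the queue before service."""
    rng = random.Random(seed)
    events = []  # future event list: (time, customer_id, kind)
    clock = 0.0
    for cid in range(n_customers):
        clock += rng.expovariate(1 / mean_interarrival)
        heapq.heappush(events, (clock, cid, "arrival"))

    waiting = deque()    # arrival times of customers waiting in line
    server_busy = False
    waits = []

    while events:
        now, cid, kind = heapq.heappop(events)  # always process earliest event
        if kind == "arrival":
            if server_busy:
                waiting.append(now)
            else:
                server_busy = True
                waits.append(0.0)
                done = now + rng.expovariate(1 / mean_service)
                heapq.heappush(events, (done, cid, "departure"))
        else:  # departure: start the next waiting customer, if any
            if waiting:
                arrived = waiting.popleft()
                waits.append(now - arrived)
                done = now + rng.expovariate(1 / mean_service)
                heapq.heappush(events, (done, cid, "departure"))
            else:
                server_busy = False
    return waits

waits = simulate_single_server()
print(f"average wait: {sum(waits) / len(waits):.2f} time units")
print(f"longest wait: {max(waits):.2f} time units")
```

A long average or maximum wait at a stage is the kind of signal that points to a bottleneck.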

13
Q

Queuing model (what is it and why use it)

A

Designed to identify the most efficient pathway to a solution; e.g., at a bank it might identify the number of tellers needed to satisfy customers within a particular time frame, such as no more than 10 minutes waiting in a queue.
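
The teller example can be sketched with the standard M/M/c (Erlang C) formulas. This assumes Poisson arrivals, exponential service times, and a target on the *average* wait; the rates and function names below are hypothetical.

```python
import math

def erlang_c(arrival_rate, service_rate, servers):
    """Probability that an arriving customer has to wait (M/M/c queue)."""
    offered_load = arrival_rate / service_rate
    utilization = offered_load / servers
    if utilization >= 1:
        return 1.0  # unstable system: everyone waits
    top = offered_load ** servers / math.factorial(servers)
    partial_sum = sum(offered_load ** k / math.factorial(k) for k in range(servers))
    return top / ((1 - utilization) * partial_sum + top)

def mean_wait(arrival_rate, service_rate, servers):
    """Average time in queue, in the same time unit as the rates."""
    if arrival_rate >= servers * service_rate:
        return math.inf
    p_wait = erlang_c(arrival_rate, service_rate, servers)
    return p_wait / (servers * service_rate - arrival_rate)

def tellers_needed(arrival_rate, service_rate, max_mean_wait):
    """Smallest number of tellers whose average wait meets the target."""
    servers = 1
    while mean_wait(arrival_rate, service_rate, servers) > max_mean_wait:
        servers += 1
    return servers

# Hypothetical bank: 30 customers/hour, each teller serves 12/hour,
# and the average wait should not exceed 10 minutes (10/60 hours).
print(tellers_needed(30, 12, 10 / 60))
```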

14
Q

Monte Carlo Simulation (what is it and why use it)

A

Used when a queuing model isn’t needed.
Used primarily to estimate the randomness of a dependent variable from the randomness of a set of independent variables. This is especially necessary when the input variables are not normally distributed and the relationship used to estimate the dependent variable is not simple (e.g., not purely additive).
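
A minimal sketch of the idea: profit depends multiplicatively (not additively) on two inputs that are not normally distributed. The price, the distributions, and their parameters are hypothetical values chosen only for illustration.

```python
import random
import statistics

def simulate_profit(trials=10_000, seed=42):
    """Draw random inputs and return the resulting profit for each trial."""
    rng = random.Random(seed)
    price = 20.0  # assumed fixed selling price
    profits = []
    for _ in range(trials):
        demand = rng.triangular(500, 1500, 800)  # low, high, mode
        unit_cost = rng.uniform(8.0, 14.0)
        profits.append((price - unit_cost) * demand)
    return profits

profits = sorted(simulate_profit())
mean_profit = statistics.mean(profits)
p5 = profits[int(0.05 * len(profits))]    # 5th percentile
p95 = profits[int(0.95 * len(profits))]   # 95th percentile

print(f"mean profit: {mean_profit:.0f}")
print(f"90% of trials fall between {p5:.0f} and {p95:.0f}")
```

The spread between the percentiles is the estimated randomness of the dependent variable.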

15
Q

Agent-based modeling (what is it and why use it)

A

A system modeled (simulated) as a collection of autonomous decision-making entities called agents that are used to discover emergent behavior that is hard to predict without simulating it.

16
Q

System dynamics (what is it and why use it)

A

A simulation approach used to understand the interactions of a complex system over time.

17
Q

Game theory (what is it and why use it)

A

The study of strategic decision-making processes through competition and collaboration.

18
Q

Probability theory (what is it and why use it)

A

Expresses the likelihood of a particular event occurring, typically as a percentage, so that decisions can be made under a chosen level of risk or tolerance. Bayesian and conditional probabilities are widely used in analytics.
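
As a small worked example of conditional probability, Bayes' rule can be sketched as below. The function name and the fraud-screening numbers are hypothetical.

```python
def posterior(prior, true_positive_rate, false_positive_rate):
    """P(event | positive signal) via Bayes' rule."""
    evidence = (true_positive_rate * prior
                + false_positive_rate * (1 - prior))
    return true_positive_rate * prior / evidence

# Hypothetical screen: 1% of transactions are fraudulent, the screen flags
# 90% of fraud, and it wrongly flags 5% of legitimate transactions.
p_fraud_given_flag = posterior(prior=0.01,
                               true_positive_rate=0.90,
                               false_positive_rate=0.05)
print(f"P(fraud | flagged) = {p_fraud_given_flag:.1%}")
```

Even with a fairly accurate screen, a rare event leaves most flags as false alarms, which is why the prior matters when setting a risk tolerance.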

19
Q

Economic analysis (what is it and why use it)

A

Evaluation often used to guide the optimal allocation of scarce resources. Can include: IRR (internal rate of return), NPV (net present value), FV (future value), and payback period.
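
The four measures can be sketched in a few lines. This is an illustrative sketch only: the cash flows are hypothetical, the payback period ignores discounting, and the IRR search assumes NPV is positive at `low` and negative at `high`.

```python
def npv(rate, cash_flows):
    """Net present value; cash_flows[0] occurs now (period 0)."""
    return sum(cf / (1 + rate) ** t for t, cf in enumerate(cash_flows))

def future_value(present_value, rate, periods):
    """Value of a present amount after compounding for `periods` periods."""
    return present_value * (1 + rate) ** periods

def payback_period(cash_flows):
    """First period in which cumulative cash flow is non-negative, else None."""
    cumulative = 0.0
    for t, cf in enumerate(cash_flows):
        cumulative += cf
        if cumulative >= 0:
            return t
    return None

def irr(cash_flows, low=0.0, high=1.0, tol=1e-9):
    """Rate at which NPV is zero, found by bisection on [low, high]."""
    while high - low > tol:
        mid = (low + high) / 2
        if npv(mid, cash_flows) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2

# Hypothetical project: invest 1000 now, receive 400 in each of three years.
flows = [-1000, 400, 400, 400]
print(f"NPV at 10%: {npv(0.10, flows):.2f}")
print(f"IRR: {irr(flows):.2%}")
print(f"Payback period: {payback_period(flows)} years")
print(f"FV of 1000 at 5% for 2 years: {future_value(1000, 0.05, 2):.2f}")
```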

20
Q

Regression analysis (what is it and why use it)

A

A class of statistical methodologies used to relate dependent variables to independent variables and to understand the significance of the variables and their correlations with one another.

21
Q

Linear regression (what is it and why use it)

A

Models the relationship between a dependent variable and one or more explanatory variables. The model is linear in its coefficients; however, nonlinear functions can be explored by transforming the input data.
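
A minimal sketch with one explanatory variable, using ordinary least squares. It also shows the point about nonlinear functions: a quadratic relationship becomes a straight-line fit once the input is squared. The data and the `fit_line` helper are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data that follows y = 3 + 2 * x**2 exactly.
xs = [1, 2, 3, 4, 5]
ys = [3 + 2 * x ** 2 for x in xs]

# Fitting y against x**2 (a transformed input) recovers a linear relationship.
slope, intercept = fit_line([x ** 2 for x in xs], ys)
print(f"slope = {slope:.3f}, intercept = {intercept:.3f}")
```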

22
Q

Stepwise regression (what is it and why use it)

A

A method of model building that successively adds or deletes variables based on their effect on model performance.

23
Q

Logistic regression (what is it and why use it)

A

Also called logit analysis. A regression analysis often used to predict the outcome of a categorical variable.

24
Q

What are some key artificial intelligence models

A

Artificial Neural Networks, Fuzzy Logic, Expert Systems

25
Q

Value-stream mapping

A

A lean-management method for analyzing the current state and designing a future state for the series of events that take a product or service from its beginning through to the customer.
Requires more aggregated data than a discrete-event simulation model.

26
Q

Pros and Cons of aggregating data at a lower level

A

The lower the level of aggregation, the more accurate and descriptive the model will be of the real-life scenario;
however, it will be harder to validate and will certainly be more prone to mistakes.

27
Q

Pros and Cons of aggregating data at a higher level

A

Usually provides faster results that are easier to understand, but with less accuracy.

28
Q

How much should you aggregate data?

A

The general rule of thumb is to model at the highest level of aggregation possible that will ensure a satisfactory level of accuracy within the time permitted.

29
Q

What are some types of software tools?

A
Spreadsheets
Optimization systems
Statistical software
Simulation systems
Business intelligence systems
Data management systems
Data integration systems
Big Data operating systems (like Hadoop)
30
Q

What are the three portions that data should be divided into for model testing

A
Building data (training data) - used to estimate the needed parameters
Testing data - used to test (verify) the model's ability to provide accurate results
Validating data - used to test that the model behaves closely to the physical behavior being modeled.
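
A minimal sketch of such a three-way split, assuming a random shuffle and a hypothetical 60/20/20 division (the card does not prescribe proportions):

```python
import random

def three_way_split(records, build_frac=0.6, test_frac=0.2, seed=0):
    """Shuffle the records and split them into building, testing, and validating sets."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_build = round(len(shuffled) * build_frac)
    n_test = round(len(shuffled) * test_frac)
    building = shuffled[:n_build]
    testing = shuffled[n_build:n_build + n_test]
    validating = shuffled[n_build + n_test:]  # whatever remains
    return building, testing, validating

# Hypothetical data set of 100 record IDs.
building, testing, validating = three_way_split(range(100))
print(len(building), len(testing), len(validating))
```

Shuffling first avoids accidental ordering effects; time-ordered data would need a different approach.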