CAP Terms Flashcards

1
Q

5 Whys

A

iterative process of discovery through repetitively asking ‘why’; used to explore cause and effect relationships underlying and/or leading to problem (http:// en.wikipedia.org/wiki/5_Whys)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

5S

A

workplace organization method promoting efficiency and effectiveness; five terms based on Japanese words for: sorting, set in order, systematic cleaning, standardizing, and sustaining (http://en.wikipedia.org/ wiki/5S_(methodology)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

80/20 Rule

A

AKA the Pareto principle: roughly 80% of results come from 20% of effort

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Accuracy

A

quality or state of being correct or precise, or the degree to which the result of a measurement, calculation, or specification conforms to the correct value or standard (https://www.google.com/#q=accuracy)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Activity-based costing

A

method of assigning costs to products or services on the resources that they consume (http://www.economist. com/node/13933812)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Agent-based modeling

A

a class of computation models for simulating actions and interactions of autonomous agents with a view to assessing their effects on the system as a whole (http:// en.wikipedia.org/wiki/Agent-based_model)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Algorithm

A

set of specific steps to solve a problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Amortization

A

allocation of cost of an item or items over a time period such that the actual cost is recovered; often used to account for capital expenditures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Analytics

A

scientific process of transforming data into insight for making better decisions (INFORMS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Analytics professional

A

person capable of making actionable decisions through the analytic process; also a person holding the Certified Analytics Professional (CAP®) credential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

ANCOVA

A

acronym for analysis of covariance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

ANOVA

A

acronym for analysis of variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Artificial Intelligence (AI)

A

branch of computer science that studies and develops intelligent machines and software (http://en.wikipedia. org/wiki/Artificial_intelligence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Artificial Neural Networks

A

computer-based models inspired by animal central nervous systems (https://www.google.com/#q=artificial+ neural+networks)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Assemble-to-Order (ATO)

A

manufacturing process where products are assembled as they are ordered; characterized by rapid production and customization (http://www.investopedia.com/terms/a/ assemble-to-order.asp)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Assignment problem

A

one of the fundamental combinatorial optimization problems in the branch of optimization or operations research in mathematics; consists of finding a maximum-weight matching in a weighted bipartite graph (http:// en.wikipedia.org/wiki/Assignment_problem)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Automation

A

use of mechanical means to perform work previously done by human effort

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Average

A

sum of a range of values divided by the number of values to arrive at a value characteristic of the midpoint of the range; see also, Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Batch production

A

method of production where components are produced in groups rather than a continual stream of production; see also, Continuous production

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Benchmark problems

A

comparison of different algorithms using a large test set (http://www.cs.cmu.edu/afs/cs/project/jair/pub/ volume24/ortizboyer05a-html/node6.html)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Benchmarking

A

act of comparison against a standard or the behavior of another in attempt to determine degree of conformity to standard or behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Bias

A

a tendency for or against a thing, person, or group in a way as to appear unfair; in statistics, data calculated so that it is systematically different from the population parameter of interest (http://en.wikipedia.org/wiki/ Bias_(statistics)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Big data

A

data sets too voluminous or too unstructured to be analyzed by traditional means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Box-and-whisker plot

A

a simple way of representing statistical data on a plot in which a rectangle is drawn to represent the second and third quartiles, usually with a vertical line inside to indicate the median value. The lower and upper quartiles are shown as horizontal lines either side of the rectangle (http://oxforddictionaries.com/us/definition/ english/box-plot)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Branch-and-Bound

A

a general algorithm for finding optimal solutions of various optimization problems; consists of a system enumeration of all candidate solutions where large subsets of fruitless candidates are discarded en masse using upper and lower estimated bounds of the quantity being optimized (http://en.wikipedia.org/wiki/Branch_ and_bound)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Business analytics (BA)

A

refers to the skills, technologies, applications, and practices for continuous iterative exploration and investigation of past business performance to gain insight and drive business planning; can be descriptive, prescriptive, or predictive; focuses on developing new insights and understanding of business performance based on data and statistical methods (http:// en.wikipedia.org/wiki/Business_analytics and www. informs.org)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Business case

A

reasoning underlying and supporting the estimates of business consequences of an action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Business intelligence (BI)

A

a set of methodologies, processes, architectures, and technologies that transform raw data into meaningful and useful information (http://en.wikipedia.org/wiki/ Business_intelligence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Business Process Modeling or Mapping (BPM)

A

act of representing processes of an enterprise so that the current process may be analyzed and improved; typically action performed by business analysis and managers seeking improved efficiency and quality (http:// en.wikipedia.org/wiki/Business_process_modeling)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Chief Analytics Officer (CAO)

A

possible title of one overseeing analytics for a company; may include mobilizing data, people, and systems for successful deployment, working with others to inject analytics into company strategy and decisions, supervising activities of analytical people, consulting with internal business functions and units so they may take advantage of analytics, contracting with external providers of analytics (Davenport, Enterprise Analytics, p. 173)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Chi-squared Automated Interaction Detection (CHAID)

A

a technique for performing decision tree analysis developed by Gordon V. Kass. CHAID is one of several commonly used techniques for decision trees and is based upon hypothesis testing using Bonferroni correction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Classification

A

assortment of items or entities into predetermined categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Cleansing

A

AKA cleaning or scrubbing: the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database; may also involve harmonization of data, and standardization of data (http://en.wikipedia.org/wiki/Data_cleansing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Clustering

A

grouping of a set of objects in such a way that objects in the same group (cluster) are more similar to each other than to those in other groups or clusters (http:// en.wikipedia.org/wiki/Cluster_analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Combinatorial optimization

A

a topic that consists of finding an optimal object from a finite series of objects; used in applied mathematics and theoretical computer science (http://en.wikipedia.org/ wiki/Combinatorial_optimization)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Confidence interval

A

a type of interval estimate of a population parameter used to indicate the reliability of an estimate. It is an observed interval (i.e., it is calculated from the observations), in principle different from sample to sample, that frequently includes the parameter of interest if the experiment is repeated (http:// en.wikipedia.org/wiki/Confidence_interval)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Confidence level

A

if confidence intervals are constructed across many separate data analyses of repeated (and possibly different) experiments, the proportion of such intervals that contain the true value of the parameter will match the confidence level (http://www.usablestats.com/ lessons/ConfidenceLevel)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Conjoint analysis

A

allows calculation of relative importance of varying features and attributes to customers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Constraint

A

a condition that a solution to an optimization problem is required by the problem itself to satisfy. There are several types of constraints—primarily equality constraints, inequality constraints, and integer constraints (http://en.wikipedia.org/wiki/Constraint_ (mathematics))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Constraint programming

A

a programming paradigm wherein relations between variables are stated in the form of constraints (http:// en.wikipedia.org/wiki/Constraint_programming)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Continuous production

A

method of production where components are produced in a continuous stream; see also, Batch production

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Correlation

A

a broad class of statistical relationships involving dependence (http://en.wikipedia.org/wiki/Correlation_ and_dependence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Cost of capital

A

the cost of funds used for financing a business. Cost of capital depends on the mode of financing used—it refers to the cost of equity if the business is financed solely through equity, or to the cost of debt if it is financed solely through debt (www.investopedia.com)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Cube

A

see OLAP cube

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Cumulative density function

A

probability that a real-valued random variable X with a given probability distribution will be found at a value less than or equal to x; used to specify the distribution of multivariate random variables (http://en.wikipedia.org/ wiki/Cumulative_distribution_function)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Cutting stock problem

A

optimization or integer linear programming problem arising from applications in industry where high production problems exist (http://en.wikipedia.org/wiki/ Cutting_stock_problem)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Data

A

(plural form of datum) values of qualitative or quantitative variables, belonging to a set of items; represented in a structure, often tabular (represented by rows and columns), a tree (a set of nodes with parent-children relationship), or a graph structure (a set of interconnected nodes); typically the results of measurements (http://en.wikipedia.org/wiki/Data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Data mining

A

relatively young and interdisciplinary field of computer science; the process of discovering new patterns from large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems; see also, KDD (Davenport, Enterprise Analytics, p. 14)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Data warehouse

A

a central repository of data that is created by integrating data from one or more disparate sources; used for reporting and data analysis (http://en.wikipedia.org/wiki/ Data_warehouse)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

Database

A

an organized collection of data organized to model relevant aspects of reality to support processes requiring this information (http://en.wikipedia.org/wiki/Database)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

Decision tree

A

graphic illustration of how data leads to decision when branches of the tree are followed to their conclusion; different branches may lead to different decisions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Decision variables

A

a decision variable represents a problem entity for which a choice must be made. For instance, a decision variable might represent the position of a queen on a chessboard, for which there are 100 different possibilities (choices) on a 10x10 chessboard or the start time of an activity in a scheduling problem. Each possible choice is represented by a value, hence the set of possible choices constitutes the domain that is associated with a variable (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

Descriptive analytics

A

prepares and analyzes historical data to identify patterns for reporting trends (http://www.informs.org/ Community/Analytics/About-Us)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

Design of experiments

A

design of any information gathering exercise where variation is present, whether under the control of the experimenter or not; see also, Experimental design (http://en.wikipedia.org/wiki/Design_of_experiments)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

Discrete event simulation

A

models the operation of a system as a discrete sequence of events in time; between events, no change in the system is assumed thus a simulation can move in time from one event to the next (http://en.wikipedia.org/wiki/ Discrete_event_simulation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

Dynamic programming

A

based on the Principle of Optimality, this was originally concerned with optimal decisions over time. For continuous time, it addresses problems in variational calculus. For discrete time, each period is sometimes called a stage, and the DP is called a multistage decision process. Here is the Fundamental Recurrence Equation for an additive process: F(t, s) = Opt{r(t, s, x) + aF(t’, s’): x in X(t, s) and s’=T(t, s, x)}, (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Effective domain

A

the domain of a function for which its value is finite (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Efficiency

A

the comparison of what is actually produced or performed with what can be achieved with the same consumption of resources (money, time, labor, etc.). It is an important factor in determination of productivity (www.businessdictionary.com)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Engagement

A

an estimate of the depth of visitor interaction against a clearly defined set of goals; may be measured through analytical models (Davenport, Enterprise Analytics, p. 73-74)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Enterprise resource planning (ERP)

A

a cross-functional enterprise system driven by an integrated suite of software modules that supports the basic internal business processes of a company (http:// en.wikipedia.org/wiki/Enterprise_resource_planning)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

ETL (extract, transform, load)

A

refers to three separate functions combined into a single programming tool. First, the extract function reads data from a specified source database and extracts a desired subset of data. Next, the transform function works with the acquired data—using rules or lookup tables, or creating combinations with other data—to convert it to the desired state. Finally, the load function is used to write the resulting data (either all of the subset or just the changes) to a target database, which may or may not previously exist (http://searchdatamanagement. techtarget.com/definition/extract-transform-load)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Experimental design

A

in quality management, a written plan that describes the specifics for conducting an experiment, such as which conditions, factors, responses, tools, and treatments are to be included or used; see also, Design of experiments (www.businessdictionary.com)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Expert systems

A

a computer program that simulates the judgment and behavior of a human or an organization that has expert knowledge and experience in a particular field. Typically, such a system contains a knowledge base containing accumulated experience and a set of rules for applying the knowledge base to each particular situation that is described to the program (http://searchcio-midmarket. techtarget.com/definition/expert-system)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Factor analysis

A

a statistical method used to describe variability among observed, correlated variables in terms of a potentially lower number of unobserved variables called factors. Factor analysis searches for such joint variations in response to unobserved latent variables (http:// en.wikipedia.org/wiki/Factor_analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

Failure Mode and Effects Analysis (FMEA)

A

a systematic, proactive method for evaluating a process to identify where and how it might fail, and to assess the relative impact of different failures to identify the parts of the process that are most in need of change (http://intranet.uchicago.edu/quality/ FailureModesandEffectsAnalysis_FMEA_1.pdf)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

Fixed cost

A

a cost that is some value, say C, regardless of the level as long as the level is positive; otherwise the fixed charge is zero. This is represented by Cv, where v is a binary variable. When v = 0, the fixed charge is 0; when v = 1, the fixed charge is C. An example is whether to open a plant (v = 1) or not (v = 0). To apply this fixed charge to the non-negative variable x, the constraint x 0), x = 0 is forced by the upper bound constraint. If v = 1 (e.g., plant is open), x

67
Q

Forecasting

A

the use of historic data to determine the direction of future trends (http://www.investopedia.com/terms/f/ forecasting.asp)

68
Q

Fuzzy logic

A

a form of mathematical logic in which truth can assume a continuum of values between 0 and 1 (http:// wordnetweb.princeton.edu/perl/webwn?s=fuzzy logic)

69
Q

Game Theory

A

in general, a (mathematical) game can be played by one player, such as a puzzle, but its main connection with mathematical programming is when there are at least two players, and they are in conflict. Each player chooses a strategy that maximizes his payoff. When there are exactly two players and one player’s loss is the other’s gain, the game is called zero sum. In this case, a payoff matrix A is given where Aij is the payoff to player 1, and the loss to player 2, when player 1 uses strategy i and player 2 uses strategy j. In this representation each row of A corresponds to a strategy of player 1, and each column corresponds to a strategy of player 2. If A is m × n, this means player 1 has m strategies, and player 2 has n strategies (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

70
Q

Genetic algorithms

A

a class of algorithms inspired by the mechanisms of genetics, which has been applied to global optimization (especially for combinatorial programs). It requires the specification of three operations (each is typically probabilistic) on objects, called “strings” (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society. informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.) Originally authored by Harvey J. Greenberg, 1999-2006.)

71
Q

Global optimal

A

refers to mathematical programming without convexity assumptions, which are NP-hard. In general, there could be a local optimum that is not a global optimum. Some authors use this term to imply the stronger condition there are multiple local optima. Some solution strategies are given as heuristic search methods (including those that guarantee global convergence, such as branch and bound). As a process associated with algorithm design, some regard this simply as attempts to assure convergence to a global optimum (unlike a purely local optimization procedure, like steepest ascent). (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006. See the supplement by J.D. Pintér.)

72
Q

Goodness of fit

A

degree of assurance or confidence to which the results of a sample survey or test can be relied upon for making dependable projections. Described as the degree of linear correlation of variables, it is computed with the statistical methods such as chi-square test or coefficient of determination (www.businessdictionary.com)

73
Q

Graphical User Interface (GUI)

A

a human–computer interface (i.e., a way for humans to interact with computers) that uses windows, icons, and menus, and that can be manipulated by a mouse (and often to a limited extent by a keyboard as well) (http:// www.linfo.org/gui.html)

74
Q

Greedy heuristics

A

an algorithm that follows the problem-solving heuristic of making the locally-optimal choice at each stage with the hope of finding a global optimum (http:// en.wikipedia.org/wiki/Greedy_heuristic)

75
Q

Heuristic

A

in mathematical programming, this usually means a procedure that seeks an optimal solution but does not guarantee it will find one, even if one exists. It is often used in contrast to an algorithm, so branch and bound would not be considered a heuristic in this sense. In AI, however, a heuristic is an algorithm (with some guarantees) that uses a heuristic function to estimate the “cost” of branching from a given node to a leaf of the search tree (Also, in AI, the usual rules of node selection in branch and bound can be determined by the choice of heuristic function: best-first, breadth-first, or depth-first search) (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

76
Q

Histogram

A

graphic depiction of data using columns to represent relative size/importance of data grouping

77
Q

Hypothesis testing

A

the theory, methods, and practice of testing a hypothesis by comparing it with the null hypothesis. The null hypothesis is only rejected if its probability falls below a predetermined significance level, in which case the hypothesis being tested is said to have that level of significance (https://www.google.com/#psj=1&q=hypoth esis+testing+definition)

78
Q

Influence diagram

A

depicts structure of decision process and notes the data needed to make the decision

79
Q

INFORMS

A

the largest professional society in the world for professionals in the field of operations research (OR), management science, and analytics (www.informs.org/ About)

80
Q

Innovative Applications in Analytics Award

A

award administered by the Analytics Section of INFORMS to recognize creative and unique developments, applications, or combinations of analytical techniques. The prize promotes the awareness of the value of analytics techniques in unusual applications, or in creative combination to provide unique insights and/or business value (http:// www.informs.org/Community/Analytics/News-Events2/ Innovative-Applications-in-Analytics-Award)

81
Q

Integer program

A

the variables are required to be integer-valued. Historically, this term implied the mathematical program was otherwise linear, so one often qualifies a nonlinear integer program versus a linear IP (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society. informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

82
Q

Integrity

A

the measure of the trust that can be placed in the correctness of the information supplied by a navigation system (http://www.navipedia.net/index.php/Integrity; http://www.genengnews.com/gen-articles/preserving-the-integrity-of-statistics/3081/)

83
Q

Internal rate of return (IRR)

A

the rate of growth that a project or investment is expected to create, expressed as a percentage, over a specified term. IRR is, in essence, the theoretical interest rate earned by the project (http://www.askjim.biz/ answers/internal-rate-of-return-irr-definition_4754.php)

84
Q

KDD

A

acronym for knowledge discovery in databases process; see also, Data mining (Davenport, Enterprise Analytics, p. 14)

85
Q

Knapsack problem

A

an integer program of the form, Max{cx: x in Zn+ and ax 0. The original problem models the maximum value of a knapsack that is limited by volume or weight (b), where x_j = number of items of type j put into the knapsack at unit return c_j, that uses a_j units per item (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

86
Q

Lead time

A

time between the initial phase of a process and the emergence of results, as between the planning and completed manufacture of a product (http://www. thefreedictionary.com/lead+time)

87
Q

Lean production

A

a Japanese approach to management that focuses on cutting out waste while ensuring quality. This approach can be applied to all aspects of a business – from design through production to distribution (http://www. tutor2u.net/business/production/introduction-to-lean-production.html)

88
Q

Lift/lift curve

A

a measure of the effectiveness of a predictive model calculated as the ratio between the results obtained with and without the predictive model; lift charts consisting of lift curve and a baseline are visuals aids for measuring model performance (http://www2.cs.uregina.ca/~dbd/ cs831/notes/lift_chart/lift_chart.html)

89
Q

Linear program

A

opt{cx: Ax = b, x >= 0}. (Other forms of the constraints are possible, such as Ax

90
Q

Little’s Law

A

queuing theory where numerator and denominator are halved so queues are roughly equivalent no matter how many are in line; the long-term average number of customers in a stable system L is equal to the long-term average effective arrival rate, ?, multiplied by the (Palm) average time a customer spends in the system, W; or expressed algebraically: L = ?W. The relationship is not influenced by the arrival process distribution, the service distribution, the service order, or practically anything else. (http://en.wikipedia.org/wiki/Little’s_law )

91
Q

Local optimal

A

a solution that is optimal (either maximal or minimal) within a neighboring set of candidate solutions (http:// en.wikipedia.org/wiki/Local_optimum)

92
Q

Logistic regression

A

a type of probabilistic classification model [1] used for predicting the outcome of a categorical dependent variable (i.e., a class label) based on one or more predictor variables (features). Logistic regression can be binomial or multinomial. Binomial or binary logistic regression deals with situations in which the observed outcome for a dependent variable can have only two possible types (for example, “dead” versus “alive”). Multinomial logistic regression deals with situations where the outcome can have three or more possible types (e.g., “better” versus “no change” versus “worse”) (http://en.wikipedia.org/wiki/Logistic_ regression)

93
Q

Machine learning

A

an artificial intelligence (AI) discipline geared toward the technological development of human knowledge. Machine learning allows computers to handle new situations via analysis, self-training, observation, and experience (http://www.techopedia.com/definition/8181/ machine-learning)

94
Q

MANOVA

A

acronym for multivariate analysis of variance for use with multiple independent variables

95
Q

Mean

A

the arithmetic average of a set of values or distribution; however, for skewed distributions, the mean is not necessarily the same as the middle value (median), or the most likely (mode); see also, Average (http:// en.wikipedia.org/wiki/Mean)

96
Q

Mean squared error (MSE)

A

the unbiased estimator of population variance. MSE divides by the error degrees of freedom, e.g., if only the mean is estimated, MSE divides by N-1, if four parameters are estimated, MSE divides by N-4, and so on (http://en.wikipedia.org/wiki/Mean_squared_error )

97
Q

Mean time between failures (MTBF)

A

a measure of how reliable a hardware product or component is. For most components, the measure is typically in thousands or even tens of thousands of hours between failures (http://whatis.techtarget.com/ definition/MTBF-mean-time-between-failures)

98
Q

Median

A

the value such that the number of terms having values greater than or equal to it is the same as the number of terms having values less than or equal to it (http:// searchdatacenter.techtarget.com/definition/statistical-mean-median-mode-and-range)

99
Q

Metaheuristics

A

a general framework for heuristics in solving hard problems. The idea of ``meta’’ is that of level. An analogy is the use of a metalanguage to explain a language. For computer languages, we use symbols, like brackets, in the metalanguage to denote properties of the language being described, such as parameters that are optional. Examples of metaheuristics are: Ant Colony Optimization, Genetic Algorithms, Memetic Algorithms, Neural networks, etc. (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

100
Q

Mode

A

value of the term that occurs the most often (http:// searchdatacenter.techtarget.com/definition/statistical-mean-median-mode-and-range)

101
Q

Monte Carlo simulation

A

a computerized mathematical technique that allows people to account for risk in quantitative analysis and decision making. The technique is used by professionals in such widely disparate fields as finance, project management, energy, manufacturing, engineering, research and development, insurance, oil and gas, transportation, and the environment (http://www. palisade.com/risk/monte_carlo_simulation.asp)

102
Q

Net present value

A

value in today’s currency of an item or service (Davenport, Enterprise Analytics, p. 22)

103
Q

Network optimization

A

the process of striking the best possible balance between network performance and network costs, in consideration of grade of service requirements (www. yourdictionary.com)

104
Q

Next best offer (NBO)

A

a targeted offer or proposed action for customers based on analyses of past history and behavior, other customer preferences, purchasing context, attributes of the produces, or services from which they can choose (Davenport, Enterprise Analytics, p. 83)

105
Q

Nominal group technique (NGT)

A

a structured method for group brainstorming that encourages contributions from everyone (http://asq. org/learn-about-quality/idea-creation-tools/overview/ nominal-group.html)

106
Q

Normalization

A

splits up data to avoid redundancy (duplication) by moving commonly repeating groups of data into new tables. Normalization therefore tends to increase the number of tables that need to be joined to perform a given query, but reduces the space required to hold the data and the number of places where it needs to be updated if the data changes (http://en.wikipedia.org/ wiki/Snowflake_schema)

107
Q

Objective function

A

the (real-valued) function to be optimized. In a mathematical program in standard form, this is denoted f (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

108
Q

OLAP

A

an abbreviation for “Online Analysis and Processing”; a type of database technology that has long been used by the business community to analyze and interactively explore large financial data sets. The basic idea is that data sets are viewed as cubes with hierarchies along each axis (http://biolap.sourceforge.net/whitepaper.pdf)

109
Q

OLAP cube

A

an array of data understood in terms of its zero or more dimensions; each cell of the cube holds a number that represents some measure of the business, such as sales, profits, expenses, budget, and forecast (http:// en.wikipedia.org/wiki/OLAP_cube)

110
Q

Operations management

A

deals with the design and management of products, processes, services, and supply chains. It considers the acquisition, development, and utilization of resources that firms need to deliver the goods and services their clients want (http://mitsloan.mit.edu/omg/om-definition. php)

111
Q

Operations Research

A

a discipline that deals with the application of advanced analytical methods to help make better decisions (http:// en.wikipedia.org/wiki/Operations_research)

112
Q

Opportunity cost

A

the cost of an alternative that must be forgone to pursue a certain action (http://www.investopedia.com/terms/o/ opportunitycost.asp)

113
Q

Optimization

A

procedure or procedures used to make a system or design as effective or functional as possible, especially the mathematical techniques involved (http://www. thefreedictionary.com/optimization)

114
Q

Pareto concept

A

See, 80/20 rule

115
Q

Pattern recognition

A

in machine learning, pattern recognition is the assignment of a label to a given input value (http:// en.wikipedia.org/wiki/Pattern_recognition)

116
Q

Payback

A

the length of time required to recover the cost of an investment (http://www.investopedia.com/terms/p/ paybackperiod.asp)

117
Q

Pie chart

A

graphic depiction of data using a pie with different ‘slices’ to represent the relative size of different groupings of data points to the size of the whole

118
Q

Precision

A

the degree to which repeated measurements under unchanged conditions show the same results (http:// en.wikipedia.org/wiki/Accuracy_and_precision)

119
Q

Predictive analytics

A

any approach to data mining with four attributes: an emphasis on prediction (rather than description, classification, or clustering), rapid analysis measured in hours or days (rather than the stereotypical months of traditional data mining), an emphasis on the business relevance of the resulting insights (no ivory tower analyses), and (increasingly) an emphasis on ease of use, thus making the tools accessible to business users (http://www.gartner.com/it-glossary/predictive-analytics)

120
Q

Prescriptive analytics

A

evaluates and determines new ways of operating targeting business objective and balancing all constraints (http://www.informs.org/Community/ Analytics/About-Us)

121
Q

Pricing

A

a tactic in the simplex method, by which each variable is evaluated for its potential to improve the value of the objective function. Let p = c_B[B^-1], where B is a basis, and c_B is a vector of costs associated with the basic variables. The vector p is sometimes called a dual solution, though it is not feasible in the dual before termination; p is also called a simplex multiplier or pricing vector. The price of the jth variable is c_j - pA_j. The first term is its direct cost (c_j) and the second term is an indirect cost, using the pricing vector to determine the cost of inputs and outputs in the activity’s column (A_j). The net result is called the reduced cost, and its value determines whether this activity could improve the objective value (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

122
Q

Principal Component Analysis (PCA)

A

a dimension-reduction tool that can be used to reduce a large set of variables to a small set that still contains most of the information in the large set (ftp://statgen. ncsu.edu/pub/thorne/molevoclass/AtchleyOct19.pdf)

123
Q

Probability density function

A

the equation used to describe a continuous probability distribution (http://stattrek.com/statistics/dictionary. aspx?definition=Continuous_probability_distribution)

124
Q

Problem assessment/ framing

A

initial step in the analytics process; involves buy in from all parties involved on what the problem is before a solution can be found

125
Q

Project management

A

the application of knowledge, skills, and techniques to execute projects effectively and efficiently. A strategic competency for organizations, enabling them to tie project results to business goals (http://www.pmi.org/ About-Us/About-Us-What-is-Project-Management.aspx)

126
Q

Proprietary data

A

data that no other organization possesses; produced by a company to enhance its competitive posture (Davenport, Enterprise Analytics, p. 37)

127
Q

Queuing theory

A

mathematical study of waiting in lines; results are used when making business decisions about the resources needed to provide service; research begun by A. K. Erlang (http://en.wikipedia.org/wiki/Queuing_theory on 2/20/13)

128
Q

Random

A

of or characterizing a process of selection in which each item of a set has an equal probability of being chosen (http://dictionary.reference.com/browse/random)

129
Q

Range

A

the difference between the maximum and minimum observations providing an estimate of the spread of the data (http://explorable.com/range-in-statistics)

130
Q

Regression

A

a statistical measure that attempts to determine the strength of the relationship between one dependent variable (usually denoted by Y) and a series of other changing variables (known as independent variables) (http://www.investopedia.com/terms/r/regression.asp)

131
Q

Regression analysis

A

statistical approach to forecasting change in a dependent variable (e.g., sales revenue) on the basis of change in one or more independent variables (e.g., population and income); AKA curve fitting or line fitting (www.businessdictionary.com)

132
Q

Response surface methodology (RSM)

A

a surface in (n+1) dimensions that represents the variations in the expected value of a response variable (see, regression) as the values of n explanatory variables are varied. Usually the interest is in finding the combination that gives a global maximum (or minimum) (http://www.answers.com/topic/response-surface)

133
Q

Return on investment (ROI)

A

calculations that provide a basis for comparison with other investment opportunities; typically calculated using ROI = ((Total value/benefits) – (total investment costs))/Total investment costs (Davenport; Enterprise Analytics, p. 20)

134
Q

Revenue management

A

the science and art of enhancing revenues while selling essentially the same amount of product (http://www.ivey. uwo.ca/faculty/Peter_Bell/RM%20Ahmedabad%202005. pdf)

135
Q

RFM

A

data related to customer relationship management; refers to recency, frequency, and monetary value of purchases (Davenport, Enterprise Analytics, p. 49)

136
Q

Risk

A

the potential of loss (an undesirable outcome, however not necessarily so) resulting from a given action, activity, and/or inaction (http://en.wikipedia.org/wiki/Risk)

137
Q

Robust optimization

A

a term given to an approach to deal with uncertainty, similar to the recourse model of stochastic programming, except that feasibility for all possible realizations (called scenarios) is replaced by a penalty function in the objective. As such, the approach integrates goal programming with a scenario-based description of problem data (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society. informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

138
Q

Scatter plot

A

graphic depiction of data, used to show/identify relationship between independent variables

139
Q

Scenario analysis

A

a process of analyzing possible future events by considering alternative possible outcomes (scenarios). The analysis is designed to allow improved decision making by allowing more complete consideration of outcomes and their implications (http://www. investordictionary.com/definition/scenario-analysis#sthash.f03iNGP9.dpuf)

140
Q

Scheduling

A

a schedule for a sequence of jobs, say j1,…,jn, is a specification of start times, say t1,…,tn, such that certain constraints are met. A schedule is sought that minimizes cost and/or some measure of time, like the overall project completion time (when the last job is finished) or the tardy time (amount by which the completion time exceeds a given deadline). There are precedence constraints, such as in the construction industry, where a wall cannot be erected until the foundation is laid (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

141
Q

Sensitivity analysis

A

the concern with how the solution changes if some changes are made in either the data or in some of the solution values (by fixing their value). Marginal analysis is concerned with the effects of small perturbations, maybe measurable by derivatives. Parametric analysis is concerned with larger changes in parameter values that affect the data in the mathematical program, such as a cost coefficient or resource limit (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society. informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

142
Q

Shadow price

A

an economic term to denote the rate at which the optimal value changes with respect to a change in some right-hand side that represents a resource supply or demand requirement (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

143
Q

Simulated annealing

A

an algorithm for solving hard problems, notably combinatorial programs, based on the metaphor of how annealing works: reach a minimum energy state upon cooling a substance, but not too quickly in order to avoid reaching an undesirable final state. As a heuristic search, it allows a nonimproving move to a neighbor with a probability that decreases over time. The rate of this decrease is determined by the cooling schedule, often just a parameter used in an exponential decay (in keeping with the thermodynamic metaphor). With some (mild) assumptions about the cooling schedule, this will converge in probability to a global optimum (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary.computing.society. informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

144
Q

Six Sigma

A

a set of strategies, techniques, and tools for process improvement. It seeks to improve the quality of process outputs by identifying and removing the causes of defects (errors) and minimizing variability in manufacturing and business processes (http:// en.wikipedia.org/wiki/Six_Sigma)

145
Q

Spreadsheet analysis

A

the analysis of data using special computer software to anticipate marketing performance under a given set of circumstances (http://www.marketing-dictionary.com/s. php)

146
Q

Standard deviation

A

measure of the unpredictability of a random variable, expressed as the average deviation of a set of data from its arithmetic mean and computed as the positive square root of the variance. Customarily represented by the lower-case Greek letter sigma (?), it is considered the most useful and important measure of dispersion that has all the essential properties of the variance plus the advantage of being determined in the same units as those of the original data. Also called root mean square (RMS) deviation (www.businessdictionary.com)

147
Q

Statistical significance

A

probability of obtaining a test result that occurs by chance and not by systematic manipulation of data (www.businessdictionary.com)

148
Q

Statistics

A

branch of mathematics concerned with collection, classification, analysis, and interpretation of numerical facts, for drawing inferences on the basis of their quantifiable likelihood (probability). Statistics can interpret aggregates of data too large to be intelligible by ordinary observation because such data (unlike individual quantities) tend to behave in regular, predictable manner. It is subdivided into descriptive and inferential statistics (www.businessdictionary.com)

149
Q

Stepwise regression

A

a semi-automated process of building a model by successively adding or removing variables based solely on the t-statistics of their estimated coefficients (http:// people.duke.edu/~rnau/regstep.htm)

150
Q

Supply chain management

A

the active management of supply chain activities to maximize customer value and achieve a sustainable competitive advantage (http://scm.ncsu.edu/scm-articles/article/what-is-supply-chain-management)

151
Q

System dynamics

A

a computer-aided approach to policy analysis and design. It applies to dynamic problems arising in complex social, managerial, economic, or ecological systems (http://www.systemdynamics.org/what_is_ system_dynamics.html)

152
Q

Tolerance

A

an approach to sensitivity analysis in linear programming that expresses the common range that parameters can change while preserving the character of the solution (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

153
Q

Traveling salesman problem (TSP)

A

given n points and a cost matrix [cij], a tour is a permutation of the n points. The points can be cities, and the permutation the visitation of each city exactly once, then returning to the first city (called home). (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

154
Q

Uncertainty

A

the estimated amount or percentage by which an observed or calculated value may differ from the true value (http://www.thefreedictionary.com/uncertainty)

155
Q

Validation (of a model)

A

determining how well the model depicts the real-world situation it is describing (http://www.easterbrook.ca/ steve/2010/11/the-difference-between-verification-and-validation/)

156
Q

Variability

A

describes how spread out or closely clustered a set of data is (http://en.wikipedia.org/wiki/Variability)

157
Q

Variable cost

A

a periodic cost that varies in step with the output or the sales revenue of a company. Variable costs include raw material, energy usage, labor, distribution costs, etc. (http://www.businessdictionary.com/definition/variable-cost.html)

158
Q

Variance

A

a parameter in a distribution that describes how far the values are spread apart. Variance is a characteristic of some probability distribution, which distinguishes the concept of variance from the ways to estimate it from sample data(http://en.wikipedia.org/wiki/Variance)

159
Q

Variation reduction

A

reference to process variation where reduction leads to stable and predication process results (http://www. businessdictionary.com)

160
Q

Vehicle routing problem (VRP)

A

finding optimal delivery routes from one or more depots to a set of geographically scattered points (e.g., population centers). A simple case is finding a route for snow removal, garbage collection, or street sweeping (without complications, this is akin to a shortest path problem). In its most complex form, the VRP is a generalization of the TSP, as it can include additional time and capacity constraints, precedence constraints, and more (A. Holder, editor. Mathematical Programming Glossary. INFORMS Computing Society, http://glossary. computing.society.informs.org/, 2006-08. Originally authored by Harvey J. Greenberg, 1999-2006.)

161
Q

Verification (of a model)

A

includes all the activities associated with the producing high quality software: testing, inspection, design analysis, specification analysis (http://www.easterbrook. ca/steve/2010/11/the-difference-between-verification-and-validation/)

162
Q

Web analytics

A

ability to use data generated through Internet-based activities; typically used to assess customer behaviors; see also, RFM (Davenport, Enterprise Analytics, p. 49-51)

163
Q

Yield

A

percentage of ‘good’ product in a batch; has three main components: functional (defect driven), parametric (performance driven), and production efficiency/ equipment utilization (http://www-inst.eecs.berkeley. edu/~ee290h/fa05/Lectures/PDF/lecture%201%20 intro%20IC%20Yield.pdf)