Data Driven Decisions Flashcards
Descriptive analytics
Depict and then describe the characteristics of what is being studied
Predictive analytics
Use data from the past to predict the future
Prescriptive analytics
Include experimental design and optimization to suggest a course of action
True or False?
From data mining, someone is able to make conclusions about the underlying causes of certain variables.
False
Correct. This is a false statement. Data mining is often able to find trends, but it will usually overlook the underlying causes.
True or False?
As technology improves, there will be a greater amount of raw data.
True
Correct. This statement is true. Data collection will become easier as technology improves which will lead to an increase in raw data.
Davenport-Kim three-stage model
A decision-making model developed by Thomas Davenport and Jinho Kim that consists of three stages: framing the problem, solving the problem, and communicating results
Stage 1: Problem recognition consists of the following steps:
Identifying stakeholders
Focusing on decisions
Identifying the kind of story you’re going to tell
Determining the scope of the problem
Getting specific about what you’re trying to find out
Stage 2: Solving the problem
The modeling step
The data collection step
The data analysis step
True or False?
The first step in the Davenport-Kim three-stage model is to frame the problem by recognizing what the problem is and then reviewing previous findings to begin to structure the analysis.
True
Correct. This statement is true. Stage #1 is to frame the problem by recognizing what the problem is and then reviewing previous findings to begin to structure the analysis. Stage #2 is to solve the problem. Stage #3 is the communicate your findings.
True or False?
The stage that involves the most intense statistics and data work is stage 3, communicating results.
False
Correct. This statement is false. The stage that involves the most intense statistics and data work is stage 2, solving the problem. This step includes data modeling, data collection, and data analysis.
Continuous data
Data that can lay along any point in a range of data
Discrete data
can only take on whole values and has clear boundaries.
Nominal data
sometimes called categorical data, is used to label subjects in a study. Nominal data is a type of discrete data.
Ex: The choice of crayon color: burnt sienna, prussian blue, periwinkle, apricot
Ex: Type of tape: masking, packing, Scotch, electric
Ordinal data
is a type of discrete data. It places data objects into an order according to some quality. So, the higher a data object on the scale, the more it has of a certain quality.
Ex: small, medium, and large paperclips
Ex: Level of education: some HS, HS degree/GED, some college, Bachelor’s, Masters
Interval data
Data that is ordered within a range and with each data point being an equal interval apart
Ex: Daily temperature (in Fahrenheit or Celsius)
Ex: The number that signifies the year: 2000, 1987, etc.
Ratio data
Similar to interval data in that the data that is ordered within a range and with each data point being an equal interval apart, also has a natural zero point which indicates none of the given quality.
Ex: Heights of people in your family
Ex: The time it takes the Space Shuttle to orbit once around the earth
True or False?
The following are examples of nominal data:
male/female
red/blue
living/deceased
True
Correct. This statement is true. Nominal data, sometimes called categorical data, places objects into a category.
True or False?
Interval data has an order and all the objects are an equal interval apart.
True
Correct. This statement is true. Interval data has an order and all the objects are an equal interval apart. You cannot have a natural zero point in interval data.
Data Management
The management, including cleaning and storage, of collected data.
Analytics
The discovery, analysis, and communication of meaningful patterns in data.
Big Data
A catch-phrase that describes a massive volume of data that is so large that it’s difficult to process using traditional database and software techniques.
Blind Study
A study performed where the participants are not told if they are in the treatment group or control group
Omission Error
An error because something (for example, data or survey response) is missing.
Reliable Data
Data that is consistent and repeatable
Benchmarks
Standards or points of reference for an industry or sector that can be used for comparison and evaluation.