Pre-Assessment Flashcards
Which activity does an analyst perform in the discovery phase of the data analytics life cycle? Collecting data / Cleaning data / Identifying outliers / Identifying business needs
Identifying business needs
In which phase of the data analytics life cycle does an analyst build a histogram? Data acquisition / Data exploration / Discovery / Predictive modeling
Data exploration
An analyst applies a statistical formula to obtain the average temperature for a city over the last 50 years. Which phase of the data analytics life cycle is represented by this activity? Data acquisition / Exploratory data analysis / Predictive modeling / Data reporting
Exploratory data analysis
An analyst has been tasked with defining data columns that could contain null values. Which activity of the data acquisition phase is represented? Collecting data / Disqualifying data sources / Detecting missing values / Transforming improperly formatted text
Detecting missing values
Which activity in the data analytics life cycle occurs during the data acquisition phase and requires themost time and effort from the data analyst? Selecting the data sources / Importing data into a database / Cleaning data / Defining goals
Cleaning data
What might be developed by data analysts when acquiring data from a data warehouse? The procedures for extracting files from the data warehouse / The procedures for updating tables in the data warehouse / The relational structure of tables / The SQL queries of data within the tables
The SQL queries of data within the tables
What can be identified using a box plot? Frequency / Correlation / Interquartile range / Mean
Interquartile range
What will be a consequence of poor attention to detail during the data exploration/ phase? Not enough variables will be considered in the analysis. / The outcome of the analysis will be misaligned to business needs. / The analyst will lack insight into the/ structure of the data set. / The model will be built using the wrong data set.
The analyst will lack insight into the structure of the data set.
Which aspect of data exploration occurs when an analyst writes code to compile a bar graph of dog foodsales per month? Performance of a correlation analysis / Analysis of data anomalies / Verification through visualization / Determination of variabilities
Verification through visualization
An oil company uses robots and sensors to detect how pipeline corrosion changes over time. The collecteddata is then used in a predictive model that estimates when a pipe should be replaced. How does the predictive model serve this oil company? To minimize interruptions from maintenance shutdowns / To minimize the need for workforce safety training / To improve compliance with pipeline construction standards / To improve compliance with pipeline disposal standards
To minimize interruptions from maintenance shutdowns
During which phase in the data analytics life cycle would a churn analysis be performed? Data cleaning / Data acquisition / Predictive analysis / Representation and reporting
Predictive analysis
Which mistake is commonly made during the predictive analytics phase? The data are separated into different sets. / The variables are separated into response and independent variables. / The data are prepared before the model is developed. / The model is developed before the research question is known.
The model is developed before the research question is known.
Why might a data analyst resample a data set with replacement data in a data mining project? Misidentification of causation due to correlation / Wrong variables chosen for analyzation / Too little data for training and testing data sets / Skewed data resulting from outliers
Too little data for training and testing data sets
A data analyst has identified combinations of sales transactions that frequently occur together in dataover the past 5 years. Which phase of the data analytics life cycle is represented by this analysis? Data acquisition / Representation and reporting / Data mining / Predictive modeling
Data mining
An analyst realizes that the data set has been reduced significantly, resulting in sample sizes that are toosmall. In which phase of the data analytics life cycle did this likely occur? Data exploration / Data modeling / Data mining / Data discovery
Data mining
What strategy will contribute to effective data representation and reporting? Creating a new training data set / Selecting data for a prediction model / Excluding unrelated data / Extracting data from source repositories
Excluding unrelated data
What are TWO purposes of the reporting phase of the data analytics life cycle? Provide the conclusions from the analysis in an engaging manner / Provide a tool for decision-makers to import and analyze more data / Provide actionable insights that can inform decision-making / Provide an automated way for decision-makers to test their own models
Provide the conclusions from the analysis in an engaging manner AND Provide actionable insights that can inform decision-making
During which phase of the data analytics life cycle does an analyst create a story to report data? Data acquisition / Data mining / Data reporting / Data cleaning
Data reporting
Whatis a common duty ofa database administrator? Set projecttimelines,milestones, and goals / Acquire funding for data analytics projects / Maintain data on the IT infrastructure / Define business needs at the onset of a project
Maintain data on the IT infrastructure
What is an example of an external stakeholder for a data analytics project? President/CEO / Projectmanager / Regulatory body / Data analyst’s supervisor
Regulatory body
Which party has the primary vision for a data analytics project and brings resources to complete it? Project sponsors / Project managers / Customers / Data analysts
Project sponsors
Whatdoes the critical pathrepresent indata analytics project management? Minimum time to complete independent tasks / Maximum time to complete independent tasks / Minimum time to completedependent tasks / Maximum time to completedependent tasks
Minimum time to complete dependent tasks
A data analytics project manager has been asked to complete a project on a very short timeline.Whichaction is likely to yieldpositiveresults? Outsourcetheskilledwork to an unprovenvendor / Expand the team with experienced staff / Requirecurrent teamto work overtime / Accept lowered quality standards
Expand the team with experienced staff
Whichtype of project management problemoccurs whenadata mining task has started but a dataacquisition task has not been completed? Scope / Schedule / Procedure / Cost
Schedule