Pre-Assessment Flashcards

1
Q

Which activity does an analyst perform in the discovery phase of the data analytics life cycle? Collecting data / Cleaning data / Identifying outliers / Identifying business needs

A

Identifying business needs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In which phase of the data analytics life cycle does an analyst build a histogram? Data acquisition / Data exploration / Discovery / Predictive modeling

A

Data exploration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

An analyst applies a statistical formula to obtain the average temperature for a city over the last 50 years. Which phase of the data analytics life cycle is represented by this activity? Data acquisition / Exploratory data analysis / Predictive modeling / Data reporting

A

Exploratory data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

An analyst has been tasked with defining data columns that could contain null values. Which activity of the data acquisition phase is represented? Collecting data / Disqualifying data sources / Detecting missing values / Transforming improperly formatted text

A

Detecting missing values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which activity in the data analytics life cycle occurs during the data acquisition phase and requires themost time and effort from the data analyst? Selecting the data sources / Importing data into a database / Cleaning data / Defining goals

A

Cleaning data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What might be developed by data analysts when acquiring data from a data warehouse? The procedures for extracting files from the data warehouse / The procedures for updating tables in the data warehouse / The relational structure of tables / The SQL queries of data within the tables

A

The SQL queries of data within the tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What can be identified using a box plot? Frequency / Correlation / Interquartile range / Mean

A

Interquartile range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What will be a consequence of poor attention to detail during the data exploration/ phase? Not enough variables will be considered in the analysis. / The outcome of the analysis will be misaligned to business needs. / The analyst will lack insight into the/ structure of the data set. / The model will be built using the wrong data set.

A

The analyst will lack insight into the structure of the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which aspect of data exploration occurs when an analyst writes code to compile a bar graph of dog foodsales per month? Performance of a correlation analysis / Analysis of data anomalies / Verification through visualization / Determination of variabilities

A

Verification through visualization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

An oil company uses robots and sensors to detect how pipeline corrosion changes over time. The collecteddata is then used in a predictive model that estimates when a pipe should be replaced. How does the predictive model serve this oil company? To minimize interruptions from maintenance shutdowns / To minimize the need for workforce safety training / To improve compliance with pipeline construction standards / To improve compliance with pipeline disposal standards

A

To minimize interruptions from maintenance shutdowns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

During which phase in the data analytics life cycle would a churn analysis be performed? Data cleaning / Data acquisition / Predictive analysis / Representation and reporting

A

Predictive analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which mistake is commonly made during the predictive analytics phase? The data are separated into different sets. / The variables are separated into response and independent variables. / The data are prepared before the model is developed. / The model is developed before the research question is known.

A

The model is developed before the research question is known.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why might a data analyst resample a data set with replacement data in a data mining project? Misidentification of causation due to correlation / Wrong variables chosen for analyzation / Too little data for training and testing data sets / Skewed data resulting from outliers

A

Too little data for training and testing data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A data analyst has identified combinations of sales transactions that frequently occur together in dataover the past 5 years. Which phase of the data analytics life cycle is represented by this analysis? Data acquisition / Representation and reporting / Data mining / Predictive modeling

A

Data mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

An analyst realizes that the data set has been reduced significantly, resulting in sample sizes that are toosmall. In which phase of the data analytics life cycle did this likely occur? Data exploration / Data modeling / Data mining / Data discovery

A

Data mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What strategy will contribute to effective data representation and reporting? Creating a new training data set / Selecting data for a prediction model / Excluding unrelated data / Extracting data from source repositories

A

Excluding unrelated data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are TWO purposes of the reporting phase of the data analytics life cycle? Provide the conclusions from the analysis in an engaging manner / Provide a tool for decision-makers to import and analyze more data / Provide actionable insights that can inform decision-making / Provide an automated way for decision-makers to test their own models

A

Provide the conclusions from the analysis in an engaging manner AND Provide actionable insights that can inform decision-making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

During which phase of the data analytics life cycle does an analyst create a story to report data? Data acquisition / Data mining / Data reporting / Data cleaning

A

Data reporting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Whatis a common duty ofa database administrator? Set projecttimelines,milestones, and goals / Acquire funding for data analytics projects / Maintain data on the IT infrastructure / Define business needs at the onset of a project

A

Maintain data on the IT infrastructure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is an example of an external stakeholder for a data analytics project? President/CEO / Projectmanager / Regulatory body / Data analyst’s supervisor

A

Regulatory body

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Which party has the primary vision for a data analytics project and brings resources to complete it? Project sponsors / Project managers / Customers / Data analysts

A

Project sponsors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Whatdoes the critical pathrepresent indata analytics project management? Minimum time to complete independent tasks / Maximum time to complete independent tasks / Minimum time to completedependent tasks / Maximum time to completedependent tasks

A

Minimum time to complete dependent tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

A data analytics project manager has been asked to complete a project on a very short timeline.Whichaction is likely to yieldpositiveresults? Outsourcetheskilledwork to an unprovenvendor / Expand the team with experienced staff / Requirecurrent teamto work overtime / Accept lowered quality standards

A

Expand the team with experienced staff

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Whichtype of project management problemoccurs whenadata mining task has started but a dataacquisition task has not been completed? Scope / Schedule / Procedure / Cost

A

Schedule

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
How can an organization improve interprofessional communication among team members?  By setting work priorities for team members  / By requiring weekly updates on project deadlines / By using tools that provide a team-based collaboration space / By ensuring employees can recite the desired outcomes
By using tools that provide a team-based collaboration space
26
A data analyst needs to contact a specific member of the database administration team. Which method should be used to discover the person’s email address?  Ask the project’s customers  / Ask the project’s sponsors  / Send an email to project stakeholders  / Send an email to the team member’s manager
Send an email to the team member’s manager
27
Which feature is commonly found in collaboration tools like Jira, Slack, Teams, and PivotalTracker?  Real-time messaging / Multivariate analysis  / Equation editor  / Source code management 
Real-time messaging
28
Which action can the project manager take to keep the team engaged in the analytics project?  At the end of the project, the team publishes an extensive research reportand includes it in an email to project stakeholders.  / Throughout the project, the project manager communicates insights from thedata analytics team and provides ideas of ways to act on those insights. / At the end of the project, the project manager sends an email with the predictivemodel to the stakeholders so they can use it.  / Throughout the project, the project manager holds regular meetings so the entiredata analytics team can showcase their work to different departments. 
Throughout the project, the project manager communicates insights from thedata analytics team and provides ideas of ways to act on those insights.
29
What is an effective method for a data analyst to prepare for a one-on-one meeting with a manager?  Make a written list of all source code comments  / Ask other inside employees about the manager’s reputation  / Bring a set of questions to draw on to keep the conversation going / Create an essay summarizing steps in the source code  
Bring a set of questions to draw on to keep the conversation going
30
What is a characteristic of active listening?  Actively working on a task while listening to the speaker  / Seeking to understand the speaker’s emotions and intent / Focusing intently on the content of the message  / Waiting patiently to share one’s own thoughts  
Seeking to understand the speaker’s emotions and intent
31
Which circumstance could cause a data analyst to have difficulty developing a model to answer a businessquestion? Project scope creep / Poor project budgeting  / Lack of relevant data sources / Lack of stakeholder support 
Lack of relevant data sources
32
A data analytics project team is preparing to develop a predictive model that will be included within abusiness intelligence tool for upper management. Which step should be considered for inclusion when creating the project schedule?  Model testing and validation for users  / Business intelligence tool interface training  / Model training and testing for stakeholders  / Business intelligence tool data transformation training
Business intelligence tool interface training
33
Which task would an analyst consider first during the discovery phase of the data analytics lifecycle?  Seek out necessary data sources. / Formulate a project plan. / Identify project goals. / Develop key metrics.
Identify project goals.
34
Numerical measurements of the amount of a toxic chemical substance are recorded in a large database. Which hypothesis can the data analyst answer through exploratory data analytic methods?  The chemical will not cause harm to the habitat’s native species.  / The chemical contamination is a result of human activity.  / The statistical distribution of the chemical measurements is normal. / The best analytic approach for analyzing the data is linear regression.
The statistical distribution of the chemical measurements is normal.
35
A restaurant owner wants to sponsor a data analytics project to provide insights regarding hamburgersales before developing a strategy for increasing sales. Which question is framed appropriately for the data analytics project?  What are the characteristics of customers who buy hamburgers?   / What does the supply and demand curve look like for hamburgers?   / Which discount coupons should we send to neighborhood residents?   / Which varieties of hamburgers are featured by competitors?  
What are the characteristics of customers who buy hamburgers?
36
Which organizational objective could be accomplished with a descriptive data analyticsproject using website request logs as a data source?  Explain why web data transfer has increased 25%  / Estimate the traffic increase for a new product launch  / Improve the speed of server request processing  / Recommend a strategy to increase network capacity  
Explain why web data transfer has increased 25%
37
A travel website tabulated the results of their latest marketing campaign to understand the relationship ofclicks-to-sales conversions. Which area of analytics does this activity represent?  Prescriptive  / Proactive  / Descriptive / Predictive 
Descriptive
38
An analyst is looking at data that includes the customer’s address, date of purchase, and age. Which question could be answered from this data?  Which customer has spent the highest dollar amount?   / Which customer is most likely to respond favorably to the next marketingcampaign?   / Which state has the highest total customers? / Which product has sold the most in a certain state?  
Which state has the highest total customers?
39
Which outcome should be expected when working with data aggregated from multiple sources? Select TWO answers.Consistently named fields / Inconsistently named fields / Data needs cleaning / Data does not need cleaning
Inconsistently named fields AND Data needs cleaning
40
Which technique can a project manager use to foster the identification of quality data analytics questions? Organized project planning / Rigorous data cleaning / Frequent collaboration with the team / Acquisition of abundant project resources
Frequent collaboration with the team
41
A data analyst notices that the data selected for an analytics project is slightly misaligned with theresearch question. How can the data analyst resolve this situation?  Halt the data analytics project to pursue a new research question / Dive deeper into the data to identify data quality issues / Adjust the research question to reframe the analysis / Transform the data to a new metric
Adjust the research question to reframe the analysis
42
An analyst has been asked to analyze the open-ended responses from customers on a satisfaction survey. Which type of data is the analyst working with on this project? Transactional / Secondary / Qualitative / Quantitative
Qualitative
43
A U.S. company collects and sells information on consumers. Which law prevents the company from collecting information on European Union consumers withouttheir permission? Electronic Communications Privacy Act / General Data Protection Regulation / Stored Communication Act / Information Nondiscrimination Act
General Data Protection Regulation
44
A consumer sues an entertainment streaming company for leaking personal information regarding herviewing habits. Which ASA ethical standard did the streaming company violate? Conflict of interest / Biases / Privacy / Unfair discrimination
Privacy
45
A specific drug is manufactured for the treatment of depression. The company decides toignore research results on an alternative, less expensive, drug treatment in order to make higher profits. Which ASA ethical standard has the company violated?  Unfair discrimination / Reproducible results / Conflict of interest / Transparent assumptions
Conflict of interest
46
What do open-source software tools and widely available analysis tools, such as spreadsheets,help accomplish?  Data schemas  / Data democratization / Data security  / Data compliance 
Data democratization
47
What is a feature of SQL?  Choose TWO answers. It is an object-oriented programming language.  / The basic language is the same across database servers. / It has built-in chart and graph creation.  / It is used with structured data and unstructured data.
The basic language is the same across database servers. AND It is used with structured data and unstructured data.
48
What is an example of unstructured data?  Names, dates, and addresses  / Credit card numbers that include a credit score  / Text messages that include video / Height, weight, and gender 
Text messages that include video
49
Which tool should a researcher use to conduct a univariate analysis on complex statistical data? Tableau / Power BI / R / SQL
R
50
Which statistical technique should be used to draw conclusions about an entire population based on arepresentative sample? Correlation / Bayes theorem / Hypothesis testing / Measures of central tendency 
Hypothesis testing
51
What is an example of random sampling of college students? Surveying students chosen arbitrarily from around the entire college campus / Surveying every student in the college library / Surveying students chosen arbitrarily in the library of the university / Surveying every student on campus
Surveying students chosen arbitrarily from around the entire college campus
52
Which type of analysis would be used to predict a binary outcome based on a set of independentvariables? Hypothesis testing / Descriptive statistics / Regression / Time Series
Regression
53
Which type of data analysis is appropriate if the goal is to minimize the cost of a diet, using a data setconsisting of the following variables: protein content, fat content, and cost per unit? Decision trees / Calculus / Optimization / Bayes’ theorem
Optimization
54
Which technique can be used to determine the likelihood that a positive diagnostic test result indicateswhether the disease is actually present?  Bayes’ theorem / Central limit theorem / Regression / Optimization
Bayes’ theorem
55
Which concept should be considered when choosing variables for inclusion in a linear regression model? Feasibility of merging the variables / Feasibility of controlling the variables / Feasibility of testing the variables / Feasibility of classifying the variables
Feasibility of controlling the variables
56
A neural network algorithm in machine learning endeavors to recognize underlying relationships in a setof data.  What does this process mimic? The way a computer processes data / The way the human brain operates / The way architects establish functionality / The way that social media builds networks
The way the human brain operates
57
Which characteristics are used to group data together in a cluster analysis?  Choose TWO answers.Distance / Similarity / Shape / Size
Distance AND Similarity
58
Which tool has libraries that expand its visualization capabilities?  Python / Tableau / Adobe Infographics / D3.js
Python
59
Which tools can be used for performing statistics and creating interactive datavisualization for large datasets from various sources?  Choose TWO answers. Gantt Chart / SQL / Tableau / R
Tableau AND R
60
Which type of data representation should a data analyst use to display expense categories as apercentage of total business expenses?  Map visualization / Line chart / Pie chart / Scatter plot
Pie chart