Assessment Flashcards
Which activity does an analyst perform in the discovery phase of the data analytics life cycle?
Identifying business needs
In which phase of the data analytics life cycle does an analyst build a histogram?
Data exploration
An analyst applies a statistical formula to obtain the average temperature for a city over the last 50 years.
Which phase of the data analytics life cycle is represented by this activity?
Exploratory data analysis
An analyst has been tasked with defining data columns that could contain null values.
Which activity of the data acquisition phase is represented?
Detecting missing values
Which activity in the data analytics life cycle occurs during the data acquisition phase and requires the most time and effort from the data analyst?
Cleaning data
What might be developed by data analysts when acquiring data from a data warehouse?
The SQL queries of data within the tables
What can be identified using a box plot?
Interquartile range
What will be a consequence of poor attention to detail during the data exploration phase?
The analyst will lack insight into the structure of the data set.
Which aspect of data exploration occurs when an analyst writes code to compile a bar graph of dog food sales per month?
Verification through visualization
An oil company uses robots and sensors to detect how pipeline corrosion changes over time. The collected data is then used in a predictive model that estimates when a pipe should be replaced.
How does the predictive model serve this oil company?
To minimize interruptions from maintenance shutdowns
During which phase in the data analytics life cycle would a churn analysis be performed?
Predictive analysis
Which mistake is commonly made during the predictive analytics phase?
The model is developed before the research question is known.
Why might a data analyst resample a data set with replacement data in a data mining project?
Too little data for training and testing data sets
A data analyst has identified combinations of sales transactions that frequently occur together in data over the past 5 years.
Which phase of the data analytics life cycle is represented by this analysis?
Data mining
An analyst realizes that the data set has been reduced significantly, resulting in sample sizes that are too small.
In which phase of the data analytics life cycle did this likely occur?
Data mining
What strategy will contribute to effective data representation and reporting?
Excluding unrelated data
What are two purposes of the reporting phase of the data analytics life cycle?
Provide the conclusions from the analysis in an engaging manner
and
Provide actionable insights that can inform decision-making
During which phase of the data analytics life cycle does an analyst create a story to report data?
Data reporting
What is a common duty of a database administrator?
Maintain data on the IT infrastructure
What is an example of an external stakeholder for a data analytics project?
Regulatory body
Which party has the primary vision for a data analytics project and brings resources to complete it?
Project sponsors
What does the critical path represent in data analytics project management?
Minimum time to complete dependent tasks
A data analytics project manager has been asked to complete a project on a very short timeline.
Which action is likely to yield positive results?
Expand the team with experienced staff
Which type of project management problem occurs when a data mining task has started but a data acquisition task has not been completed?
Schedule
How can an organization improve interprofessional communication among team members?
By using tools that provide a team-based collaboration space
A data analyst needs to contact a specific member of the database administration team.
Which method should be used to discover the person’s email address?
Send an email to the team member’s manager
Which feature is commonly found in collaboration tools like Jira, Slack, Teams, and PivotalTracker?
Real-time messaging
Which action can the project manager take to keep the team engaged in the analytics project?
Throughout the project, the project manager communicates insights from the data analytics team and provides ideas of ways to act on those insights.
What is an effective method for a data analyst to prepare for a one-on-one meeting with a manager?
Bring a set of questions to draw on to keep the conversation going
What is a characteristic of active listening?
Seeking to understand the speaker’s emotions and intent
Which circumstance could cause a data analyst to have difficulty developing a model to answer a business question?
Lack of relevant data sources
A data analytics project team is preparing to develop a predictive model that will be included within a business intelligence tool for upper management.
Which step should be considered for inclusion when creating the project schedule?
Business intelligence tool interface training
Which task would an analyst consider first during the discovery phase of the data analytics lifecycle?
Identify project goals.
Numerical measurements of the amount of a toxic chemical substance are recorded in a large database.
Which hypothesis can the data analyst answer through exploratory data analytic methods?
The statistical distribution of the chemical measurements is normal.
A restaurant owner wants to sponsor a data analytics project to provide insights regarding hamburger sales before developing a strategy for increasing sales.
Which question is framed appropriately for the data analytics project?
What are the characteristics of customers who buy hamburgers?
Which organizational objective could be accomplished with a descriptive data analytics project using website request logs as a data source?
Explain why web data transfer has increased 25%
A travel website tabulated the results of their latest marketing campaign to understand the relationship of clicks-to-sales conversions.
Which area of analytics does this activity represent?
Descriptive
An analyst is looking at data that includes the customer’s address, date of purchase, and age.
Which question could be answered from this data?
Which state has the highest total customers?
Which outcome should be expected when working with data aggregated from multiple sources?
Select two answers.
Inconsistently named fields and Data needs cleaning
Which technique can a project manager use to foster the identification of quality data analytics questions?
Frequent collaboration with the team
A data analyst notices that the data selected for an analytics project is slightly misaligned with the research question.
How can the data analyst resolve this situation?
Adjust the research question to reframe the analysis
An analyst has been asked to analyze the open-ended responses from customers on a satisfaction survey.
Which type of data is the analyst working with on this project?
Qualitative
A U.S. company collects and sells information on consumers.
Which law prevents the company from collecting information on European Union consumers without their permission?
General Data Protection Regulation
A consumer sues an entertainment streaming company for leaking personal information regarding her viewing habits.
Which ASA ethical standard did the streaming company violate?
Privacy
A specific drug is manufactured for the treatment of depression. The company decides to ignore research results on an alternative, less expensive, drug treatment in order to make higher profits.
Which ASA ethical standard has the company violated?
Conflict of interest
What do open-source software tools and widely available analysis tools, such as spreadsheets, help accomplish?
Data democratization
What is a feature of SQL?
Choose 2 answers.
The basic language is the same across database servers and it is used with structured data and unstructured data
What is an example of unstructured data?
Text messages that include video
Which tool should a researcher use to conduct a univariate analysis on complex statistical data?
R
Which statistical technique should be used to draw conclusions about an entire population based on a representative sample?
Hypothesis testing
What is an example of random sampling of college students?
Surveying students chosen arbitrarily from around the entire college campus
Which type of analysis would be used to predict a binary outcome based on a set of independent variables?
Regression
Which type of data analysis is appropriate if the goal is to minimize the cost of a diet, using a data set consisting of the following variables: protein content, fat content, and cost per unit?
Optimization
Which technique can be used to determine the likelihood that a positive diagnostic test result indicates whether the disease is actually present?
Bayes’ theorem
Which concept should be considered when choosing variables for inclusion in a linear regression model?
Feasibility of controlling the variables
A neural network algorithm in machine learning endeavors to recognize underlying relationships in a set of data.
What does this process mimic?
The way the human brain operates
Which characteristics are used to group data together in a cluster analysis?
Choose 2 answers.
Distance and similarity
Which tool has libraries that expand its visualization capabilities?
Python
Which tools can be used for performing statistics and creating interactive data visualization for large datasets from various sources?
Choose 2 answers.
Tableau and R
Which type of data representation should a data analyst use to display expense categories as a percentage of total business expenses?
Pie chart