Pre-Assessment Flashcards

1
Q

Which activity does an analyst perform in the discovery phase of the data analytics life cycle? Collecting data / Cleaning data / Identifying outliers / Identifying business needs

A

Identifying business needs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In which phase of the data analytics life cycle does an analyst build a histogram? Data acquisition / Data exploration / Discovery / Predictive modeling

A

Data exploration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

An analyst applies a statistical formula to obtain the average temperature for a city over the last 50 years. Which phase of the data analytics life cycle is represented by this activity? Data acquisition / Exploratory data analysis / Predictive modeling / Data reporting

A

Exploratory data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

An analyst has been tasked with defining data columns that could contain null values. Which activity of the data acquisition phase is represented? Collecting data / Disqualifying data sources / Detecting missing values / Transforming improperly formatted text

A

Detecting missing values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which activity in the data analytics life cycle occurs during the data acquisition phase and requires themost time and effort from the data analyst? Selecting the data sources / Importing data into a database / Cleaning data / Defining goals

A

Cleaning data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What might be developed by data analysts when acquiring data from a data warehouse? The procedures for extracting files from the data warehouse / The procedures for updating tables in the data warehouse / The relational structure of tables / The SQL queries of data within the tables

A

The SQL queries of data within the tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What can be identified using a box plot? Frequency / Correlation / Interquartile range / Mean

A

Interquartile range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What will be a consequence of poor attention to detail during the data exploration/ phase? Not enough variables will be considered in the analysis. / The outcome of the analysis will be misaligned to business needs. / The analyst will lack insight into the/ structure of the data set. / The model will be built using the wrong data set.

A

The analyst will lack insight into the structure of the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which aspect of data exploration occurs when an analyst writes code to compile a bar graph of dog foodsales per month? Performance of a correlation analysis / Analysis of data anomalies / Verification through visualization / Determination of variabilities

A

Verification through visualization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

An oil company uses robots and sensors to detect how pipeline corrosion changes over time. The collecteddata is then used in a predictive model that estimates when a pipe should be replaced. How does the predictive model serve this oil company? To minimize interruptions from maintenance shutdowns / To minimize the need for workforce safety training / To improve compliance with pipeline construction standards / To improve compliance with pipeline disposal standards

A

To minimize interruptions from maintenance shutdowns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

During which phase in the data analytics life cycle would a churn analysis be performed? Data cleaning / Data acquisition / Predictive analysis / Representation and reporting

A

Predictive analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which mistake is commonly made during the predictive analytics phase? The data are separated into different sets. / The variables are separated into response and independent variables. / The data are prepared before the model is developed. / The model is developed before the research question is known.

A

The model is developed before the research question is known.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why might a data analyst resample a data set with replacement data in a data mining project? Misidentification of causation due to correlation / Wrong variables chosen for analyzation / Too little data for training and testing data sets / Skewed data resulting from outliers

A

Too little data for training and testing data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A data analyst has identified combinations of sales transactions that frequently occur together in dataover the past 5 years. Which phase of the data analytics life cycle is represented by this analysis? Data acquisition / Representation and reporting / Data mining / Predictive modeling

A

Data mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

An analyst realizes that the data set has been reduced significantly, resulting in sample sizes that are toosmall. In which phase of the data analytics life cycle did this likely occur? Data exploration / Data modeling / Data mining / Data discovery

A

Data mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What strategy will contribute to effective data representation and reporting? Creating a new training data set / Selecting data for a prediction model / Excluding unrelated data / Extracting data from source repositories

A

Excluding unrelated data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are TWO purposes of the reporting phase of the data analytics life cycle? Provide the conclusions from the analysis in an engaging manner / Provide a tool for decision-makers to import and analyze more data / Provide actionable insights that can inform decision-making / Provide an automated way for decision-makers to test their own models

A

Provide the conclusions from the analysis in an engaging manner AND Provide actionable insights that can inform decision-making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

During which phase of the data analytics life cycle does an analyst create a story to report data? Data acquisition / Data mining / Data reporting / Data cleaning

A

Data reporting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Whatis a common duty ofa database administrator? Set projecttimelines,milestones, and goals / Acquire funding for data analytics projects / Maintain data on the IT infrastructure / Define business needs at the onset of a project

A

Maintain data on the IT infrastructure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is an example of an external stakeholder for a data analytics project? President/CEO / Projectmanager / Regulatory body / Data analyst’s supervisor

A

Regulatory body

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Which party has the primary vision for a data analytics project and brings resources to complete it? Project sponsors / Project managers / Customers / Data analysts

A

Project sponsors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Whatdoes the critical pathrepresent indata analytics project management? Minimum time to complete independent tasks / Maximum time to complete independent tasks / Minimum time to completedependent tasks / Maximum time to completedependent tasks

A

Minimum time to complete dependent tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

A data analytics project manager has been asked to complete a project on a very short timeline.Whichaction is likely to yieldpositiveresults? Outsourcetheskilledwork to an unprovenvendor / Expand the team with experienced staff / Requirecurrent teamto work overtime / Accept lowered quality standards

A

Expand the team with experienced staff

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Whichtype of project management problemoccurs whenadata mining task has started but a dataacquisition task has not been completed? Scope / Schedule / Procedure / Cost

A

Schedule

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

How can an organization improve interprofessional communication among team members? By setting work priorities for team members / Byrequiring weekly updateson project deadlines / By using toolsthatprovideateam-based collaboration space / By ensuring employeescan recitethe desired outcomes

A

By using tools that provide a team-based collaboration space

26
Q

Adata analystneeds to contact a specific member of the database administration team. Which method should be used todiscover the person’s email address? Ask the project’s customers / Ask the project’s sponsors / Send an email to project stakeholders / Send an email to the team member’s manager

A

Send an email to the team member’s manager

27
Q

Whichfeatureiscommonly found in collaboration tools like Jira, Slack, Teams,andPivotalTracker? Real-time messaging / Multivariate analysis / Equation editor / Source code management

A

Real-time messaging

28
Q

Which actioncan the project managertaketo keep theteamengaged in the analytics project? At the end of the project,the teampublishesan extensive research reportandincludesit inan email to project stakeholders. / Throughout the project,the project manager communicates insights from thedata analytics teamand providesideas of ways to act on those insights. / At the end of the project,the project manager sendsan email with the predictivemodel to the stakeholdersso they can use it. / Throughout the project,the project managerholdsregular meetings so the entiredata analytics team can showcase their workto different departments.

A

Throughout the project, the project manager communicates insights from thedata analytics team and provides ideas of ways to act on those insights.

29
Q

What isan effectivemethodfora data analysttoprepare for a one-on-one meeting withamanager? Make awrittenlist ofallsource code comments / Ask other inside employees about themanager’s reputation / Bring a set of questionstodraw on to keep the conversation going / Create an essay summarizingsteps inthesourcecode

A

Bring a set of questions to draw on to keep the conversation going

30
Q

What is a characteristic of active listening? Activelyworking on a task while listeningto the speaker / Seeking to understandthe speaker’semotions and intent / Focusing intently on the content of the message / Waiting patiently to share one’s ownthoughts

A

Seeking to understand the speaker’s emotions and intent

31
Q

Which circumstance could cause a data analyst to have difficulty developing a model to answer a businessquestion? Project scope creep / Poor projectbudgeting / Lack ofrelevant datasources / Lack of stakeholder support

A

Lack of relevant data sources

32
Q

A data analytics project team is preparing to develop a predictive model that will beincluded within abusiness intelligence tool for upper management. Whichstep should be considered for inclusionwhen creating the project schedule? Model testing and validation for users / Businessintelligence tool interface training / Model training and testing for stakeholders / Business intelligence tool data transformation training

A

Business intelligence tool interface training

33
Q

Which taskwouldan analyst considerfirstduring the discovery phase of the data analyticslifecycle? Seek out necessary data sources. / Formulate a project plan. / Identifyprojectgoals. / Developkeymetrics.

A

Identify project goals.

34
Q

Numericalmeasurementsofthe amount ofa toxic chemical substance are recorded in a large database. Whichhypothesiscan thedata analystanswerthroughexploratory data analytic methods? The chemical will notcauseharmto thehabitat’snative species. / Thechemical contaminationis a result of human activity. / Thestatisticaldistribution of the chemicalmeasurementsisnormal. / The best analytic approach foranalyzingthe datais linear regression.

A

The statistical distribution of the chemical measurements is normal.

35
Q

A restaurant owner wants to sponsor a data analytics projecttoprovide insights regarding hamburgersales beforedevelopinga strategy for increasing sales. Which question is framed appropriately for the data analytics project? What are the characteristics of customers who buy hamburgers? / What does the supply and demand curve look like for hamburgers? / Whichdiscountcoupons should we send to neighborhood residents? / Whichvarieties of hamburgers are featured by competitors?

A

What are the characteristics of customers who buy hamburgers?

36
Q

Whichorganizationalobjectivecould be accomplishedwitha descriptivedataanalyticsprojectusingwebsite request logsas adata source? Explainwhy web data transfer has increased 25% / Estimatethe traffic increase for a new product launch / Improvethe speed ofserver requestprocessing / Recommenda strategy to increasenetwork capacity

A

Explain why web data transfer has increased 25%

37
Q

A travel website tabulated the results of their latest marketing campaign to understand the relationship ofclicks-to-sales conversions. Which area of analytics does this activity represent? Prescriptive / Proactive / Descriptive / Predictive

A

Descriptive

38
Q

An analyst is looking at data that includes the customer’s address, date of purchase,and age.Whichquestion could be answered from this data? Which customer hasspent the highest dollar amount? / Which customer is most likely to respond favorably to the next marketingcampaign? / Which state has the highest totalcustomers? / Which product has sold the most in a certain state?

A

Which state has the highest total customers?

39
Q

Which outcome should be expected when working with data aggregated from multiple sources? Select TWO answers.Consistently named fields / Inconsistently named fields / Data needs cleaning / Data does not need cleaning

A

Inconsistently named fields AND Data needs cleaning

40
Q

Which technique can a project manager use to foster the identification of quality data analytics questions? Organized project planning / Rigorous data cleaning / Frequent collaboration with the team / Acquisition of abundant project resources

A

Frequent collaboration with the team

41
Q

A data analyst notices that the data selected for an analytics project is slightly misaligned with theresearch question. How can the data analyst resolve this situation? Halt the data analytics project to pursue a new research question / Dive deeper into the data to identify data quality issues / Adjust the research question to reframe the analysis / Transform the data to a new metric

A

Adjust the research question to reframe the analysis

42
Q

An analyst has been asked to analyze the open-ended responses from customers on a satisfaction survey.Which type of data is the analyst working with on this project? Transactional / Secondary / Qualitative / Quantitative

A

Qualitative

43
Q

A U.S.company collects and sells information on consumers.Whichlawprevents thecompanyfrom collectinginformationonEuropean Unionconsumerswithouttheir permission? Electronic Communications Privacy Act / General Data Protection Regulation / Stored Communication Act / Information Nondiscrimination Act

A

General Data Protection Regulation

44
Q

A consumer sues an entertainment streaming company for leaking personal information regarding herviewing habits.Which ASA ethical standard did the streaming company violate? Conflict of interest / Biases / Privacy / Unfair discrimination

A

Privacy

45
Q

Aspecific drug is manufactured for the treatment of depression. The company decidestoignoreresearchresultson an alternative, less expensive,drug treatmentin order to makehigherprofits.WhichASAethical standardhasthe companyviolated? Unfair discrimination / Reproducible results / Conflict of interest / Transparent assumptions

A

Conflict of interest

46
Q

What do open-source software tools and widely available analysis tools, such as spreadsheets,helpaccomplish? Data schemas / Data democratization / Data security / Data compliance

A

Data democratization

47
Q

What is a feature of SQL? Choose TWOanswers.It is an object-oriented programming language. / The basic language is the same across databaseservers. / It has built-in chart and graph creation. / It is used with structured dataand unstructureddata.

A

The basic language is the same across database servers. AND It is used with structured data and unstructured data.

48
Q

Whatisan example of unstructured data? Names, dates,andaddresses / Credit card numbers that include a credit score / Text messages that include video / Height, weight, and gender

A

Text messages that include video

49
Q

Which tool should a researcher use to conduct a univariate analysis on complex statistical data? Tableau / Power BI / R / SQL

A

R

50
Q

Which statistical technique shouldbe usedto draw conclusions about an entire population based on arepresentative sample? Correlation / Bayes theorem / Hypothesis testing / Measures of central tendency

A

Hypothesis testing

51
Q

What is an example of random sampling ofcollegestudents? Surveying studentschosenarbitrarilyfrom around theentire college campus / Surveying every student in the college library / Surveying students chosen arbitrarily in the library of the university / Surveying every student on campus

A

Surveying students chosen arbitrarily from around the entire college campus

52
Q

Which type of analysis would be used to predict a binary outcome based on a set of independentvariables? Hypothesis testing / Descriptive statistics / Regression / Time Series

A

Regression

53
Q

Which type of data analysis is appropriate if the goal is to minimize the cost of a diet, using a data setconsisting of the following variables: protein content, fat content, and cost per unit? Decision trees / Calculus / Optimization / Bayes’ theorem

A

Optimization

54
Q

Whichtechniquecan be used todetermine thelikelihoodthat a positivediagnostictest resultindicateswhether thediseaseis actually present? Bayes’ theorem / Central limit theorem / Regression / Optimization

A

Bayes’ theorem

55
Q

Which conceptshould beconsidered when choosingvariablesfor inclusionin alinear regressionmodel? Feasibility ofmergingthevariables / Feasibility of controlling thevariables / Feasibility of testing thevariables / Feasibility ofclassifying thevariables

A

Feasibility of controlling the variables

56
Q

A neural network algorithm in machine learning endeavors to recognize underlying relationships in a setof data. What does this process mimic? The way a computer processes data / The way the human brain operates / The way architects establish functionality / The way that social media builds networks

A

The way the human brain operates

57
Q

Whichcharacteristics are used to group data together inacluster analysis? ChooseTWOanswers.Distance / Similarity / Shape / Size

A

Distance AND Similarity

58
Q

Which tool has libraries that expand its visualization capabilities? Python / Tableau / Adobe Infographics / D3.js

A

Python

59
Q

Which toolscanbe usedforperformingstatistics andcreatinginteractive datavisualizationforlargedatasetsfrom various sources? Choose TWO answers.Gantt Chart / SQL / Tableau / R

A

Tableau AND R

60
Q

Whichtype of data representation should adata analystuseto display expense categories as apercentageof total business expenses? Map visualization / Line chart / Pie chart / Scatter plot

A

Pie chart