17 Practice Exam Two Flashcards

1
Q

What does EDA stand for?

A

Exploratory data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

If the result was a type II error, what was your conclusion?

A

Gerbils and hamsters can lift the same amount

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of error does the following dataset represent?

A

Duplicate data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which of the following represents the percent of observations in each category as compared to the whole?

A

Percentage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the interpretation of a p-value of 0.04 assuming an alpha of 0.05?

A

Accept the alternative hypothesis and reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The idea that there will be no difference between the performance of two groups is what kind of hypothesis?

A

Null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which visualization would be most appropriate for the relationship between the weight of a ferret and milk production?

A

A scatter plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A flat file delimited by commas is what file type?

A

CSV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which element should never be on the cover page of a report?

A

The appendix

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data type validation is a process specifically used to avoid what type of error?

A

Invalid data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an appropriate title for the following chart?

A

The Population of India Averaged for the Years 2015, 2016, and 2017 as Sub-Divided by Geographical Regions Determined by the 2018 Land Survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does it mean for a dashboard to be real-time?

A

It has the absolute most up-to-date rates and figures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the act of automatically moving and analyzing online transactions called?

A

OLTP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the following code snippet represent? Data = ‘This book makes me happy.’ Data = [‘This’, ‘book’, ‘makes’, ‘me’, ‘happy’, ‘.’]

A

Parsing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Which of the following is a valid data storage solution for audio files?

A

A data lake

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What type of analysis is most appropriate for checking the efficiency of each phase of a production process?

A

Performance analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Who is the most appropriate audience for a detailed report on grain-to-egg efficiency ratios?

A

Technical experts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What would be the result of an outer join on the provided tables?

A

Joined Table with NULLs for unmatched records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What type of report is most appropriate for a project manager at the end of every sprint?

A

A recurring report

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Find the mode of the following dataset: 5, 3, 8, 5, 3, 9, 3, 8, 2

A

3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What type of analysis is most appropriate for examining the connection between hours worked and mistakes made?

A

Link analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What means of updating a table is represented by adding new values to the bottom?

A

Active record

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What conclusion can you draw from the following visualization?

A

Around 350 students achieved a grade of C or higher

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What type of schema consists only of normalized tables?

A

A snowflake schema

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the following dataset an example of?

A

Recoding a category into a number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is a detailed program that explains how software performs a specific query called?

A

An execution plan

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What conclusion can you draw from the following visualization regarding data access?

A

Half of everyone who can access data is either a marketing analyst or a business analyst

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

A small, highly specialized data storage solution following a star schema would most likely be what?

A

A data mart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What type of report is most appropriate for a detailed report on a potential merger?

A

An ad hoc report

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Which type of schema has two levels of dimension tables?

A

A galaxy schema

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Which variable indicates when a variable stopped being active?

A

Active End

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is a key process of MDM?

A

Data consolidation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What type of visualization would be most appropriate for displaying the population of Europe by country?

A

A geographic map

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What type of error does the following dataset represent?

A

Invalid data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What type of survey question does the following screenshot represent?

A

Single choice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Which of the following is a conditional operator?

A

OR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is something to consider when checking for data quality?

A

Data integrity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What data-validating approach should you take if you believe the results of an analysis to be in error?

A

Data audits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Find the standard deviation of the following dataset: 62, 92, 43, 66, 37

A

21.7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is the most appropriate visualization for expressing ideas held within a text file using natural language processing?

A

A word cloud

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is the single most important thing to do if you suspect that private data might have been breached?

A

Notify the impacted parties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What type of data is represented by the following dataset?

A

Structured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What should you do immediately after planning out your data story when creating a dashboard?

A

Get approval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

In an A/B study, which p-value would cause you to accept the null hypothesis assuming an alpha of 0.1?

A

0.09

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Which analysis is most appropriate for comparing the age of a customer persona against the normally distributed ages of actual customers?

A

Z-score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What sort of cardinality do the provided employee tables have?

A

One-to-one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What variable type would a variable called BirdPassed that tracks whether a bird passed by your window be?

A

Binary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is the difference between your average clicks per minute and your competitor’s?

A

7%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is the average number of clicks per minute for the year for your website?

A

12.8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is the average number of clicks per minute for the year for a major competitor?

A

13.7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is the difference in clicks per minute between your website and your competitor’s website?

A

9%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Given a t-value of 1.86, what is the confidence interval for the dataset 8, 7, 8, 8, 10, 6, 8, 8, 9, 8?

A

7.6 to 9.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What type of join is represented in the following example?

A

Outer join

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What analysis specifically tells you whether or not two categorical variables are related?

A

Chi-square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

A social media ID is considered what type of protected data?

A

PII

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What is duplicate data?

A

The same information recorded in multiple rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What type of error is represented by a specification mismatch?

A

Specification mismatch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Which of the following is considered a public source of data?

A

Web scraping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Average, sum, and count are all examples of what?

A

Reduction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Under which circumstances should you check the quality of your data?

A

Data acquisition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What data-validating approach should you take if you need a formal process to apply to an entire database?

A

Data audits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What analysis is most appropriate to predict how wide a tree must be to hold a 400 lb sumo wrestler?

A

Simple linear regression

63
Q

Making sure your data is not full of gaps and missing data is considered what data quality dimension?

A

Completeness

64
Q

The following chart represents what type of distribution?

65
Q

A person’s medical record is considered what type of protected data?

66
Q

In general, dashboards are considered what type of report?

A

A self-service report

67
Q

What type of analysis is most appropriate for predicting future values based on historical data?

A

Trend analysis

68
Q

Which analytical tool is specialized for visualizations?

A

AWS QuickSight

69
Q

What is the most appropriate data range for a report on machine efficiency at the end of the week?

70
Q

Deleting only the missing values and only as they are needed is what type of deletion?

A

Pairwise deletion

71
Q

What type of report is most appropriate when requested for a one-time business question?

A

A static report

72
Q

What part of the dashboard should you update to save time if you receive repeated questions?

73
Q

Unstructured databases include which of the following data types?

A

Undefined fields and machine data

74
Q

Which file type can be used to structure a website or pass data through a website?

75
Q

Find the middle quartile (Q2) of the dataset 70, 21, 34, 48, 27.

76
Q

Watching things and taking notes as a form of data collection is called what?

A

Observation

77
Q

What is the name of the action performed on a dataset when sorting?

78
Q

What do you call the process of filling gaps in the data by calculating the most likely value?

A

Imputation

79
Q

How do nonparametric distributions relate to normal distributions?

A

Nonparametric distributions are sometimes normal

80
Q

What type of analysis would be most appropriate to analyze the relationship between an employee’s job title and hair color?

A

Chi-square test for independence

81
Q

What happens during a delta load?

A

Only load information that is new or has changed

82
Q

What type of database schema is represented by a snowflake schema?

A

A snowflake schema

83
Q

What type of visualization is most appropriate for showing the distribution of shirt sizes sold?

A

A histogram

84
Q

Which of the following would you find in a structured database?

A

Key-value pairs

85
Q

What analysis compares quantitative variables to see whether there is a relationship between them?

A

Correlation

86
Q

What security process is described by translating data from plaintext to cyphertext?

A

Data encryption

87
Q

What type of analysis compares two groups of quantitative variables to determine significant differences?

88
Q

What is a major benefit of MDM?

A

Streamlining data access

89
Q

What is the most suitable approach for creating a dashboard that automatically refreshes weekly?

A

Scheduled delivery

90
Q

Which section of the data use agreement includes information on data destruction?

A

Data deletion

91
Q

What is an execution plan?

A

An execution plan

Review Chapter 3, Collecting Data – Optimizing Query Structure

92
Q

Who comprises half of everyone who can access data?

A

Marketing analyst or business analyst

Review Chapter 13, Common Visualizations – Charting Lines, Circles, and Dots

93
Q

What is a data mart?

A

A data mart

Review Chapter 2, Data Structures, Types, and Formats – Understanding the Concept of Warehouses and Lakes

94
Q

What is a research report?

A

A research report

Review Chapter 11, Types of Reports – Understanding Ad hoc and Research Reports

95
Q

What schema type is described as a snowflake schema?

A

A snowflake schema

Review Chapter 2, Data Structures, Types, and Formats – Going Through the Data Schema and its Types

96
Q

What is the term for the end of an active data process?

A

Active End

Review Chapter 2, Data Structures, Types, and Formats – Updating Stored Data

97
Q

What is data consolidation?

A

Data consolidation

Review Chapter 15, Data Quality and Management – Understanding Master Data Management (MDM)

98
Q

What type of visualization is a geographic map?

A

A geographic map

Review Chapter 13, Common Visualizations – Understanding Heat Maps, Tree Maps, and Geographic Maps

99
Q

What does specification mismatch refer to?

A

Specification mismatch

Review Chapter 4, Cleaning and Processing Data – Understanding Invalid Data, Specification Mismatch, and Data Type Validation

100
Q

What type of question is a single choice question?

A

Single choice

Review Chapter 3, Collecting Data – Collecting Your Own Data

101
Q

What is the logical operator represented by ‘OR’?

A

OR

Review Chapter 5, Data Wrangling and Manipulation – Shaping Data with Common Functions

102
Q

What is data integrity?

A

Data integrity

Review Chapter 15, Data Quality and Management – Understanding Quality Control

103
Q

What are reasonable expectations in data quality?

A

Reasonable expectations

Review Chapter 15, Data Quality and Management – Validating Quality

104
Q

What is the variance value mentioned?

A

21.7

Review Chapter 7, Measures of Central Tendency and Dispersion – Finding Variance and Standard Deviation

105
Q

What type of visualization is a word cloud?

A

A word cloud

Review Chapter 13, Common Visualizations – Understanding Infographics and Word Clouds

106
Q

What should be done when data issues arise?

A

Notify the impacted parties

Review Chapter 14, Data Governance – Knowing Use Requirements

107
Q

What data type is structured?

A

Structured

Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data

108
Q

What is the first step in the report development process?

A

Get approval

Review Chapter 12, Reporting Process – Understanding the Report Development Process

109
Q

What is the p-value mentioned?

A

0.3

Review Chapter 9, Hypothesis Testing – Learning p-Value and Alpha

110
Q

What is a Z-score?

A

Z-score

Review Chapter 8, Common Techniques in Descriptive Statistics – Understanding Z-Scores

111
Q

What type of relationship is one-to-one?

A

One-to-one

Review Chapter 14, Data Governance – Handling Entity Relationship Requirements

112
Q

What data type is binary?

A

Binary

Review Chapter 2, Data Structures, Types, and Formats – Going Through Data Types and File Types

113
Q

What is the percentage mentioned?

A

7%

Review Chapter 8, Common Techniques in Descriptive Statistics – Calculating Percent Change and Percent Difference

114
Q

What is the confidence interval range provided?

A

7.3 to 8.7

Review Chapter 8, Common Techniques in Descriptive Statistics – Discovering Confidence Intervals

115
Q

What type of join is a left join?

A

Left join

Review Chapter 5, Data Wrangling and Manipulation – Merging Data

116
Q

What statistical test is known as Chi-square?

A

Chi-square

Review Chapter 10, Introduction to Inferential Statistics – Knowing Chi-Square

117
Q

What does PII stand for?

A

PII

Review Chapter 14, Data Governance – Understanding Data Classifications

118
Q

What does duplicate data refer to?

A

The same information recorded in multiple rows

Review Chapter 4, Cleaning and Processing Data – Managing Duplicate and Redundant Data

119
Q

What is an outlier?

A

An outlier

Review Chapter 4, Cleaning and Processing Data – Finding Outliers

120
Q

What are web services in data collection?

A

Web services

Review Chapter 3, Collecting Data – Utilizing Public Sources of Data

121
Q

What does reduction refer to in data manipulation?

A

Reduction

Review Chapter 5, Data Wrangling and Manipulation – Calculating Derived and Reduced Variables

122
Q

What is data acquisition?

A

Data acquisition

Review Chapter 15, Data Quality and Management – Understanding Quality Control

123
Q

What is data profiling?

A

Data profiling

Review Chapter 15, Data Quality and Management – Validating Quality

124
Q

What is simple linear regression?

A

Simple linear regression

Review Chapter 10, Introduction to Inferential Statistics – Simple Linear Regression

125
Q

What is transposition in data manipulation?

A

Transposition

Review Chapter 5, Data Wrangling and Manipulation – Shaping Data with Common Functions

126
Q

What does completeness refer to in data quality?

A

Completeness

Review Chapter 15, Data Quality and Management – Understanding Quality Control

127
Q

What is the distribution type mentioned?

A

Uniform

Review Chapter 7, Measures of Central Tendency and Dispersion – Discovering Distributions

128
Q

What does PHI stand for?

A

PHI

Review Chapter 14, Data Governance – Understanding Data Classifications

129
Q

What is a self-service report?

A

A self-service report

Review Chapter 11, Types of Reports – Knowing about Self-Service Reports

130
Q

What type of analysis is trend analysis?

A

Trend analysis

Review Chapter 6, Types of Analytics – Discovering Trends

131
Q

What analytical tool is AWS QuickSight?

A

AWS QuickSight

Review Chapter 11, Types of Reports – Knowing Important Analytical Tools

132
Q

What time unit is mentioned for making a report?

A

Weeks

Review Chapter 12, Reporting Process – Knowing What to Consider When Making a Report

133
Q

What method is known as pairwise deletion?

A

Pairwise deletion

Review Chapter 4, Cleaning and Processing Data – Dealing with Missing Data

134
Q

What is a static report?

A

A static report

Review Chapter 11, Types of Reports – Distinguishing Static and Dynamic Reports

135
Q

What are the FAQs in reporting?

A

The FAQs

Review Chapter 12, Reporting Process – Understanding Report Elements

136
Q

What are undefined fields and machine data?

A

Undefined fields and machine data

Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data

137
Q

What data format is XML?

A

XML

Review Chapter 2, Data Structures, Types, and Formats – Going Through Data Types and File Types

138
Q

What is the range value mentioned?

A

34

Review Chapter 7, Measures of Central Tendency and Dispersion – Calculating Range and Quartiles

139
Q

What is an observation in data collection?

A

Observation

Review Chapter 3, Collecting Data – Collecting Your Own Data

140
Q

What does filtering refer to in data collection?

A

Filtering

Review Chapter 3, Collecting Data – Optimizing Query Structure

141
Q

What is interpolation in data processing?

A

Interpolation

Review Chapter 4, Cleaning and Processing Data – Dealing with Missing Data

142
Q

Are nonparametric distributions ever normal?

A

Nonparametric distributions are never normal

Review Chapter 4, Cleaning and Processing Data – Understanding Non-Parametric Data

143
Q

What is the Chi-square test for independence?

A

Chi-square test for independence

Review Chapter 10, Introduction to Inferential Statistics – Knowing Chi-Square

144
Q

What does it mean to only load new or changed information?

A

Only load information that is new or has changed

Review Chapter 3, Collecting Data – Differentiating ETL and ELT

145
Q

What is a snowflake schema?

A

A snowflake schema

Review Chapter 2, Data Structures, Types, and Formats – Going Through the Data Schema and its Types

146
Q

What is a histogram?

A

A histogram

Review Chapter 13, Common Visualizations – Comprehending Charts with Bars

147
Q

What are key-value pairs?

A

Key-value pairs

Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data

148
Q

What is correlation in statistics?

A

Correlation

Review Chapter 10, Introduction to Inferential Statistics – Calculating Correlations

149
Q

What is data encryption?

A

Data encryption

Review Chapter 14, Data Governance – Understanding Data Security

150
Q

What is a T-test?

A

T-test

Review Chapter 10, Introduction to Inferential Statistics – Understanding T-Tests

151
Q

What does streamlining data access refer to?

A

Streamlining data access

Review Chapter 15, Data Quality and Management – Understanding Master Data Management (MDM)

152
Q

What is the term for subscription in reporting?

A

Subscription

Review Chapter 12, Reporting Process – Understanding Report Delivery

153
Q

What does data deletion refer to?

A

Data deletion

Review Chapter 14, Data Governance – Knowing Use Requirements