16 Practice Exam One Flashcards

1
Q

What does a practice exam provide?

A

A good idea of what material you know well and what material you may need to review.

It simulates the experience of taking the actual exam.

2
Q

What is the average time per question in a 90-minute exam with 90 questions?

A

1 minute per question.

This requires efficient time management during the exam.

3
Q

Which option would be most appropriate for delivering a dashboard only once on a specific date?

A

D. Scheduled delivery

4
Q

A schema with denormalized tables is what type of schema?

A

C. Star schema

5
Q

What is the result of a right join on two tables?

A

Includes all records from the right table and matched records from the left, filling with NULLs where no match exists.

This is typical behavior of right joins in SQL.
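
A minimal sketch of that behavior in plain Python. The `employees` (left) and `sales` (right) tables and the `emp_id` key are hypothetical, and `None` stands in for SQL NULL.

```python
def right_join(left, right, key):
    """Keep every row of `right`; attach matching `left` rows, else None-filled columns."""
    # Columns that only the left table contributes (everything except the join key).
    left_cols = {col for row in left for col in row if col != key}
    # Index left rows by key so each right row can look up its matches.
    index = {}
    for row in left:
        index.setdefault(row[key], []).append(row)
    joined = []
    for r in right:
        matches = index.get(r[key])
        if matches:
            for l in matches:
                joined.append({**l, **r})
        else:
            # No match on the left: keep the right row and fill left columns with None.
            joined.append({**{col: None for col in left_cols}, **r})
    return joined


# Hypothetical example data.
employees = [{"emp_id": 1, "name": "Ana"}, {"emp_id": 2, "name": "Ben"}]
sales = [{"emp_id": 2, "amount": 50}, {"emp_id": 3, "amount": 75}]
result = right_join(employees, sales, "emp_id")
```

Here employee 1 has no sale and drops out, while the sale for the unknown employee 3 is kept with `name` set to `None`.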

6
Q

What represents the count of observations that falls into each category?

A

D. Frequency

7
Q

Forecasting falls into what general type of analysis?

A

D. Trend analysis

8
Q

Which analytical tool is considered a programming language?

A

B. Python

9
Q

A person’s credit card information is considered what type of protected data?

A

C. PCI

10
Q

What type of report is most appropriate to determine the relationship between an employee’s height and productivity?

A

D. An ad hoc report

11
Q

Which of the following contains the true mean?

A

A. The confidence interval

12
Q

When creating a new dashboard, what should be done first?

A

D. Create a mockup/wireframe

13
Q

What visualization is appropriate for showing revenue from each division and department?

A

C. A stacked bar chart

14
Q

What does updating a table by adding new values to the bottom exemplify?

A

D. Active record

15
Q

What type of error does a dataset with incorrect formatting represent?

A

D. Invalid data

16
Q

What type of visualization is best for explaining complicated relationships to a non-technical audience?

A

C. Infographic

17
Q

What does ETL stand for?

A

B. Extract, transform, load

18
Q

Which section of a data use agreement includes information on what happens if consent is withdrawn?

A

C. Data deletion

19
Q

Which schema is most appropriate for a database requiring the fewest number of joins?

A

B. A star schema

20
Q

What type of data error is characterized by having the same information recorded multiple times?

A

A. Duplicate data

21
Q

When checking for data quality, which factor is important?

A

C. Data accuracy

22
Q

What data validation approach should be taken to see if analysis results can be generalized?

A

D. Cross-validation

23
Q

What type of analysis gives basic information about the shape and structure of data?

A

C. Exploratory data analysis

24
Q

What is the alternative hypothesis when comparing two product designs?

A

B. There is a significant difference between Design A and Design B

25
What type of analysis is requested to check KPIs for schedule adherence?
B. Performance analysis
26
What is the variance of the dataset 56, 46, 27, 31, 40?
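C. 112.8 ## Footnote As listed, the five values give a population variance of 108.4 and a sample variance of 135.5, so the dataset on this card may be mis-transcribed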
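
A quick way to check a variance by hand is to compute both common definitions. This sketch uses Python's standard `statistics` module on the five values listed on the card; which definition a given exam expects is an assumption to confirm against the course material.

```python
import statistics

data = [56, 46, 27, 31, 40]

# Population variance: sum of squared deviations from the mean, divided by n.
population_variance = statistics.pvariance(data)

# Sample variance: the same sum of squares, divided by n - 1.
sample_variance = statistics.variance(data)

print(population_variance, sample_variance)
```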
27
Using the independent variable to predict the dependent variable describes what analysis?
B. A simple linear regression
28
What type of distribution is represented by a bell curve?
A. Normal
29
Which factor directly impacts the speed and efficiency of a dashboard?
B. Interactive saved searches
30
Which file type is used to pass data through websites without structural dependencies?
A. JSON
31
What is the mean of the dataset 39, 37, 42, 39, 43?
40 ## Footnote The values sum to 200, and 200 / 5 = 40; 39 is the median and the mode, not the mean
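
The three measures of central tendency are easy to confuse on this dataset, so a short check with Python's standard `statistics` module may help. This is only a sketch for verifying the arithmetic.

```python
import statistics

data = [39, 37, 42, 39, 43]

mean_value = statistics.mean(data)      # sum of values divided by the count
median_value = statistics.median(data)  # middle value once sorted
mode_value = statistics.mode(data)      # most frequent value

print(mean_value, median_value, mode_value)
```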
32
What data storage solution is most appropriate for large amounts of unstructured data?
B. Data lake
33
What analysis is appropriate to compare two baseball athletes' batting records?
T-test
34
What variables are appropriate data content for a report on robot performance?
Date, Distance(m), and RobotWeight(lb)
35
What analysis compares a sample to the population for representation?
Chi-square
36
What data quality dimension ensures data is reported in the same format?
C. Consistency
37
What type of report gives customer-facing agents the most up-to-date customer information?
C. A dynamic report
38
What characteristic defines structured data in a database?
B. Defined rows/columns
39
Which values will change when updating an old report with new data?
C. The reference dates
40
What should be the first consideration when choosing fonts for a report?
D. Branding
41
What type of report is appropriate for the sales department needing access to the latest sales data?
C. A self-service report
42
What type of deletion involves removing an entire row of data due to one value?
B. Listwise deletion
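
As a rough illustration of listwise deletion, the sketch below drops any row that has at least one missing value. The rows and column names are hypothetical, and `None` marks a missing value.

```python
# Hypothetical records; None marks a missing value.
rows = [
    {"id": 1, "age": 34, "income": 52000},
    {"id": 2, "age": None, "income": 48000},
    {"id": 3, "age": 29, "income": 61000},
]

# Listwise deletion: keep a row only if every one of its values is present.
complete_rows = [
    row for row in rows
    if all(value is not None for value in row.values())
]

print([row["id"] for row in complete_rows])
```

Row 2 is removed entirely even though only its `age` is missing, which is what distinguishes listwise deletion from approaches that keep the row.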
43
What is code or software that provides information about the environment called?
A. System functions
44
What are if, and, or, and not examples of?
D. Conditional operators
45
What is the process of filling gaps in data with average values called?
C. Imputation
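
A minimal sketch of mean imputation, assuming a hypothetical list of measurements where `None` marks a gap. Other fill strategies (median, model-based) exist and are not shown.

```python
import statistics

# Hypothetical measurements; None marks a missing value.
values = [10.0, None, 14.0, None, 12.0]

# Compute the mean from the observed values only.
observed = [v for v in values if v is not None]
fill_value = statistics.mean(observed)

# Replace each gap with that mean; leave observed values untouched.
imputed = [fill_value if v is None else v for v in values]

print(imputed)
```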
46
What kind of visualization does the following graph represent?
A stacked bar chart
47
Under what circumstances should you check the quality of your data?
A. Data transformation
48
What type of correlation is depicted when values move in the same direction?
C. Positive correlation
49
What analysis is appropriate to compare your exam results to classmates’ scores?
C. Z-score
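
A z-score expresses how many standard deviations a value sits from the mean. The sketch below uses hypothetical class scores and treats the class as the whole population (so it uses the population standard deviation); both choices are assumptions for illustration.

```python
import statistics

# Hypothetical exam scores for the whole class, including your own.
class_scores = [70, 75, 80, 85, 90]
my_score = 90

mean_score = statistics.mean(class_scores)
# Population standard deviation, since the class is treated as the full population.
std_dev = statistics.pstdev(class_scores)

# z = (value - mean) / standard deviation
z_score = (my_score - mean_score) / std_dev

print(round(z_score, 2))
```

A positive result means the score is above the class mean; a negative one means it is below.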
50
What type of data is characterized by a lack of structure?
A. Unstructured
51
What business requirement should be considered when making a report for a manager with little experience?
Views
52
How much has the average number of clicks per minute increased since last year if the average was 12.8 this year and 8.7 last year?
C. 47%
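
Percent change is the difference between the new and old values divided by the old value. A small sketch with the numbers from this card (the function name is just illustrative):

```python
def percent_change(old, new):
    """Return the change from `old` to `new` as a percentage of `old`."""
    return (new - old) / old * 100


# Averages from the card: 8.7 clicks per minute last year, 12.8 this year.
increase = percent_change(8.7, 12.8)

print(round(increase))
```

Dividing by this year's value instead of last year's is a common slip and gives a smaller figure.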
53
What type of report are compliance reports generally considered?
C. A recurring report
54
What is redundant data?
A. The same information recorded in multiple columns
55
What was the average number of clicks per minute calculated last year?
8.7
56
How much has the average number of clicks per minute increased since last year?
C. 47% ## Footnote Options include A. 35%, B. 38%, C. 47%, D. 52%
57
Compliance reports are generally considered what type of report?
C. A recurring report ## Footnote Options include A. An ad hoc report, B. A research report, C. A recurring report, D. A self-service report
58
What is redundant data?
A. The same information recorded in multiple columns ## Footnote Options include A. The same information recorded in multiple columns, B. Data that is incomplete or blank, C. The same information recorded in multiple rows, D. Data that does not meet formatting requirements
59
What type of distribution does the following visualization represent?
C. Non-parametric ## Footnote Options include A. Normal, B. Uniform, C. Non-parametric, D. Exponential
60
An IP address is considered what type of protected data?
A. PII ## Footnote Options include A. PII, B. PHI, C. PCI, D. PIFI
61
In a simple AB study, which p-value would cause you to reject the null hypothesis assuming an alpha of 0.05?
D. 0.008 ## Footnote Options include A. 0.3, B. 0.5, C. 0.06, D. 0.008
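
The decision rule behind this card can be sketched in a few lines: reject the null hypothesis only when the p-value is below alpha. The helper name `reject_null` is hypothetical.

```python
def reject_null(p_value, alpha=0.05):
    """Return True when the p-value is small enough to reject the null hypothesis."""
    return p_value < alpha


# The four options from the card.
options = {"A": 0.3, "B": 0.5, "C": 0.06, "D": 0.008}
rejecting = [label for label, p in options.items() if reject_null(p)]

print(rejecting)
```

Note that 0.06 is close to alpha but still above it, so only option D leads to rejection.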
62
If you had a variable called BirdCount that kept track of the number of birds that flew by your window on a specific day, what variable type would it be?
D. Discrete ## Footnote Options include A. Nominal, B. Continuous, C. Ordinal, D. Discrete
63
The act of automatically collecting, processing, and storing online transactions is called what?
D. OLTP ## Footnote Options include A. OLAP, B. ETL, C. ELT, D. OLTP
64
Breaking large chunks of data down into small, usable pieces is called what?
B. Parsing ## Footnote Options include A. Interpretation, B. Parsing, C. Reduction, D. Interpolation
65
A new variable that specifically holds a calculation of other variables is called what?
A. A derived variable ## Footnote Options include A. A derived variable, B. A binary variable, C. A variable deletion, D. An ordinal variable
66
What analysis is most appropriate to determine if a sample accurately reflects the larger population of customers?
C. The chi-square goodness of fit ## Footnote Options include A. The chi-square test for independence, B. The chi-square test for homogeneity, C. The chi-square goodness of fit, D. The chi-square test for linearity
67
What type of analysis would be most appropriate to determine the connection between customer age and purchase likelihood?
C. Link analysis ## Footnote Options include A. Trend analysis, B. Performance analysis, C. Link analysis, D. Exploratory data analysis
68
What is the most appropriate visualization to track ROI over the past 4 years?
A. A line chart ## Footnote Options include A. A line chart, B. A scatter plot, C. A heat map, D. A bubble chart
69
Where on a dashboard do you put instructions for how to use it?
B. The cover page ## Footnote Options include A. The appendix, B. The cover page, C. The FAQs, D. The reference dates
70
What conclusion can you draw from the following visualization?
D. There are at least 60 employees in the company ## Footnote Options include A. This accounts for every employee in the company, B. Administration is the smallest department, C. Administration is the biggest department, D. There are at least 60 employees in the company
71
What kind of visualization does the following graph represent?
D. A bubble chart ## Footnote Options include A. A waterfall chart, B. A scatter plot, C. A pie chart, D. A bubble chart
72
A data point that is so much smaller than every other data point that it drastically lowers the average is an example of what?
C. An outlier ## Footnote Options include A. A specification mismatch, B. Duplicate data, C. An outlier, D. Data type validation
73
Which section of a data use agreement includes information on the consequences of using the data improperly?
A. The acceptable use policy ## Footnote Options include A. The acceptable use policy, B. Data processing, C. Data deletion, D. Data retention
74
What type of join is represented by the following example?
B. Inner Join ## Footnote Options include A. Outer join, B. Inner Join, C. Left Join, D. Right Join
75
When is an ideal time to implement MDM?
C. When a company is purchased ## Footnote Options include A. When data is manipulated, B. When data is transferred, C. When a company is purchased, D. When data is transformed
76
Which variable indicates that a row is the most recent value?
B. Active Record ## Footnote Options include A. Magic Number, B. Active Record, C. Active Start, D. Active End
77
A large data storage solution for relational data, focusing on efficiency and following a snowflake schema, would most likely be what?
A. A data warehouse ## Footnote Options include A. A data warehouse, B. A data mart, C. A data lake, D. A data puddle
78
What concept does the following dataset exemplify?
C. Dummy coding ## Footnote Options include A. Recoding a number into a category, B. Recoding a category into a number, C. Dummy coding, D. Transposition
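
Since the card's dataset is not reproduced here, the following is only a generic sketch of dummy coding on a hypothetical `colors` column. It creates one 0/1 indicator per category; dropping one category as a reference level is a common variant that is not shown.

```python
def dummy_code(values):
    """Turn a list of category labels into rows of 0/1 indicator columns."""
    categories = sorted(set(values))
    return [
        {f"is_{category}": int(value == category) for category in categories}
        for value in values
    ]


# Hypothetical categorical column.
colors = ["red", "blue", "red", "green"]
encoded = dummy_code(colors)

print(encoded[0])
```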
79
What is the range of the following dataset: 25, 38, 50, 49, 38?
A. 25 ## Footnote Options include A. 25, B. 38, C. 40, D. 50
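
The range is the largest value minus the smallest. A one-line check on the card's dataset:

```python
data = [25, 38, 50, 49, 38]

# Range = maximum - minimum.
data_range = max(data) - min(data)

print(data_range)
```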
80
What type of database schema does the following diagram represent?
B. A star schema ## Footnote Options include A. A snowflake schema, B. A star schema, C. A galaxy schema, D. A fast constellation schema
81
What type of access requirement is it if a database can only be accessed by people with the job title data analyst?
C. Role-based ## Footnote Options include A. Data encryption, B. User group-based, C. Role-based, D. Data transmission
82
How do you interpret a p-value of 0.4 in a study about a memory-enhancing drug assuming an alpha of 0.05?
B. Reject the alternative hypothesis and accept the null hypothesis ## Footnote Options include A. Accept the alternative hypothesis and reject the null hypothesis, B. Reject the alternative hypothesis and accept the null hypothesis, C. Accept the alternative and null hypotheses, D. Reject the alternative and null hypotheses
83
What is a prewritten query that allows the user to enter specific information to target data?
B. Parameterization ## Footnote Options include A. Index, B. Parameterization, C. Filter, D. Sort
84
What do you call the process of checking the data type of a variable to avoid errors?
A. Data type validation ## Footnote Options include A. Data type validation, B. Imputation, C. Specification mismatch, D. Interpolation
85
What sort of connection does a piece of code that requests information from an API and waits for a response have?
C. Synchronous ## Footnote Options include A. Structured, B. Unstructured, C. Synchronous, D. Asynchronous
86
What sort of cardinality do the following tables have? Employee Information and Sales Information
B. One-to-many ## Footnote Options include A. One-to-one, B. One-to-many, C. Many-to-many, D. There is no entity relationship
87
Which data validation approach should you take to see whether a dataset is appropriate for a specific goal?
B. Data audits ## Footnote Options include A. Reasonable expectations, B. Data audits, C. Data profiling, D. Cross-validation
88
What conclusion can you draw from the following visualization?
C. The taller you are, the more likely you are to have a larger hat size ## Footnote Options include A. Being taller makes your head bigger, B. Having a small head makes you taller, C. The taller you are, the more likely you are to have a larger hat size, D. The taller you are, the more likely you are to have a smaller hat size
89
A WAV file holds what type of information?
C. Audio ## Footnote Options include A. Image, B. Video, C. Audio, D. Text
90
What is the process of using code to automatically collect data from websites called?
A. Web scraping ## Footnote Options include A. Web scraping, B. Web services, C. Non-relational, D. Semi-structured
91
What type of data is the following dataset an example of?
C. Both structured and relational ## Footnote Options include A. Structured, B. Relational, C. Both structured and relational, D. Neither structured nor relational
92
Which of the following is a key process of MDM?
A. Data dictionary ## Footnote Options include A. Data dictionary, B. Data encryption, C. Data transformation, D. Data manipulation
93
What action has been performed on a dataset in the following depiction?
D. Sorting ## Footnote Options include A. Filtering, B. Subsets, C. Parameterization, D. Sorting
94
What type of error occurs if you wrongly conclude that a software update made a program faster?
A. Type I ## Footnote Options include A. Type I, B. Type II, C. Type III, D. Type IV
95
What is the answer to the question regarding interactive saved searches?
Interactive saved searches ## Footnote Review Chapter 12, Reporting Process – Understanding Report Delivery
96
What data format is mentioned in relation to data structures?
JSON ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Going through Data Types and File Types
97
What is the mean of the dataset 39, 37, 42, 39, 43?
40 ## Footnote Review Chapter 7, Measures of Central Tendency and Dispersion – Understanding Measures of Central Tendency
98
What type of data storage is referred to as a data lake?
Data lake ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Understanding the Concept of Warehouses and Lakes
99
Which statistical test is mentioned?
T-test ## Footnote Review Chapter 10, Introduction to Inferential Statistics – Understanding t-Tests
100
What are the variables listed related to reporting?
Date, Distance(m), and RobotWeight(lb) ## Footnote Review Chapter 12, Reporting Process – Knowing what to Consider when Making a Report
101
What statistical test is associated with the term chi-square?
Chi-square ## Footnote Review Chapter 10, Introduction to Inferential Statistics – Knowing Chi-Square
102
What concept is described as consistency in data quality?
Consistency ## Footnote Review Chapter 15, Data Quality and Management – Understanding Quality Control
103
What type of report is described as dynamic?
A dynamic report ## Footnote Review Chapter 11, Types of Reports – Distinguishing Static and Dynamic Reports
104
What defines structured data?
Defined rows/columns ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data
105
What are reference dates in reporting?
The reference dates ## Footnote Review Chapter 12, Reporting Process – Understanding Report Elements
106
What aspect of reporting is related to branding?
Branding ## Footnote Review Chapter 12, Reporting Process – Designing Reports
107
What is a self-service report?
A self-service report ## Footnote Review Chapter 11, Types of Reports – Knowing about Self-Service Reports
108
What method is used to deal with missing data?
Listwise deletion ## Footnote Review Chapter 4, Cleaning and Processing Data – Dealing with Missing Data
109
What functions are used in data shaping?
System functions ## Footnote Review Chapter 5, Data Wrangling and Manipulation – Shaping Data with Common Functions
110
What type of operators are used in data manipulation?
Conditional operators ## Footnote Review Chapter 5, Data Wrangling and Manipulation – Shaping Data with Common Functions
111
What technique is used to fill in missing data?
Imputation ## Footnote Review Chapter 4, Cleaning and Processing Data – Dealing with Missing Data
112
What type of chart is described as a stacked bar chart?
A stacked bar chart ## Footnote Review Chapter 13, Common Visualizations – Comprehending Charts with Bars
113
What process is involved in data quality management?
Data transformation ## Footnote Review Chapter 15, Data Quality and Management – Understanding Quality Control
114
What correlation type is mentioned?
Positive correlation ## Footnote Review Chapter 10, Introduction to Inferential Statistics – Calculating Correlations
115
What statistic measures standard deviations from the mean?
Z-score ## Footnote Review Chapter 8, Common Techniques in Descriptive Statistics – Understanding z-Scores
116
What type of data structure is referred to as relational?
Relational ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data
117
What are views in reporting?
Views ## Footnote Review Chapter 12, Reporting Process – Knowing what to Consider when Making a Report
118
What is the percent increase from an average of 8.7 clicks per minute last year to 12.8 this year?
47% ## Footnote Review Chapter 8, Common Techniques in Descriptive Statistics – Calculating Percent Change and Percent Difference
119
What is defined as a recurring report?
A recurring report ## Footnote Review Chapter 11, Types of Reports – Understanding Recurring Reports
120
What is described as the same information recorded in multiple columns?
The same information recorded in multiple columns ## Footnote Review Chapter 4, Cleaning and Processing Data – Managing Duplicate and Redundant Data
121
What type of data is referred to as non-parametric?
Non-parametric ## Footnote Review Chapter 4, Cleaning and Processing Data – Understanding Non-parametric Data
122
What does PII stand for?
Personally identifiable information ## Footnote Review Chapter 14, Data Governance – Understanding Data Classifications
123
What is the p-value in hypothesis testing mentioned?
0.008 ## Footnote Review Chapter 9, Hypothesis Testing – Learning p-Value and Alpha
124
What type of data is described as discrete?
Discrete ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Going through Data Types and File Types
125
What does OLTP stand for?
Online transaction processing ## Footnote Review Chapter 3, Collecting Data – Understanding OLTP and OLAP
126
What process involves parsing data?
Parsing ## Footnote Review Chapter 5, Data Wrangling and Manipulation – Parsing Your Data
127
What is referred to as a derived variable?
A derived variable ## Footnote Review Chapter 5, Data Wrangling and Manipulation – Calculating Derived and Reduced Variables
128
What test is known as the chi-square goodness of fit?
The chi-square goodness of fit ## Footnote Review Chapter 10, Introduction to Inferential Statistics – Knowing Chi-Square
129
What type of analysis is mentioned as link analysis?
Link analysis ## Footnote Review Chapter 6, Types of Analytics – Finding Links
130
What type of chart is described as a line chart?
A line chart ## Footnote Review Chapter 13, Common Visualizations – Charting Lines, Circles, and Dots
131
What is the cover page in reporting?
The cover page ## Footnote Review Chapter 12, Reporting Process – Understanding Report Elements
132
What statement is made about the number of employees in a company?
There are at least 60 employees in the company ## Footnote Review Chapter 13, Common Visualizations – Comprehending Charts with Bars
133
What type of chart is described as a bubble chart?
A bubble chart ## Footnote Review Chapter 13, Common Visualizations – Charting Lines, Circles, and Dots
134
What is an outlier in data analysis?
An outlier ## Footnote Review Chapter 4, Cleaning and Processing Data – Finding Outliers
135
What policy is related to acceptable use in data governance?
The acceptable use policy ## Footnote Review Chapter 14, Data Governance – Knowing Use Requirements
136
What type of join is mentioned?
Inner join ## Footnote Review Chapter 5, Data Wrangling and Manipulation – Merging Data
137
What occurs when a company is purchased?
When a company is purchased ## Footnote Review Chapter 15, Data Quality and Management – Understanding Master Data Management (MDM)
138
What is referred to as active record?
Active Record ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Updating Stored Data
139
What is defined as a data warehouse?
A data warehouse ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Understanding the Concept of Warehouses and Lakes
140
What coding technique is referred to as dummy coding?
Dummy coding ## Footnote Review Chapter 5, Data Wrangling and Manipulation – Recoding Variables
141
What is the range of the dataset 25, 38, 50, 49, 38?
25 ## Footnote Review Chapter 7, Measures of Central Tendency and Dispersion – Calculating Range and Quartiles
142
What type of schema is referred to as a star schema?
A star schema ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Going through the Data Schema and its Types
143
What is described as role-based in data governance?
Role-based ## Footnote Review Chapter 14, Data Governance – Understanding Data Security
144
What decision is made regarding the null hypothesis?
Reject the alternative hypothesis and accept the null hypothesis ## Footnote Review Chapter 9, Hypothesis Testing – Learning p-Value and Alpha
145
What process is referred to as parameterization?
Parameterization ## Footnote Review Chapter 3, Collecting Data – Optimizing Query Structure
146
What is the term for validating data types?
Data type validation ## Footnote Review Chapter 4, Cleaning and Processing Data – Understanding Invalid Data, Specification Mismatch, and Data Type Validation
147
What is described as synchronous in data collecting?
Synchronous ## Footnote Review Chapter 3, Collecting Data – Utilizing Public Sources of Data
148
What type of relationship is referred to as one-to-many?
One-to-many ## Footnote Review Chapter 14, Data Governance – Handling Entity Relationship Requirements
149
What process is involved in validating data quality?
Data audits ## Footnote Review Chapter 15, Data Quality and Management – Validating Quality
150
What statement is made about height and hat size?
The taller you are the more likely you are to have a larger hat size ## Footnote Review Chapter 13, Common Visualizations – Charting Lines, Circles, and Dots
151
What type of data is referred to as audio?
Audio ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Going through Data Types and File Types
152
What method is used for collecting your own data?
Web scraping ## Footnote Review Chapter 3, Collecting Data – Collecting Your Own Data
153
What type of data structure is both structured and relational?
Both structured and relational ## Footnote Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data
154
What is a data dictionary?
Data dictionary ## Footnote Review Chapter 15, Data Quality and Management – Understanding Master Data Management (MDM)
155
What process is referred to as sorting?
Sorting ## Footnote Review Chapter 3, Collecting Data – Optimizing Query Structure
156
What type of error is defined as Type I?
Type I ## Footnote Review Chapter 9, Hypothesis Testing – Understanding Type I and Type II Error