16 Practice Exam One Flashcards
What does a practice exam provide?
A good idea of what material you know well and what material you may need to review.
It simulates the experience of taking the actual exam.
What is the average time per question in a 90-minute exam with 90 questions?
1 minute per question.
This requires efficient time management during the exam.
Which option would be most appropriate for delivering a dashboard only once on a specific date?
D. Scheduled delivery
A schema with denormalized tables is what type of schema?
C. Star schema
What is the result of a right join on two tables?
Includes all records from the right table and matched records from the left, filling with NULLs where no match exists.
This is typical behavior of right joins in SQL.
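A minimal sketch of the right-join behavior described above, written in plain Python rather than SQL. The `customers` and `orders` tables and the `right_join` helper are hypothetical names used only for illustration; `None` stands in for SQL `NULL`.

```python
# Hypothetical tables: each row is a dict keyed by column name.
customers = [  # left table
    {"customer_id": 1, "name": "Ana"},
    {"customer_id": 2, "name": "Ben"},
]
orders = [  # right table
    {"order_id": 10, "customer_id": 1},
    {"order_id": 11, "customer_id": 3},  # no matching customer
]


def right_join(left, right, key):
    """Keep every right row; attach matching left rows, or None where no match exists."""
    left_columns = {col for row in left for col in row if col != key}
    joined = []
    for r in right:
        matches = [l for l in left if l[key] == r[key]]
        if not matches:
            # No match: fill the left-side columns with None (SQL NULL).
            matches = [{col: None for col in left_columns}]
        for l in matches:
            joined.append({**l, **r})
    return joined


result = right_join(customers, orders, "customer_id")
for row in result:
    print(row)
```

Every order appears in the result, while the customer with no order (Ben) is dropped, which is what distinguishes a right join from a left join on the same tables.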
What represents the count of observations that falls into each category?
D. Frequency
Forecasting falls into what general type of analysis?
D. Trend analysis
Which analytical tool is considered a programming language?
B. Python
A person’s credit card information is considered what type of protected data?
C. PCI
What type of report is most appropriate to determine the relationship between an employee’s height and productivity?
D. An ad hoc report
Which of the following contains the true mean?
A. The confidence interval
When creating a new dashboard, what should be done first?
D. Create a mockup/wireframe
What visualization is appropriate for showing revenue from each division and department?
C. A stacked bar chart
What does updating a table by adding new values to the bottom exemplify?
D. Active record
What type of error does a dataset with incorrect formatting represent?
D. Invalid data
What type of visualization is best for explaining complicated relationships to a non-technical audience?
C. Infographic
What does ETL stand for?
B. Extract, transform, load
Which section of a data use agreement includes information on what happens if consent is withdrawn?
C. Data deletion
Which schema is most appropriate for a database requiring the fewest number of joins?
B. A star schema
What type of data error is characterized by having the same information recorded multiple times?
A. Duplicate data
When checking for data quality, which factor is important?
C. Data accuracy
What data validation approach should be taken to see if analysis results can be generalized?
D. Cross-validation
What type of analysis gives basic information about the shape and structure of data?
C. Exploratory data analysis
What is the alternative hypothesis when comparing two product designs?
B. There is a significant difference between Design A and Design B
What type of analysis is requested to check KPIs for schedule adherence?
B. Performance analysis
What is the variance of the dataset 56, 46, 27, 31, 40?
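108.4 as a population variance: the mean is 40, the squared deviations sum to 542, and 542 / 5 = 108.4. As a sample variance, 542 / 4 = 135.5.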
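A short sketch for checking variance questions like this one with Python's standard `statistics` module. Which convention a given exam expects (dividing by n or by n - 1) is an assumption worth confirming, so both are computed here.

```python
import statistics

data = [56, 46, 27, 31, 40]

# Population variance divides the sum of squared deviations by n.
population_variance = statistics.pvariance(data)

# Sample variance divides the same sum by n - 1.
sample_variance = statistics.variance(data)

print(population_variance, sample_variance)
```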
Using the independent variable to predict the dependent variable describes what analysis?
B. A simple linear regression
What type of distribution is represented by a bell curve?
A. Normal
Which factor directly impacts the speed and efficiency of a dashboard?
B. Interactive saved searches
Which file type is used to pass data through websites without structural dependencies?
A. JSON
What is the mean of the dataset 39, 37, 42, 39, 43?
40 (the five values sum to 200, and 200 / 5 = 40)
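The mean can be checked by hand or, as a quick sketch, with the standard `statistics` module:

```python
import statistics

data = [39, 37, 42, 39, 43]

# The mean is the sum of the values divided by how many there are.
mean_value = statistics.mean(data)
manual_mean = sum(data) / len(data)

print(mean_value, manual_mean)
```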
What data storage solution is most appropriate for large amounts of unstructured data?
B. Data lake
What analysis is appropriate to compare two baseball athletes’ batting records?
T-test
What variables are appropriate data content for a report on robot performance?
Date, Distance(m), and RobotWeight(lb)
What analysis compares a sample to the population for representation?
Chi-square
What data quality dimension ensures data is reported in the same format?
C. Consistency
What type of report gives customer-facing agents the most up-to-date customer information?
C. A dynamic report
What characterizes structured data in a database?
B. Defined rows/columns
Which values will change when updating an old report with new data?
C. The reference dates
What should be the first consideration when choosing fonts for a report?
D. Branding
What type of report is appropriate for the sales department needing access to the latest sales data?
C. A self-service report
What type of deletion involves removing an entire row of data due to one value?
B. Listwise deletion
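As an illustration of listwise deletion, the sketch below drops any row that has at least one missing value. The `rows` data and its column names are hypothetical, and `None` is assumed to mark a missing value.

```python
# Hypothetical records; None marks a missing value.
rows = [
    {"id": 1, "age": 34, "income": 52000},
    {"id": 2, "age": None, "income": 61000},
    {"id": 3, "age": 45, "income": 58000},
]

# Listwise deletion: keep a row only if every one of its values is present.
complete_rows = [
    row for row in rows
    if all(value is not None for value in row.values())
]

print([row["id"] for row in complete_rows])
```

Row 2 is removed entirely even though only its `age` is missing, which is the trade-off that makes listwise deletion simple but costly on small datasets.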
What is code or software that provides information about the environment called?
A. System functions
What are if, and, or, and not examples of?
D. Conditional operators
What is the process of filling gaps in data with average values called?
C. Imputation
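A minimal sketch of mean imputation, assuming missing values are stored as `None`. The `scores` list is made up for illustration.

```python
import statistics

# Hypothetical data; None marks a gap.
scores = [10, None, 14, None, 12]

# Compute the average from the observed values only.
observed = [s for s in scores if s is not None]
fill_value = statistics.mean(observed)

# Replace each gap with that average.
imputed = [fill_value if s is None else s for s in scores]

print(imputed)
```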
What kind of visualization does the following chart represent?
A stacked bar chart
Under what circumstances should you check the quality of your data?
A. Data transformation
What type of correlation is depicted when values move in the same direction?
C. Positive correlation
What analysis is appropriate to compare your exam results to classmates’ scores?
C. Z-score
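To make the z-score idea concrete, here is a small sketch with hypothetical class scores. It assumes the class is treated as the whole population, so the population standard deviation is used.

```python
import statistics

# Hypothetical exam scores for the class, including your own.
class_scores = [70, 75, 80, 85, 90]
my_score = 90

class_mean = statistics.mean(class_scores)
class_spread = statistics.pstdev(class_scores)  # population standard deviation

# The z-score is how many standard deviations a value sits from the mean.
z_score = (my_score - class_mean) / class_spread

print(round(z_score, 2))
```

A positive z-score means the score is above the class average; a negative one means it is below.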
What type of data is characterized by a lack of structure?
A. Unstructured
What business requirement should be considered when making a report for a manager with little experience?
Views
How much has the average number of clicks per minute increased since last year if the average was 12.8 this year and 8.7 last year?
C. 47%
Options include A. 35%, B. 38%, C. 47%, D. 52%
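Percent change is (new value - old value) / old value. A quick sketch using the two averages from the question (12.8 this year, 8.7 last year):

```python
this_year = 12.8
last_year = 8.7

# Percent change relative to the earlier value.
percent_change = (this_year - last_year) / last_year * 100

print(f"{percent_change:.0f}%")
```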
Compliance reports are generally considered what type of report?
C. A recurring report
Options include A. An ad hoc report, B. A research report, C. A recurring report, D. A self-service report
What is redundant data?
A. The same information recorded in multiple columns
Options include A. The same information recorded in multiple columns, B. Data that is incomplete or blank, C. The same information recorded in multiple rows, D. Data that does not meet formatting requirements
What type of distribution does the following visualization represent?
C. Non-parametric
Options include A. Normal, B. Uniform, C. Non-parametric, D. Exponential
An IP address is considered what type of protected data?
A. PII
Options include A. PII, B. PHI, C. PCI, D. PIFI
In a simple AB study, which p-value would cause you to reject the null hypothesis assuming an alpha of 0.05?
D. 0.008
Options include A. 0.3, B. 0.5, C. 0.06, D. 0.008
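The decision rule behind this question is that the null hypothesis is rejected only when the p-value is below alpha. A small sketch applying it to the four options:

```python
alpha = 0.05
p_values = [0.3, 0.5, 0.06, 0.008]

# Reject the null hypothesis only when p < alpha.
rejecting = [p for p in p_values if p < alpha]

print(rejecting)
```

Note that 0.06 is close to alpha but still above it, so it does not lead to rejection.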
If you had a variable called BirdCount that kept track of the number of birds that flew by your window on a specific day, what variable type would it be?
D. Discrete
Options include A. Nominal, B. Continuous, C. Ordinal, D. Discrete
The act of automatically collecting, processing, and storing online transactions is called what?
D. OLTP
Options include A. OLAP, B. ETL, C. ELT, D. OLTP
Breaking large chunks of data down into small, usable pieces is called what?
B. Parsing
Options include A. Interpretation, B. Parsing, C. Reduction, D. Interpolation
A new variable that specifically holds a calculation of other variables is called what?
A. A derived variable
Options include A. A derived variable, B. A binary variable, C. A variable deletion, D. An ordinal variable
What analysis is most appropriate to determine if a sample accurately reflects the larger population of customers?
C. The chi-square goodness of fit
Options include A. The chi-square test for independence, B. The chi-square test for homogeneity, C. The chi-square goodness of fit, D. The chi-square test for linearity
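A rough sketch of the chi-square goodness-of-fit statistic, which compares observed sample counts against the counts expected from known population proportions. The counts and proportions below are invented for illustration.

```python
# Hypothetical sample counts per customer segment.
observed = [30, 50, 20]

# Hypothetical share of each segment in the full customer population.
population_share = [0.25, 0.50, 0.25]

n = sum(observed)
expected = [share * n for share in population_share]

# Sum of (observed - expected)^2 / expected across categories.
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Degrees of freedom = number of categories - 1.
degrees_of_freedom = len(observed) - 1

print(chi_square, degrees_of_freedom)
```

With 2 degrees of freedom and an alpha of 0.05 the critical value is about 5.991, so a statistic below that gives no evidence that the sample differs from the population.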
What type of analysis would be most appropriate to determine the connection between customer age and purchase likelihood?
C. Link analysis
Options include A. Trend analysis, B. Performance analysis, C. Link analysis, D. Exploratory data analysis
What is the most appropriate visualization to track ROI over the past 4 years?
A. A line chart
Options include A. A line chart, B. A scatter plot, C. A heat map, D. A bubble chart
Where on a dashboard do you put instructions for how to use it?
B. The cover page
Options include A. The appendix, B. The cover page, C. The FAQs, D. The reference dates
What conclusion can you draw from the following visualization?
D. There are at least 60 employees in the company
Options include A. This accounts for every employee in the company, B. Administration is the smallest department, C. Administration is the biggest department, D. There are at least 60 employees in the company
What kind of visualization does the following graph represent?
D. A bubble chart
Options include A. A waterfall chart, B. A scatter plot, C. A pie chart, D. A bubble chart
A data point that is so much smaller than every other data point that it drastically lowers the average is an example of what?
C. An outlier
Options include A. A specification mismatch, B. Duplicate data, C. An outlier, D. Data type validation
Which section of a data use agreement includes information on the consequences of using the data improperly?
A. The acceptable use policy
Options include A. The acceptable use policy, B. Data processing, C. Data deletion, D. Data retention
What type of join is represented by the following example?
B. Inner Join
Options include A. Outer join, B. Inner Join, C. Left Join, D. Right Join
When is an ideal time to implement MDM?
C. When a company is purchased
Options include A. When data is manipulated, B. When data is transferred, C. When a company is purchased, D. When data is transformed
Which variable indicates that a row is the most recent value?
B. Active Record
Options include A. Magic Number, B. Active Record, C. Active Start, D. Active End
A large data storage solution for relational data, focusing on efficiency and following a snowflake schema, would most likely be what?
A. A data warehouse
Options include A. A data warehouse, B. A data mart, C. A data lake, D. A data puddle
What concept does the following dataset exemplify?
C. Dummy coding
Options include A. Recoding a number into a category, B. Recoding a category into a number, C. Dummy coding, D. Transposition
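Dummy coding turns one categorical variable into a set of 0/1 columns. The sketch below uses a hypothetical `colors` variable and creates one indicator per category; in practice one category is often dropped as the reference level.

```python
# Hypothetical categorical variable.
colors = ["red", "green", "blue", "green"]

# One 0/1 indicator column per distinct category.
categories = sorted(set(colors))
dummy_rows = [
    {f"is_{category}": int(value == category) for category in categories}
    for value in colors
]

for row in dummy_rows:
    print(row)
```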
What is the range of the following dataset: 25, 38, 50, 49, 38?
A. 25
Options include A. 25, B. 38, C. 40, D. 50
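The range is the largest value minus the smallest. A quick check for the dataset in the question:

```python
data = [25, 38, 50, 49, 38]

# Range = maximum - minimum.
data_range = max(data) - min(data)

print(data_range)
```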
What type of database schema does the following diagram represent?
B. A star schema
Options include A. A snowflake schema, B. A star schema, C. A galaxy schema, D. A fast constellation schema
What type of access requirement is it if a database can only be accessed by people with the job title data analyst?
C. Role-based
Options include A. Data encryption, B. User group-based, C. Role-based, D. Data transmission
How do you interpret a p-value of 0.4 in a study about a memory-enhancing drug assuming an alpha of 0.05?
B. Reject the alternative hypothesis and accept the null hypothesis
Options include A. Accept the alternative hypothesis and reject the null hypothesis, B. Reject the alternative hypothesis and accept the null hypothesis, C. Accept the alternative and null hypotheses, D. Reject the alternative and null hypotheses
What is a prewritten query that allows the user to enter specific information to target data?
B. Parameterization
Options include A. Index, B. Parameterization, C. Filter, D. Sort
What do you call the process of checking the data type of a variable to avoid errors?
A. Data type validation
Options include A. Data type validation, B. Imputation, C. Specification mismatch, D. Interpolation
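One way to picture data type validation is a check that runs before any calculation. The `validate_types` helper, the field names, and the expected types below are all hypothetical.

```python
def validate_types(record, expected_types):
    """Return the names of fields whose values are not of the expected type."""
    return [
        field
        for field, expected in expected_types.items()
        if not isinstance(record.get(field), expected)
    ]


# Hypothetical schema and record; "42" is a string where an int is expected.
expected_types = {"age": int, "name": str}
record = {"age": "42", "name": "Ana"}

bad_fields = validate_types(record, expected_types)
print(bad_fields)
```

Catching the mismatch up front avoids errors later, such as trying to average a column that contains text.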
What sort of connection does a piece of code that requests information from an API and waits for a response have?
C. Synchronous
Options include A. Structured, B. Unstructured, C. Synchronous, D. Asynchronous
What sort of cardinality do the following tables have? Employee Information and Sales Information
B. One-to-many
Options include A. One-to-one, B. One-to-many, C. Many-to-many, D. There is no entity relationship
Which data validation approach should you take to see whether a dataset is appropriate for a specific goal?
B. Data audits
Options include A. Reasonable expectations, B. Data audits, C. Data profiling, D. Cross-validation
What conclusion can you draw from the following visualization?
C. The taller you are, the more likely you are to have a larger hat size
Options include A. Being taller makes your head bigger, B. Having a small head makes you taller, C. The taller you are, the more likely you are to have a larger hat size, D. The taller you are, the more likely you are to have a smaller hat size
A WAV file holds what type of information?
C. Audio
Options include A. Image, B. Video, C. Audio, D. Text
What is the process of using code to automatically collect data from websites called?
A. Web scraping
Options include A. Web scraping, B. Web services, C. Non-relational, D. Semi-structured
What type of data is the following dataset an example of?
C. Both structured and relational
Options include A. Structured, B. Relational, C. Both structured and relational, D. Neither structured nor relational
Which of the following is a key process of MDM?
A. Data dictionary
Options include A. Data dictionary, B. Data encryption, C. Data transformation, D. Data manipulation
What action has been performed on a dataset in the following depiction?
D. Sorting
Options include A. Filtering, B. Subsets, C. Parameterization, D. Sorting
What type of error occurs if you wrongly conclude that a software update made a program faster?
A. Type I
Options include A. Type I, B. Type II, C. Type III, D. Type IV
Which factor directly impacts the speed and efficiency of a dashboard?
Interactive saved searches
Review Chapter 12, Reporting Process – Understanding Report Delivery
What data format is mentioned in relation to data structures?
JSON
Review Chapter 2, Data Structures, Types, and Formats – Going through Data Types and File Types
What is the mean of the dataset 39, 37, 42, 39, 43?
40
Review Chapter 7, Measures of Central Tendency and Dispersion – Understanding Measures of Central Tendency
What type of data storage is referred to as a data lake?
Data lake
Review Chapter 2, Data Structures, Types, and Formats – Understanding the Concept of Warehouses and Lakes
Which statistical test is mentioned?
T-test
Review Chapter 10, Introduction to Inferential Statistics – Understanding t-Tests
What are the variables listed related to reporting?
Date, Distance(m), and RobotWeight(lb)
Review Chapter 12, Reporting Process – Knowing what to Consider when Making a Report
What statistical test is associated with the term chi-square?
Chi-square
Review Chapter 10, Introduction to Inferential Statistics – Knowing Chi-Square
What concept is described as consistency in data quality?
Consistency
Review Chapter 15, Data Quality and Management – Understanding Quality Control
What type of report is described as dynamic?
A dynamic report
Review Chapter 11, Types of Reports – Distinguishing Static and Dynamic Reports
What defines structured data?
Defined rows/columns
Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data
What are reference dates in reporting?
The reference dates
Review Chapter 12, Reporting Process – Understanding Report Elements
What aspect of reporting is related to branding?
Branding
Review Chapter 12, Reporting Process – Designing Reports
What is a self-service report?
A self-service report
Review Chapter 11, Types of Reports – Knowing about Self-Service Reports
What method is used to deal with missing data?
Listwise deletion
Review Chapter 4, Cleaning and Processing Data – Dealing with Missing Data
What functions are used in data shaping?
System functions
Review Chapter 5, Data Wrangling and Manipulation – Shaping Data with Common Functions
What type of operators are used in data manipulation?
Conditional operators
Review Chapter 5, Data Wrangling and Manipulation – Shaping Data with Common Functions
What technique is used to fill in missing data?
Imputation
Review Chapter 4, Cleaning and Processing Data – Dealing with Missing Data
What type of chart is described as a stacked bar chart?
A stacked bar chart
Review Chapter 13, Common Visualizations – Comprehending Charts with Bars
What process is involved in data quality management?
Data transformation
Review Chapter 15, Data Quality and Management – Understanding Quality Control
What correlation type is mentioned?
Positive correlation
Review Chapter 10, Introduction to Inferential Statistics – Calculating Correlations
What statistic measures standard deviations from the mean?
Z-score
Review Chapter 8, Common Techniques in Descriptive Statistics – Understanding z-Scores
What type of data structure is referred to as relational?
Relational
Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data
What are views in reporting?
Views
Review Chapter 12, Reporting Process – Knowing what to Consider when Making a Report
What is the percent increase from an average of 8.7 clicks per minute last year to 12.8 this year?
47%
Review Chapter 8, Common Techniques in Descriptive Statistics – Calculating Percent Change and Percent Difference
What is defined as a recurring report?
A recurring report
Review Chapter 11, Types of Reports – Understanding Recurring Reports
What is described as the same information recorded in multiple columns?
The same information recorded in multiple columns
Review Chapter 4, Cleaning and Processing Data – Managing Duplicate and Redundant Data
What type of data is referred to as non-parametric?
Non-parametric
Review Chapter 4, Cleaning and Processing Data – Understanding Non-parametric Data
What does PII stand for?
Personally identifiable information
Review Chapter 14, Data Governance – Understanding Data Classifications
Which p-value leads you to reject the null hypothesis at an alpha of 0.05: 0.3, 0.5, 0.06, or 0.008?
0.008
Review Chapter 9, Hypothesis Testing – Learning p-Value and Alpha
What type of data is described as discrete?
Discrete
Review Chapter 2, Data Structures, Types, and Formats – Going through Data Types and File Types
What does OLTP stand for?
Online transaction processing
Review Chapter 3, Collecting Data – Understanding OLTP and OLAP
What process involves parsing data?
Parsing
Review Chapter 5, Data Wrangling and Manipulation – Parsing Your Data
What is referred to as a derived variable?
A derived variable
Review Chapter 5, Data Wrangling and Manipulation – Calculating Derived and Reduced Variables
What test is known as the chi-square goodness of fit?
The chi-square goodness of fit
Review Chapter 10, Introduction to Inferential Statistics – Knowing Chi-Square
What type of analysis is mentioned as link analysis?
Link analysis
Review Chapter 6, Types of Analytics – Finding Links
What type of chart is described as a line chart?
A line chart
Review Chapter 13, Common Visualizations – Charting Lines, Circles, and Dots
Where on a dashboard do you put instructions for how to use it?
The cover page
Review Chapter 12, Reporting Process – Understanding Report Elements
What statement is made about the number of employees in a company?
There are at least 60 employees in the company
Review Chapter 13, Common Visualizations – Comprehending Charts with Bars
What type of chart is described as a bubble chart?
A bubble chart
Review Chapter 13, Common Visualizations – Charting Lines, Circles, and Dots
What is an outlier in data analysis?
An outlier
Review Chapter 4, Cleaning and Processing Data – Finding Outliers
What policy is related to acceptable use in data governance?
The acceptable use policy
Review Chapter 14, Data Governance – Knowing Use Requirements
What type of join is mentioned?
Inner join
Review Chapter 5, Data Wrangling and Manipulation – Merging Data
When is an ideal time to implement MDM?
When a company is purchased
Review Chapter 15, Data Quality and Management – Understanding Master Data Management (MDM)
What is referred to as active record?
Active Record
Review Chapter 2, Data Structures, Types, and Formats – Updating Stored Data
What is defined as a data warehouse?
A data warehouse
Review Chapter 2, Data Structures, Types, and Formats – Understanding the Concept of Warehouses and Lakes
What coding technique is referred to as dummy coding?
Dummy coding
Review Chapter 5, Data Wrangling and Manipulation – Recoding Variables
What is the range of the dataset 25, 38, 50, 49, 38?
25
Review Chapter 7, Measures of Central Tendency and Dispersion – Calculating Range and Quartiles
What type of schema is referred to as a star schema?
A star schema
Review Chapter 2, Data Structures, Types, and Formats – Going through the Data Schema and its Types
What is described as role-based in data governance?
Role-based
Review Chapter 14, Data Governance – Understanding Data Security
What decision is made regarding the null hypothesis?
Reject the alternative hypothesis and accept the null hypothesis
Review Chapter 9, Hypothesis Testing – Learning p-Value and Alpha
What process is referred to as parameterization?
Parameterization
Review Chapter 3, Collecting Data – Optimizing Query Structure
What is the term for validating data types?
Data type validation
Review Chapter 4, Cleaning and Processing Data – Understanding Invalid Data, Specification Mismatch, and Data Type Validation
What is described as synchronous in data collecting?
Synchronous
Review Chapter 3, Collecting Data – Utilizing Public Sources of Data
What type of relationship is referred to as one-to-many?
One-to-many
Review Chapter 14, Data Governance – Handling Entity Relationship Requirements
What process is involved in validating data quality?
Data audits
Review Chapter 15, Data Quality and Management – Validating Quality
What statement is made about height and hat size?
The taller you are the more likely you are to have a larger hat size
Review Chapter 13, Common Visualizations – Charting Lines, Circles, and Dots
What type of data is referred to as audio?
Audio
Review Chapter 2, Data Structures, Types, and Formats – Going through Data Types and File Types
What method is used for collecting your own data?
Web scraping
Review Chapter 3, Collecting Data – Collecting Your Own Data
What type of data structure is both structured and relational?
Both structured and relational
Review Chapter 2, Data Structures, Types, and Formats – Understanding Structured and Unstructured Data
Which of the following is a key process of MDM?
Data dictionary
Review Chapter 15, Data Quality and Management – Understanding Master Data Management (MDM)
What process is referred to as sorting?
Sorting
Review Chapter 3, Collecting Data – Optimizing Query Structure
What type of error is defined as Type I?
Type I
Review Chapter 9, Hypothesis Testing – Understanding Type I and Type II Error