17 Practice Exam Two Flashcards
What does EDA stand for?
Exploratory data analysis
If the result was a type II error, what was your conclusion?
Gerbils and hamsters can lift the same amount
What type of error does the following dataset represent?
Duplicate data
Which of the following represents the percent of observations in each category as compared to the whole?
Percentage
What is the interpretation of a p-value of 0.04 assuming an alpha of 0.05?
Accept the alternative hypothesis and reject the null hypothesis
The idea that there will be no difference between the performance of two groups is what kind of hypothesis?
Null hypothesis
Which visualization would be most appropriate for the relationship between the weight of a ferret and milk production?
A scatter plot
A flat file delimited by commas is what file type?
CSV
Which element should never be on the cover page of a report?
The appendix
Data type validation is a process specifically used to avoid what type of error?
Invalid data
What is an appropriate title for the following chart?
The Population of India Averaged for the Years 2015, 2016, and 2017 as Sub-Divided by Geographical Regions Determined by the 2018 Land Survey
What does it mean for a dashboard to be real-time?
It has the absolute most up-to-date rates and figures
What is the act of automatically moving and analyzing online transactions called?
OLTP
What does the following code snippet represent? Data = ‘This book makes me happy.’ Data = [‘This’, ‘book’, ‘makes’, ‘me’, ‘happy’, ‘.’]
Parsing
Which of the following is a valid data storage solution for audio files?
A data lake
What type of analysis is most appropriate for checking the efficiency of each phase of a production process?
Performance analysis
Who is the most appropriate audience for a detailed report on grain-to-egg efficiency ratios?
Technical experts
What would be the result of an outer join on the provided tables?
Joined Table with NULLs for unmatched records
What type of report is most appropriate for a project manager at the end of every sprint?
A recurring report
Find the mode of the following dataset: 5, 3, 8, 5, 3, 9, 3, 8, 2
3
What type of analysis is most appropriate for examining the connection between hours worked and mistakes made?
Link analysis
What means of updating a table is represented by adding new values to the bottom?
Active record
What conclusion can you draw from the following visualization?
Around 350 students achieved a grade of C or higher
What type of schema consists only of normalized tables?
A snowflake schema