Other Flashcards
True or false: Predictive analytics tools may be used to analyze security logs to anticipate and block bad web requests.
True
Understand Common Data Analysis Use Cases Module
What is an advantage of doing data analytics in the cloud?
A
It doesn’t require knowledge of the network and operating system.
B
You can stop paying for infrastructure resources when they aren’t needed.
C
You get to experience data-center operations.
D
A and B
E
B and C
A and B
True or false: Serverless technology enables organizations to focus on the analytics instead of server configuration.
A
True
B
False
True
Which of the following best describes how Salesforce Data Cloud works?
A. By using a data lake that does not unify customer profiles
B. By activating across Service Cloud and Marketing Cloud only
C. By creating personalized experiences that are not in real-time
D. By harmonizing customer data from any source in real time into a unified profile
D. By harmonizing customer data from any source in real time into a unified profile
True or false: Data Cloud enables the entire Salesforce Platform to act on real-time customer data.
A
True
B
False
True
Which of these definitions best fits the trait of data being clear?
A
It contains both quantitative and qualitative variables.
B
It goes back over time to show patterns.
C
It is written in understandable terms, not code.
D
It contains groups based on similar characteristics.
C
It is written in understandable terms, not code.
How does Tableau organize data?
A
Into a flat table
B
Into a dimensional structure
C
Into data types and data roles
D
Into workbooks and worksheets
B
Into a dimensional structure
What are the 10 Traits of Useful Data?
- High Volume: A large amount of relevant, available data means that there’s a better chance you’ll have what you need to answer your questions.
- Historical: Data that goes back in time lets you see how the present situation arose from patterns that developed over time. For example, sales trends over the last 10 years show where sales increased or decreased.
- Consistent: As things change, data should be adjusted for consistency. Salary and price data adjusted for inflation is a good example of this.
- Multivariate: Data should contain both quantitative (numerically measurable) and qualitative (characteristic, not numerically measurable) variables. The more variables in the data, the more you can discover from it.
- Atomic: The more finely detailed the data, the more you are able to examine it at various levels of detail. For example, if you wanted to understand bicycle riding trends in your state, it would be helpful to see these trends as impacted by county, city, and neighborhood.
- Clean: In order for data to be useful, it should be accurate, complete, and free from errors.
- Clear: Data should be written in terms that can be easily understood, not in code. For example, the housing type values single family, two-family conversion, and end-unit townhouse are much easier to understand than 1Fam, 2fmCon, and TwnhsE.
- Dimensionally Structured: An accessible way to structure data is to organize it into two types: Dimensions (qualitative values) and Measures (quantitative values). This is the organizational structure Tableau uses when interpreting data.
- Richly Segmented: Groups, based on similar characteristics, should be built into data for easier analysis. For example, data about movies could be grouped by genre (action, science fiction, romance, comedy, and so on).
- Of Known Pedigree: In order to trust the data, you should know its background—where it comes from and how it has since been altered.
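Example (Clear): the coded housing values from the Clear trait can be translated into readable labels with a lookup table. This is a minimal Python sketch, not part of the original card. The names HOUSING_LABELS and decode_housing are hypothetical, and only the three codes mentioned in the card are covered.

```python
# Hypothetical lookup table built from the three codes in the Clear example.
HOUSING_LABELS = {
    "1Fam": "single family",
    "2fmCon": "two-family conversion",
    "TwnhsE": "end-unit townhouse",
}

def decode_housing(code):
    """Return the readable label for a housing code, or the code itself if unknown."""
    return HOUSING_LABELS.get(code, code)

# Made-up column of coded values, translated into clear terms.
coded = ["1Fam", "TwnhsE", "2fmCon", "1Fam"]
clear = [decode_housing(c) for c in coded]
print(clear)
```

Unknown codes are passed through unchanged so that nothing is silently dropped; a real cleanup would probably flag them for review instead.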
Which is a characteristic of well-structured data?
A
Each variable (field) is in one column, with a column header.
B
Each different observation of a variable (value) is consolidated into a single row.
C
Titles are not formatted as rows above the column headers or as extra columns.
D
Column headers appear in the third row.
A
Each variable (field) is in one column, with a column header.
Which of these accurately defines columns and rows?
A
Columns are horizontal, and each column represents one unit of analysis. Rows are vertical, and each row represents a different variable (or field).
B
Columns contain qualitative values, and rows contain quantitative values.
C
Columns are vertical, and each column represents a different variable (or field). Rows are horizontal, and each row represents one unit of analysis.
D
Rows contain qualitative values, and columns contain quantitative values.
C
Columns are vertical, and each column represents a different variable (or field). Rows are horizontal, and each row represents one unit of analysis.
Consider the example of a table that includes columns for City and State values and sales figures for each quarter (Q1, Q2, Q3, Q4). Which restructuring operation would you use so it has a single Quarters column and a single column for Sales values?
A
Split the Q1, Q2, Q3, and Q4 fields.
B
Pivot the Q1, Q2, Q3, and Q4 fields.
C
Split the City and State fields.
D
Pivot the City and State fields.
B
Pivot the Q1, Q2, Q3, and Q4 fields.
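Example: the pivot in this card can be sketched in plain Python. The cities and sales figures are made up for illustration, and pivot_columns_to_rows is a hypothetical helper, not a Tableau function. It only shows what the operation does to the shape of the table.

```python
# Illustrative wide table: one row per city, one column per quarter.
# The cities and sales figures are made up for this sketch.
wide_rows = [
    {"City": "Austin", "State": "TX", "Q1": 100, "Q2": 120, "Q3": 90, "Q4": 150},
    {"City": "Denver", "State": "CO", "Q1": 80, "Q2": 85, "Q3": 95, "Q4": 110},
]

def pivot_columns_to_rows(rows, pivot_fields, name_field, value_field):
    """Turn each listed column into its own row (a columns-to-rows pivot)."""
    long_rows = []
    for row in rows:
        # Fields that are not pivoted are copied onto every output row.
        kept = {k: v for k, v in row.items() if k not in pivot_fields}
        for field in pivot_fields:
            long_rows.append({**kept, name_field: field, value_field: row[field]})
    return long_rows

long_rows = pivot_columns_to_rows(
    wide_rows, ["Q1", "Q2", "Q3", "Q4"], "Quarter", "Sales"
)
for r in long_rows:
    print(r)
```

Each input row becomes four output rows, one per quarter, so the table ends up with a single Quarter column and a single Sales column.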
What is the purpose of a split?
A
Changes columns into rows.
B
Removes unwanted data.
C
Identifies rows that contain the same values for a specific field.
D
Separates a column that contains multiple pieces of information into multiple columns, one for each piece of information.
D
Separates a column that contains multiple pieces of information into multiple columns, one for each piece of information.
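Example: a split can be sketched in plain Python. The Location column and its values are made up for illustration, and split_field is a hypothetical helper, not a Tableau function.

```python
# Illustrative table where one column holds two pieces of information.
rows = [
    {"Location": "Austin, TX", "Sales": 100},
    {"Location": "Denver, CO", "Sales": 80},
]

def split_field(rows, field, new_fields, separator=","):
    """Replace one combined column with several columns, one per piece."""
    result = []
    for row in rows:
        # Break the combined value apart and trim surrounding spaces.
        parts = [p.strip() for p in row[field].split(separator)]
        new_row = {k: v for k, v in row.items() if k != field}
        new_row.update(zip(new_fields, parts))
        result.append(new_row)
    return result

split_rows = split_field(rows, "Location", ["City", "State"])
for r in split_rows:
    print(r)
```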
What is the purpose of a pivot?
A pivot changes columns into rows, and in some cases, vice versa.
Use a pivot on “wide” data sets when the information is captured with many columns, and the columns contain similar information.
In terms of variables and field types, what are the three types of variables?
Nominal qualitative variables
Ordinal qualitative variables
Quantitative variables
Which is an example of a nominal qualitative variable?
A
Types of residence (house, apartment, dorm, houseboat, and more)
B
Gold medal, silver medal, bronze medal
C
Hot, warm, tepid, cool, cold
D
Mild, medium, spicy
A
Types of residence (house, apartment, dorm, houseboat, and more)
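Example: one way to tell the three variable types apart is by which operations make sense on them. The sketch below is only an illustration in plain Python. The sample values and the MEDAL_RANK table are assumptions made up for this note.

```python
from statistics import mean

# Nominal qualitative: categories with no built-in order.
residences = ["house", "apartment", "dorm", "house"]
distinct_residences = set(residences)  # can be counted or compared for equality

# Ordinal qualitative: categories with a meaningful order.
medals = ["silver", "gold", "bronze"]
MEDAL_RANK = {"gold": 1, "silver": 2, "bronze": 3}  # hypothetical rank table
medals_in_order = sorted(medals, key=MEDAL_RANK.get)

# Quantitative: numeric values that support arithmetic.
class_sizes = [24, 30, 27]
average_class_size = mean(class_sizes)

print(distinct_residences)
print(medals_in_order)
print(average_class_size)
```

Sorting the residences would only give alphabetical order, which carries no meaning, while the medal ranks reflect a real ordering and the class sizes can be averaged.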
Which of the following is an example of a continuous variable?
A
Volume of water in the Pacific Ocean
B
Number of students in a class
C
Shoe sizes of the members of a dance team
D
Home runs hit by a baseball team
A
Volume of water in the Pacific Ocean
Children in a specific third grade class chose an apple, a banana, or a plum as a snack one day. You want to show how many children chose each type of fruit. Which would you use to show this?
A frequency distribution
A proportion distribution
A normal distribution
A density distribution
A frequency distribution
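Example: a frequency distribution is a count of how often each value occurs. A minimal Python sketch, with snack choices made up for illustration:

```python
from collections import Counter

# Made-up snack choices, one entry per child.
choices = ["apple", "banana", "apple", "plum", "banana", "apple"]

# Counter tallies how many times each value appears.
frequency = Counter(choices)

# List the fruits from most chosen to least chosen.
for fruit, count in frequency.most_common():
    print(f"{fruit}: {count}")
```

The counts add up to the number of children, which is a quick check that no choice was missed.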
Which of the following is a characteristic of normal distributions?
A. Most natural phenomena perfectly follow normal distributions.
B. In a normal distribution, the median is greater than the mean.
C. A normal distribution is the same as a symmetrical histogram showing a finite set of continuous values.
D. When using a data sample that represents the total population, the normal distribution can be a useful estimation tool.
D. When using a data sample that represents the total population, the normal distribution can be a useful estimation tool.
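Example: the sketch below shows one way a normal distribution can serve as an estimation tool. It fits a normal model to a sample and uses it to estimate a share of the population. The heights are made up, and the result is only as good as the assumptions that the sample is representative and the data is roughly normal.

```python
from statistics import NormalDist

# Made-up sample of heights in centimeters, assumed to represent the population.
sample_heights_cm = [158, 162, 165, 167, 170, 171, 174, 176, 180, 185]

# Estimate the mean and standard deviation from the sample.
model = NormalDist.from_samples(sample_heights_cm)

# Estimated share of the population shorter than 175 cm under this model.
share_below_175 = model.cdf(175)

print(f"mean={model.mean:.1f}, stdev={model.stdev:.1f}")
print(f"estimated share below 175 cm: {share_below_175:.2f}")
```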
What is a density curve?
A. Continuous distributions that represent all possible data points at once.
B. Continuous distributions where the height of the curve determines the number of data points occurring within that curve.
C. Normal distributions where the probability within an interval equals the area above the curve within that interval.
D. Normal distributions where the x-axis represents the density of probability.
A. Continuous distributions that represent all possible data points at once.
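Example: a related property of density curves is that the probability of falling within an interval equals the area under the curve over that interval, and the total area is 1. The sketch below checks this numerically for a standard normal curve. The helper area_under_curve is hypothetical and uses a simple trapezoid approximation.

```python
from statistics import NormalDist

def area_under_curve(density, start, end, steps=10_000):
    """Approximate the area under a density function with the trapezoid rule."""
    width = (end - start) / steps
    total = 0.0
    for i in range(steps):
        left = start + i * width
        right = left + width
        # Area of one thin trapezoid between left and right.
        total += (density(left) + density(right)) / 2 * width
    return total

standard_normal = NormalDist(mu=0, sigma=1)

# Probability of landing within one standard deviation of the mean.
area = area_under_curve(standard_normal.pdf, -1, 1)
exact = standard_normal.cdf(1) - standard_normal.cdf(-1)

# Over a very wide interval the area approaches 1.
total_area = area_under_curve(standard_normal.pdf, -8, 8)

print(area, exact, total_area)
```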
What does the SCAM checklist remind you to evaluate when viewing data visualizations?
A
Source, Color, Axes, and More
B
Source, Chart, Axes, and Message
C
Space, Color, Audio, and Magnitude
D
Strength, Character, Audacity, and Mode
B
Source, Chart, Axes, and Message