The Data Analytics Journey Flashcards
WGU Class D596
Quantative Data
Quantitative data represents numerical values that can be measured or counted. It answers questions like “How many?” or “How much?”
Discrete data
Countable values. Distinct and separate; they cannot take on values between the defined points.
Like number of students in a class or pets in a home
Continuous data
Continuous data is a type of quantitative data that can take on any value within a range.
Height, temperature, time
categorical data
Categorical data represents categories or labels rather than numerical values.
Can be nominal or ordinal
Nominal data
Categories have no natural order.
ex: colors, types of pets, martital status, favorite sports, car brands
Ordinal data
Categories have a meaningful order, but the intervals between them are not equal.’ Examples: Ratings (poor, fair, good, excellent), educational levels (high school, bachelor’s, master’s).
Part to whole
Shows how individual parts contribute to the whole. Great for when you want to display proportions or percentages.
Distribution
Shows how values in a dataset are spread or distributed across a range. Understanding the spread, skewness, or patterns in your data.
Nominal Comparison
Compares values for categorical (nominal) variables without any specific order. Comparing quantities between categories.
Time Series
Data collected over time (e.g., daily, monthly, yearly) to track trends or patterns. Analyzing how data changes over time.
Correlation
Shows the relationship between two variables, indicating whether they move together (positive correlation), move oppositely (negative correlation), or show no relationship.
Ranking
Compares items in a dataset by sorting them in ascending or descending order. Highlighting the relative positions or hierarchy of categories.
Deviation
Shows how data deviates from a baseline, expected value, or the mean. Highlighting differences or anomalies in the data.
What charts are good for deviation?
Diverging bar chart
Line chart (with baseline or reference line)
Error bars.
What charts are good for ranking?
Bar chart (sorted by value)
Column chart
Dot plot.
What charts are good for correlation?
Scatter plot
Bubble chart
Heatmap.
What charts are good for time series?
Line chart
Area chart.
What charts are good for nominal comparison?
Bar chart
Column chart
What charts are good for distribution
Histogram
Box plot
Violin plot.
What charts are good for part-to-whole?
Pie chart
Donut chart
Stacked bar chart (with percentages).
What are visual elements to use when designing charts?
Similarity & Contrast
Dominance & Emphasis
Scale & Proportion
Hierarchy
Balance & Symmetry
Regression
Regression is a technique that allows an analyst to predict an outcome (either numerical or categorical) based on a set of predictor variables.