6 Types of Analytics Flashcards

1
Q

What are the four general categories of analyses needed for the exam?

A

Exploratory data analysis (EDA), performance analysis, trend analysis, link analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is EDA?

A

A general term for analyses used to understand your data better

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What common types of analyses fall under EDA?

A

Descriptive statistics, relationships, dimension reduction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the four categories of descriptive statistics?

A
  • Measures of central tendency
  • Measures of dispersion
  • Measures of frequency
  • Measures of position
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What do measures of central tendency include?

A
  • Mean
  • Median
  • Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What do measures of dispersion explain?

A

How spread out your data is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some examples of measures of dispersion?

A
  • Standard deviation
  • Range
  • Variance
  • Min
  • Max
  • Quartiles
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How are measures of frequency represented?

A

Counts, ratios, or percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What visualization is often used for measures of frequency?

A

Bar chart or heat maps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the purpose of relationships in EDA?

A

To see whether there is a correlation between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a common visualization used to assess relationships between variables?

A

Scatter plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is dimension reduction?

A

The idea of simplifying data before analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are some advanced methods for dimension reduction?

A
  • Principal component analysis (PCA)
  • Non-negative matrix factorization (NMF)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a key step before performing analyses on a dataset?

A

Understanding basic information about the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the function used in Python’s pandas package to get descriptive statistics?

A

.describe()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What command is used to import the pandas package in Python?

A

import pandas as pd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does the command MyData.info() provide?

A

A rough description of the dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the common variable types in programming languages?

A
  • Integers
  • Floats
  • Strings
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does the command MyData[‘Age_Bracket’].value_counts() do?

A

It provides counts of each unique value in the Age_Bracket variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the command pd.plotting.scatter_matrix(MyData) create?

A

A scatter plot matrix of all numerical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

True or False: EDA includes data cleaning and wrangling processes.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Fill in the blank: EDA encompasses any preliminary information gathering that you must do before you can jump into what you actually want to know, including frequencies, averages, trends, or the _______.

A

relationships between your variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the purpose of using scatter plots in EDA?

A

To visually assess potential correlations between variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What does the command MyData[‘Total_Spent’].describe() return?

A

Descriptive statistics for the Total_Spent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the main goal of exploratory data analysis?

A

To lay the groundwork for later analyses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What does the scatter plot matrix display?

A

Relationships between numerical variables and their distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What does the distribution of Client_ID indicate?

A

The distribution is random and does not show any trends.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What does the AB_Recode / Client_ID matrix indicate?

A

Including a categorical variable in this matrix means nothing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What relationship does Client_ID have with Orders and Total_Spent?

A

Client_ID relates to Orders and Total_Spent.

30
Q

What does Figure 6.8 illustrate?

A

It shows random scattering without any relationship between Client_ID and other variables.

31
Q

How is the distribution of AB_Recode characterized?

A

It shows a pretty normal distribution.

32
Q

What makes Age_Bracket an ordinal variable?

A

It has an inherent order: <21, 21–30, 31–40, 41–50, and >50.

33
Q

Why is a histogram based on a categorical variable like color meaningless?

A

There is no logical order to the categories.

34
Q

What does the histogram of Orders indicate?

A

It is skewed to the right and not a normal distribution.

35
Q

What does the scatter plot of Orders and Total_Spent indicate?

A

There might be a positive relationship between Orders and Total_Spent.

36
Q

What is the focus of performance analysis?

A

Meeting goals across various metrics.

37
Q

What are KPIs?

A

Metrics used to gauge performance.

38
Q

What is an example of a common KPI?

A

Return on investment (ROI).

39
Q

What does project management performance analysis focus on?

A

Team performance, task completion, and productivity consistency.

40
Q

What type of analytics does process analytics focus on?

A

The efficiency of a process involving multiple steps.

41
Q

What is trend analysis?

A

Analyzing trends over time to identify patterns.

42
Q

What is forecasting in trend analysis?

A

Extending a trendline into the future.

43
Q

What is link analysis?

A

Tools and techniques explaining the relationships between variables.

44
Q

What are the three main types of link analysis methods?

A
  • Scatter plots * Correlation analyses * Structural Equation Modeling
45
Q

Why is choosing the correct analysis challenging?

A

There are many analyses for different situations with variations and debates on effectiveness.

46
Q

What do statistical analyses make about data?

A

Assumptions that must be met for reliable results.

47
Q

What should you include in your list of analyses?

A
  • Purpose of the analysis * Required variables * Assumptions made
48
Q

True or False: The results of an analysis can be reliable even if assumptions are not met.

49
Q

What does EDA stand for?

A

Exploratory Data Analysis

50
Q

What is the purpose of EDA?

A

Understanding basic information about your data before performing more advanced analytics

51
Q

What type of analysis focuses on comparing key metrics?

A

Performance analysis

52
Q

What is trend analysis primarily used for?

A

Looking at things over time and forecasting future metrics

53
Q

What type of analysis is used to find relationships between variables?

A

Link analysis

54
Q

What are assumptions in statistical analyses?

A

A list of prerequisites that must be met

55
Q

Fill in the blank: The three steps to choose the correct statistical analysis are knowing your ______, knowing your ______, and checking your ______.

A

goal, variables, assumptions

56
Q

True or False: There is a magic bullet that will instantly help you choose the perfect analysis every time.

57
Q

What should you do first when picking a specific type of analysis?

A

Know your goal

58
Q

What is the goal of performance analysis?

A

Checking the progress of a company, team, or process

59
Q

Which analysis type includes descriptive statistics?

A

Exploratory Data Analysis

60
Q

What does KPI stand for in the context of performance analysis?

A

Key Performance Indicator

61
Q

What is the main focus of project management analysis?

A

Tracking how a team is performing

62
Q

What type of analysis is often used to track how a group of people feels about something?

A

Trend analysis

63
Q

Fill in the blank: Scatter plots and correlation are part of ______ analysis.

64
Q

What is the first step in the process of choosing an analysis?

A

Know your goal

65
Q

In the context of analysis, what does it mean to check your assumptions?

A

Ensure the prerequisites for an analysis are met

66
Q

True or False: You should never make assumptions; confirm everything.

67
Q

What is often necessary if you lack the required variables for a specific analysis?

A

Find or create the necessary variables

68
Q

Which analysis type focuses on tracking the efficiency of a multi-stage process?

A

Process analysis

69
Q

Fill in the blank: Choosing the correct statistical analysis can be ______.

70
Q

What is the main goal of exploratory data analysis?

A

To get a feel for your data and learn the basics about it