Data Analysis Flashcards

1
Q

What is EDA?

A

Exploratory Data Analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the process of eda

A
  1. prep (collecting, cleaning)
  2. explore (learn variables, compute summaries, find correlations/trends, visualize the data
  3. hypothesis and further analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are summary statistics?

A

measures of central tendency: mean, median, avg

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Format Sum of Licenses Bought (in your pivot table) as a number with a , separator and 0 decimal places.

A

Right-click # - number format - number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Create a Calculated Field that divides Sales Amount by Number of Users and call it “Sales per Sub”.

A

click in pivot - pivottable analyze - fields,items, and sets - calculated field

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

which is switch and which is and IF statement?

B2>100,”you’re rich!”,”Keep saving.”)

B2,1,”One”,2,”Two”,3,”Three”

A

IF=B2>100,”you’re rich!”,”Keep saving.”)

SWITCH=B2,1,”One”,2,”Two”,3,”Three”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

how to find relationship between discounts and number of units sold

A

calculated field - =discounts/units sold

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

explain SWITCH logic

A

=switch[@[from this column]],
(if)”Basic”,(then)10,
“Premium”,10,
“Business”,25,
“Enterprise”,27,
(if none of these, then) 0)

0 is a necessary default value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

IF formula for new column called “Non-Paying Users” that finds the difference on the row level between the Number of Users and Licenses Bought

A

=IF(#ofusers -licensesbought<0,0,#ofusers -licensesbought)

logic: if #users-licenses is less than 0, then 0, if not, then show the difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how do you re-include a moved column into the same nominal table as the rest of the sheet?

A

click in table - table design - resize table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

how do i Update a formula to only multiply if the Sales Month is the latest month, otherwise default to 0?

A
  1. find latest Sales Month =MAX([Sales Month])
  2. Modify the formula in your “Current Upsell $” column to only multiply when the Sales Month matches the latest month
    =IF([@[Sales Month]]=[@[Max Sales month]],[@[Non-paying users]]*[@[Upsell Per non-paying customer]],0)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how do you Count the number of customers per month using Account Sales History, Customer ID, and Sales Month

A

=countif(whole sales month column, sales month specified on new sheet)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how do you calculate the total number of sales for the month

A

=sumif(range:Sales month from account sales history og sheet, single cell sales month from sales trends sheet, sales amount from og sheet)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how do you average the Total Sales per month by Number of Customers

A

=averageif(range:#of customers columnB:B, criteria:first cell #ofcust B:2,average_range: Total Sales column)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how do you test your hypothesis that hat the share of non-paying customers (“Basic” subscriptions) are growing faster than our paying customers?

A

use COUNTIFS() to count the number of customers with a Basic subscription for the Sales Month.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly