A/B Testing Flashcards

1
Q

A/B Testing is a type of _________ testing

A

hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A/B testing and multi-armed bandits can help you determine if a change adds ______

A

value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In what situations might you want to use A/B testing?

A

Exploring usability improvements
Establishing the effectiveness of promotions
Staged rollout of major changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the general process of performing A/B testing?

A

Create a hypothesis between A and B
Determine what data to collect (population, metric, size)
Randomly serve A to one pop and B to the other
Use a t-test to measure the differences in the populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a type 1 and type 2 error?

A

Type 1: Reject the null hypothesis even though it’s true

Type 2: Fail to reject the null hypothesis even though it’s false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are some problems when choosing populations for your hypothesis?

A

Hypothesis may not apply to everyone
Hypothesis may affect subpopulations differently
Population needs to be representative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some problems when choosing your hypothesis?

A

Need to define clear goals, otherwise they are useless

Testing many things increases the likelihood of false positives and p-hacking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are some problems when selecting stopping criteria and confidence?

A

Size of a test campaign must be set up front, instead of running the process and stopping when significance is reached

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is regression to the mean? Why is it a problem?

A

Following an extreme event, the next event is likely less extreme
Can cause the illusion of siginificance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are novelty effects?

A

The novelty of a change for the sample may bias the underlying results of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Review sequential hypothesis testing on video

A

Review sequential hypothesis testing on video

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are multi armed bandits used for?

A

Figuring out how to make a good choice now. It chooses between exploration vs exploitation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the epsilon-greedy multi armed bandit strategy? What are some issues with it?

A

Choose a percentage. If over, pull best arm so far. Else pull a random arm. Update arm stats.

It is sensitive to variance and performs worse than other approaches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Thompson Sampling multi armed bandit strategy?

A

For each arm, sample from it’s successes and failures. Pull the arm that has the max value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why might you prefer multiarmed bandits over A/B testing?

A

You can start using bandits immediately, as they don’t require a large population or much setup. You will, however, not make any long term business decisions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly