EXTRA Flashcards
When do we use a quasi-experiment?
When we cannot run a true experiment. If you do not have permission to randomly assign people to groups.
So, when we cannot run a field or lab experiment.
If you would study Netflix the same way as Datta, Knox, and Bronnenberg (2018), how would you set up the study?
- Find consumers who have and have not adopted Netflix.
- Match the adopters with the non-adopters based on likelihood to adopt.
- Investigate how many (different types of) movies they consume.
- Investigate this with a difference-in-differences analysis to see the change in consumption due to the Netflix adoption.
Standard error of the coefficient
Measures how precisely the model estimates the coefficient’s unknown value.
The smaller the standard error, the more precise the estimate.
What can be concluded from a positive correlation?
That as the independent variable increases, the dependent variable also tends to increase.
p-value > significance level
p-value is higher than the significance level
It is not statistically significant. The influence of this independent variable could have been due to random chance.
There is insufficient evidence to conclude a relationship. We cannot reject the null hypothesis.
p-value < significance level
p-value is lower than the significance level
It is statistically significant.
There is enough/sufficient evidence that there is a relationship between the IV and the DV. We reject the null hypothesis.
What do the p-values indicate?
Whether the relationships are statistically significant.
If the relationship you observe in the sample also exists in the larger population.
What do coefficients describe?
The mathematical relationship between each independent variable (IV) and the dependent variable (DV).
Long-tail concept
A business strategy that allows companies to realise significant profits by selling low volumes of hard-to-find items to customers, instead of only selling low volumes of a reduced number of popular items.
Two factors that indirectly influence sales:
Consumer-brand usage (CBU)
Consumer-brand endorsement (CBE)
Propensity Score Matching
Quasi-experimental method in which the researcher uses techniques to construct an artificial control group by matching each treated unit with a non-treated unit of similar characteristics.
Difference supervised and unsupervised machine learning.
Supervised learning uses labeled input and output, while an unsupervised learning algorithm does not.
Unsupervised learning method
Machine learning algorithm that analyses and clusters unlabelled datasets.
These algorithms discover hidden patterns in data without the need for human intervention. Hence, they are unsupervised.
Supervised learning method
Machine learning approach that’s defined by its use of labeled datasets.
Using labelend inputs and outputs, the model can measure its accuracy and learn over time.
Machine learning methods can be subdivided into:
- Supervised methods
- Unsupervised methods