Statistics Flashcards by Aiden Dawes

Can you ever be sure about disproving a hypothesis?

No. You can never be completely sure, but you can become arbitrarily confident if the results are statistically significant.

What does it mean for a result to be statistically significant?

A result is called statistically significant if it would be unlikely to occur by chance alone when the null hypothesis is true. Conventionally this means the p-value is less than 0.05 (5%), although a different threshold can be chosen.

What is the p-value?

The p-value is the probability of obtaining results at least as extreme as those observed, assuming the null hypothesis is true.
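
As a rough illustration of this definition, the sketch below computes an exact one-sided p-value for a made-up coin-flip experiment. The coin, the counts, and the function name `binomial_p_value` are assumptions for illustration and are not part of the original cards. The null hypothesis here is that the coin is fair.

```python
from math import comb


def binomial_p_value(heads: int, flips: int, p_null: float = 0.5) -> float:
    """One-sided p-value: P(at least `heads` heads in `flips` flips) under the null."""
    return sum(
        comb(flips, k) * p_null**k * (1 - p_null) ** (flips - k)
        for k in range(heads, flips + 1)
    )


# Hypothetical data: 9 heads out of 10 flips of a supposedly fair coin.
p = binomial_p_value(9, 10)

# Compare against the conventional 5% threshold.
significant = p < 0.05
print(f"p = {p:.4f}, significant at 5%: {significant}")
```

A small p-value says the data would be surprising if the null hypothesis were true. It is not the probability that the null hypothesis is true.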

What is a null hypothesis?

A null hypothesis is the default position that is assumed to be true and that the test tries to reject. Functionally, it usually states that both data sets come from the same mechanism, whereas the alternative hypothesis (what we are trying to support) states that they differ.

How to prove/disprove something with stats

It is not possible to prove or disprove something with statistics.
You can only reject the null hypothesis, given statistically significant data.
Otherwise the test “didn’t find a statistically significant difference” and “fails to reject the null hypothesis”.

What is a research question?

A statement that identifies a phenomenon to be studied.
Ex: I believe that rewards improve memorization skills

What is a hypothesis?

A statement of the predicted relationship between at least two experimental variables.
A provisional answer to a research question
Ex: the group rewarded with chocolate will have a higher memorisation score than the group with no reward

Independent vs dependent variable

The independent variable is the one the experimenter deliberately changes. The dependent variable is the outcome being studied, which is expected to change whenever the independent variable is altered.

What is a controlled variable?

The variables that are **kept constant** so that they cannot influence the effect of the independent variable on the dependent variable. Ideally everything besides the dependent and independent variables is controlled.

What is a confounding variable?

An extraneous variable that correlates with both the dependent variable and the independent variable.
Example: Weather temperature correlates with both ice-cream sales and murders.

The goal of experimental design

Experimental design aims at maximizing your chances of finding the signal rather than the noise (noise being randomness, confounding variables, etc., which may produce correlation without causation).

Within vs. between subjects

Within = Every participant does every condition (everyone does A and B)
Between = Each participant does only one condition (some people do A, others do B)

Comparison of within vs. between experiments

Within pros:
+ Less variation from individual differences (the same people do every condition)
+ More statistical power with fewer participants

Between pros:
+ No biases from other conditions (e.g. transfer of learning from doing A before B)

What is counterbalancing?

A method of avoiding confounding among variables by presenting the conditions in a different order to different participants.

How is a Latin square used for counterbalancing?

A Latin square is an n × n array filled with n different Latin letters, each occurring exactly once in each row and exactly once in each column, where each letter corresponds to a treatment/condition and each row gives the order of conditions for one participant (or group). Varying the order in this way reduces confounding from order effects and transfer of learning.
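
One simple way to build such a square is to rotate the list of conditions by one position per row. The sketch below does this. The function name `latin_square` and the condition labels are illustrative assumptions, not something taken from the cards.

```python
def latin_square(conditions):
    """Cyclic Latin square: row i is the condition list rotated left by i places."""
    n = len(conditions)
    return [[conditions[(row + col) % n] for col in range(n)] for row in range(n)]


# Hypothetical study with four conditions; each row is one participant's order.
square = latin_square(["A", "B", "C", "D"])
for order in square:
    print(" ".join(order))
```

Each condition appears once in every position, so position effects are balanced. This cyclic construction does not balance which condition immediately precedes which, so carry-over effects may need a balanced Latin square instead.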

Sweet spot of number of trials.

Ideally as many trials as possible but 30 - 40 is the sweet spot.

What does a t-test measure?

The t-statistic tells us how many standard errors our observed difference is away from the value expected under the null hypothesis (usually zero difference).
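
A minimal sketch of that ratio for two independent groups follows. The cards do not say which variant of the t-test is meant, so this uses Welch's unequal-variance form as an assumption. The scores and the function name `welch_t` are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """t = (difference in sample means) / (standard error of that difference)."""
    standard_error = sqrt(
        variance(sample_a) / len(sample_a) + variance(sample_b) / len(sample_b)
    )
    return (mean(sample_a) - mean(sample_b)) / standard_error


# Hypothetical memorisation scores for two groups of five participants.
reward = [14, 16, 15, 17, 18]
no_reward = [12, 13, 11, 14, 10]

t = welch_t(reward, no_reward)
print(f"t = {t:.2f}")
```

The t value is then looked up against a t-distribution with the appropriate degrees of freedom to get a p-value.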

What is Bonferroni correction?

To reduce type I errors (false positives) when testing n hypotheses, test each one against 0.05/n. This is because when conducting n tests, the chance of at least one false positive grows with n, up to at most 0.05 * n.
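
The arithmetic can be checked with a short sketch. It assumes the n tests are independent and that every null hypothesis is true. Both function names are made up for illustration.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold after Bonferroni correction."""
    return alpha / n_tests


def familywise_error_rate(per_test_alpha: float, n_tests: int) -> float:
    """P(at least one false positive) over n independent tests with true nulls."""
    return 1 - (1 - per_test_alpha) ** n_tests


n = 10  # hypothetical number of hypotheses tested
uncorrected = familywise_error_rate(0.05, n)
corrected = familywise_error_rate(bonferroni_threshold(0.05, n), n)
print(f"uncorrected: {uncorrected:.3f}, corrected: {corrected:.3f}")
```

Without correction, the chance of at least one false positive across ten tests is far above 5%. With the corrected threshold it stays just below 5%.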

Use of a t-test

Comparing the means of two groups

What is an ANOVA test used for?

An ANOVA is an analysis of variance and is used to compare the means of more than two groups. Often an ANOVA test shows that there is a significant difference somewhere, and follow-up t-tests show where the difference is.
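
For a one-way ANOVA, the test statistic F is the ratio of the variance between group means to the variance within groups. The sketch below computes it by hand. The data and the function name `one_way_anova_f` are hypothetical.

```python
from statistics import mean


def one_way_anova_f(groups):
    """F = (between-group mean square) / (within-group mean square)."""
    all_values = [x for group in groups for x in group]
    grand_mean = mean(all_values)
    k, n_total = len(groups), len(all_values)

    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


# Hypothetical scores for three conditions, three participants each.
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat = one_way_anova_f(groups)
print(f"F = {f_stat:.2f}")
```

A large F suggests that at least one group mean differs, but it does not say which one. That is why follow-up pairwise tests are needed, ideally with a correction for multiple comparisons.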

What is regression?

A statistical technique (also widely used in machine learning) for determining the relationship between two or more variables, where a change in a dependent variable is associated with, and depends on, a change in one or more independent variables. In the simplest (linear) case it amounts to drawing the line that minimises the total squared distance to the data points.

Ways to determine the goodness of fit for a regression

Standard error
R squared
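
A minimal sketch of a least-squares line fit and these two measures follows. It assumes simple linear regression with one predictor, and it takes "standard error" to mean the residual standard error, sqrt(SS_res / (n − 2)). The cards do not spell out either point, and the data and function names are hypothetical.

```python
from math import sqrt
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    slope = s_xy / s_xx
    return slope, y_bar - slope * x_bar


def goodness_of_fit(xs, ys, slope, intercept):
    """Return (R squared, residual standard error) for the fitted line."""
    y_bar = mean(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - y_bar) ** 2 for y in ys)
    r_squared = 1 - ss_res / ss_tot          # share of variance explained
    std_error = sqrt(ss_res / (len(xs) - 2))  # typical size of a residual
    return r_squared, std_error


# Hypothetical data points.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

slope, intercept = fit_line(xs, ys)
r_squared, std_error = goodness_of_fit(xs, ys, slope, intercept)
print(f"y = {intercept:.2f} + {slope:.2f}x, R^2 = {r_squared:.2f}, SE = {std_error:.3f}")
```

R squared is unitless and often quoted as a percentage, whereas the standard error is in the units of the dependent variable.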

How to calculate standard error

Lower is better

How to calculate r squared

Percentage, higher is better.

What is a CHI-Square test used for?

It is used when both the independent variable and the dependent variable are categorical. More than two variables can be included if needed.

CHI-Square formula

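χ² = Σ (O − E)² / E, summed over all cells, where O is the observed count and E is the expected count in each cell.

Footnote: use a chi-square table to find p from χ².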

CHI-Square degrees of freedom (contingency table)

(number of rows−1)∗(number of columns−1)

How to operate a CHI-Square contingency table

1. Compute the sums in all directions (row totals, column totals, and the overall total).
2. For each cell, compute the expected count by multiplying that cell's row and column totals and dividing by the total sample size,
   e.g. case (sport, male) = (29 * 50) / 75
   e.g. case (family, male) = (46 * 50) / 75
...
4. Use the Chi-square formula.
5. Calculate the degrees of freedom as DF = (number of rows − 1) ∗ (number of columns − 1) (here = 1).
6. Use the Chi-square table to conclude!
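
The procedure can be sketched as follows. The cards only give the totals (29 sport, 46 family, 50 male, 75 overall), so the individual observed counts below are invented to match those totals and are purely hypothetical. The critical value 3.841 is the standard chi-square table entry for 1 degree of freedom at the 5% level.

```python
def chi_square(observed):
    """Return (chi-square statistic, degrees of freedom, expected counts)."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)

    # Expected count per cell = row total * column total / sample size.
    expected = [[r * c / total for c in col_totals] for r in row_totals]

    chi2 = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return chi2, df, expected


# Rows: sport, family. Columns: male, female.
# Hypothetical cell counts chosen to match the totals quoted in the cards.
observed = [[20, 9], [30, 16]]

chi2, df, expected = chi_square(observed)

CRITICAL_005_DF1 = 3.841  # chi-square table value for df = 1, alpha = 0.05
reject_null = chi2 > CRITICAL_005_DF1
print(f"chi2 = {chi2:.3f}, df = {df}, reject null: {reject_null}")
```

With these made-up counts the statistic is far below the critical value, so the test fails to reject the null hypothesis that the two categorical variables are independent.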

Statistics Flashcards

(28 cards)