Questions based on my summaries and notes Flashcards
What is a type 1 error?
False positive, falsely rejecting the null hypothesis
What is a type 2 error?
False negative, false accepting the null hypothesis
What does the power say?
1-B(beta)=Power of the test, the chance we got a true positive. B stands for the false positive when we should have rejected the alternative hypothesis and we didn’t. If B is small the power is high and that means we likely got a true positive.
What is a parametric test and when can it be used?
For example Chi2 test, t-test and regression analysis. It can be used when the data is normally distributed and that the data is homogenous
What is a chi2 test and when can it be used?
There are 2 different types of Chi2 test, the goodness of fit test and test of independance. For Chi2 test of independance:
H0=The two variables are independent
H1=The two variables are not independent(they are associated with each other)
This could for example be used to test whether two genes are linked or unlinked by looking at the frequency distribution of potential phenotypes.
What is a t test and what does it analyze?
There are two types of t-test, one sample and two sample. The basis of a t-test is analyzing the mean. For example a two sample t-test you can analyze the gene expression mean between a control group and a patient group and see if they have the same mean for gene expression. For a one sample t-test you can analyze the the patient group has x as a mean, you compare it to a set value.
What is a regression analysis and what does it analyze?
Simple linear regression is a statistical method you can use to understand the relationship between two variables, x and y.
One variable, x, is known as the predictor variable.
The other variable, y, is known as the response variable.
For example weight and height. You put the values of height and weight for a set of individuals into a scatterplot and find a regression line that fits best with these values. This regression line can then be used to predict the weight or height of a certain individual.
One way to measure how well the least squares regression line “fits” the data is using the coefficient of determination, denoted as R2. It states between 0-1 that will show to what percentage this model can explain the data. For example 0.77 explains 77% of the data with this model.
What are some examples of non parametric analysis and when can it be used?
Mann Whitney U test, spearman etc. This can be used when the data is not normally distributed.
What does the mann whitney u test analyze and when can it be used?
For small populations that are not normally distriobuted.
H0=The population are equal
H1=The population is not equal
This can for example be used to analyze if two different patient groups on different diets lose the same amount of weight or if they lose a different amount of weight.
What is the difference between spearman and pearsson?
Both are correlation analysis, to see if two values are correlated or not. They both use a scatterplot to make a line but Pearson can be used for linear data and Spearman when there are extreme outliers or ranked data.
H0=There is no correlation
H1=There is correlation
Pearson: -1 there is perfect negative correlation, 0 there is no correlation at all, 1 there is perfect positive correlation
What are some exmaples of descriptive analysis and what does it do?
It summarizes and bisualize the data for example scatteprlot and PCA plot
What are a bivariate analysis and what are some examples?
It shows two variables relation to each other, for example Chi2 test and regression analysis
What is multivariate analyzes?
It shows multiple varibales in relation to each other for example PCA and cluster analyzis
When are generalized linear models used?
When there is no normal distribution or the data is nonlinear
What is E-value and score significance and what is acceptable values for these?
E-value shows the probability that we got the match by random chance.- A E value below 0.01 is considered good for homologous and below 1e-50 is a very good fit. Score significance shows how trustable the match is.