Week 6: MCC-ROI analysis Flashcards
Definition of the p-value
- P(T >= t | H0); the probability of finding a result as extreme or more extreme than the observed statistic
- Expresses the surprise of observing the data if the null hypothesis was actually true
- Can only be used to refute H0 and doesn’t provide evidence for the truth of the H0
Power
- The chance that a testing procedure correctly rejects H0 when there is a true effect; percent of TP that we will detect; one minus the Type II error rate
- Varies as a function of: 1) size of the true effect; 2) the efficiency of the statistical procedure; 3) and the sample size
Type I error
the probability of rejecting the null hypothesis given that it is true. The test is designed to keep the type I error rate below a prespecified bound called the significance level, usually denoted by the Greek letter α
Type II error
the probability of failing to reject the null hypothesis when it is actually false.
Expected number of false positives given the H0
e.g., if our alpha level is set to 0.05, then we are implying that it is acceptable to have a 5% probability of incorrectly rejecting the true null hypothesis
(think of the normal distribution of thr H0 and of the little tail on the right, that means that we are taking that amount of risk of committing a type I error > same reasoning for type II error, but think of Ha distribution)
High sensitivity leads to…
…few false negatives
Low specificity leads to…
…many false positives
Levels of inference
- voxel level
- cluster level
- peak level
- set level
Voxel level
testing each individual voxel for significance and retaining the ones above a certain threshold (think of y-axis threshold); gives best spatial specificity IF the threshold is picked correctly; we can say something about a specific voxel.
Cluster level
- takes into account the spatial information available in the images, by finding connected clusters of activated voxels and testing the significance of each
- has two stages: 1) defining clusters given a width and 2) retaining those clusters according to another threshold
Why would we generally expect the fMRI signal to be spatially extended?
Because 1) the brain regions that are activated in fMRI are often much larger than the size of a single voxel and 2) fMRI data are often spatially smoothed, which results in a spreading of the signal across many voxels in the image
Using the cluster level inference gives better…, but also worse spatial … . Picking a low cluster retaining threshold will cause a …, while picking a very high threshold will cause …
sensitivity;specificity;very large cluster that encompasses most of the brain;way less and smaller clusters
When a 1,000 voxel cluster is marked as statistically significant, all we can conclude is that…
…one or more voxels within that cluster have evidence against the null. This is not a problem when cluster sizes are small, but it is if on the contrary we have a big cluster size; the only solution is to increase the retaining threshold, but this is not scientifically sound
Peak level
similar to cluster level, so first we define the clusters (e.g., given their width) but then here we retain those clusters that go above a certain peak
Set level
asks “is there any significant activation anywhere”?; it tests whether anyhwhere across the whole brain there is any activation, but cannot localise the activation; it is an omnibus test; has no localizing power whatsoever
Types of error rates (+ error rate definition)
Error rate = a measure of the degree of prediction error of a model made with respect to the true model.
* per comparison error rate (PCER)
* family-wise error rate (FWER)
* false discovery rate (FDR)