Data science very basic Flashcards
1
Q
chi square test
A
A chi-square test checks whether observed categorical data are consistent with what a null hypothesis predicts. It compares the observed counts in each category with the counts you would expect if the null hypothesis were true; a large discrepancy is evidence against the null hypothesis.
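As a minimal sketch of the observed-versus-expected comparison, the statistic can be computed by hand as the sum of (observed - expected)^2 / expected over all categories. The coin-flip counts below are made-up illustration data, and the function name is hypothetical; 3.841 is the standard critical value for 1 degree of freedom at the 0.05 level.

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Made-up example: 100 coin flips, 60 heads and 40 tails.
# Null hypothesis: the coin is fair, so we expect 50 / 50.
observed = [60, 40]
expected = [50, 50]

stat = chi_square_statistic(observed, expected)

# Two categories give 1 degree of freedom; 3.841 is the critical value
# at the 0.05 significance level for 1 degree of freedom.
CRITICAL_0_05_DF1 = 3.841
reject_null = stat > CRITICAL_0_05_DF1

print(stat, reject_null)
```

In practice a statistics library would normally be used to get a p-value as well; the hand-rolled version is only meant to show what is being compared.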
2
Q
clean a conda environment of cache files
A
conda clean --all
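Use with caution: this removes the index cache, lock files, unused cached packages, tarballs, and log files from the shared package cache, not just from one environment. Add --dry-run to preview what would be deleted.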
3
Q
subcomponent to sentence phrase matching ngram technique
A
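Set the n-gram window to the number of tokens in the subcomponent: window = len(tokenize(subcomponent)).
Slide this window across the sentence's tokens, score each n-gram against the subcomponent, and keep the best-scoring phrase.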
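A minimal sketch of the sliding-window idea, assuming whitespace tokenization and character-level similarity from the standard library's difflib as the scoring function. Both choices are assumptions, the card does not say which tokenizer or similarity measure is intended, and the function name is hypothetical.

```python
from difflib import SequenceMatcher

def best_matching_phrase(subcomponent, sentence):
    """Return (phrase, score) for the n-gram in `sentence` closest to `subcomponent`."""
    sub_tokens = subcomponent.lower().split()   # assumed: whitespace tokenizer
    sent_tokens = sentence.split()
    window = len(sub_tokens)                    # n-gram size = token count of subcomponent
    if window == 0 or window > len(sent_tokens):
        return None, 0.0

    target = " ".join(sub_tokens)
    best_phrase, best_score = None, -1.0
    # Slide the window one token at a time across the sentence.
    for start in range(len(sent_tokens) - window + 1):
        candidate = " ".join(sent_tokens[start:start + window])
        score = SequenceMatcher(None, target, candidate.lower()).ratio()
        if score > best_score:                  # keep the first best-scoring window
            best_phrase, best_score = candidate, score
    return best_phrase, best_score

phrase, score = best_matching_phrase(
    "heart attack", "The patient had a heart attack last year"
)
print(phrase, score)
```

Any other similarity measure (token overlap, edit distance, embedding cosine) can be swapped in for the scoring line without changing the windowing logic.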
4
Q
Biomedical data noise filtering
A
Use NER (named entity recognition) packages and entity linkers to score sentences, and keep the ones that contain recognized, linkable biomedical entities.
Pfinder topic modeling project
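The filtering step can be sketched as follows. This is only an illustration: the tiny lexicon with made-up concept IDs is a hypothetical stand-in for a real NER model plus entity linker, and the function names and the min_entities threshold are assumptions, not part of the original card.

```python
import re

# Hypothetical stand-in for an NER model + entity linker:
# surface forms mapped to made-up concept IDs.
TOY_LEXICON = {
    "aspirin": "DRUG:0001",
    "myocardial infarction": "DISEASE:0002",
    "ibuprofen": "DRUG:0003",
}

def linked_entities(sentence):
    """Return the sorted concept IDs whose surface form appears in the sentence."""
    text = sentence.lower()
    return sorted(
        concept_id
        for term, concept_id in TOY_LEXICON.items()
        if re.search(r"\b" + re.escape(term) + r"\b", text)
    )

def filter_sentences(sentences, min_entities=1):
    """Keep sentences with at least `min_entities` linked entities."""
    kept = []
    for sentence in sentences:
        entities = linked_entities(sentence)
        if len(entities) >= min_entities:
            kept.append((sentence, entities))
    return kept

sentences = [
    "Aspirin reduced the risk of myocardial infarction.",
    "Click here to subscribe to our newsletter.",
    "Ibuprofen was given twice daily.",
]
kept = filter_sentences(sentences, min_entities=1)
for sentence, entities in kept:
    print(entities, sentence)
```

With a real pipeline, linked_entities would call the NER and linking components instead of the lexicon lookup; the filtering logic around it stays the same.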