Big Data Flashcards
What is Big Data integration?
Combining Big Data techniques
What is an advantage of Big Data integration?
Big Data integration (combining Big Data techniques) reveals and refines insights into biological processes
What is a SNP?
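A single nucleotide polymorphism: a single-nucleotide variant, i.e. a position in the genome where individuals differ by one base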
What do Genome-Wide Association Studies (GWAS) identify?
GWAS identify SNP alleles that are found more frequently in patients (cases) than in healthy individuals (controls). These SNPs are presented on a Manhattan plot.
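A minimal sketch of the case/control comparison behind one point on a Manhattan plot, assuming a simple allele-count chi-square test. The function names and counts are hypothetical, chosen for illustration; real GWAS pipelines typically use regression models with covariates.

```python
import math

def allele_chi2(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Pearson chi-square test (1 degree of freedom) on a 2x2 allele count table.

    Rows are cases/controls, columns are alternative/reference allele counts.
    Assumes every row and column total is non-zero.
    """
    n = case_alt + case_ref + ctrl_alt + ctrl_ref
    # Shortcut formula for a 2x2 table: n * (ad - bc)^2 / (product of margins)
    num = n * (case_alt * ctrl_ref - case_ref * ctrl_alt) ** 2
    den = ((case_alt + case_ref) * (ctrl_alt + ctrl_ref)
           * (case_alt + ctrl_alt) * (case_ref + ctrl_ref))
    chi2 = num / den
    # Upper-tail probability of a chi-square distribution with 1 df
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

def manhattan_height(p_value):
    """Y-axis value used on a Manhattan plot: -log10 of the p-value."""
    return -math.log10(p_value)

# Made-up counts: the alternative allele is more common in cases than controls
chi2, p = allele_chi2(case_alt=60, case_ref=40, ctrl_alt=40, ctrl_ref=60)
print(chi2, p, manhattan_height(p))
```

The smaller the p-value, the taller the SNP appears on the plot, which is why strongly associated SNPs stand out as peaks.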
Why does a SNP not necessarily affect the closest gene?
The SNP could affect regulatory DNA that helps to regulate a more distant gene. That gene could even be on a different chromosome, so the SNP could be acting from a distance.
Why does the lead SNP itself not always cause the increased/decreased disease risk?
There are far more SNPs in the actual genome than the SNPs we have measured, so the SNP we have detected could just be close to the SNP that actually causes the altered disease risk. This is due to linkage disequilibrium.
What is linkage disequilibrium?
Alleles (e.g. SNPs or genes) that are close together on a chromosome tend to be inherited together.
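Linkage disequilibrium between two SNPs can be quantified from haplotype counts. The sketch below assumes two biallelic SNPs with alleles A/a and B/b and uses the standard D and r^2 measures; the function name and counts are hypothetical.

```python
def ld_stats(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, r^2) for two biallelic SNPs from the four haplotype counts.

    Assumes both SNPs are polymorphic in the sample (no allele frequency is 0 or 1).
    """
    n = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / n            # frequency of the A-B haplotype
    p_A = (n_AB + n_Ab) / n    # frequency of allele A at SNP 1
    p_B = (n_AB + n_aB) / n    # frequency of allele B at SNP 2
    # D: how far the observed haplotype frequency is from independence
    d = p_AB - p_A * p_B
    # r^2: D squared, scaled by the allele frequencies, ranging from 0 to 1
    r2 = d ** 2 / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return d, r2

# A always travels with B: complete linkage disequilibrium
print(ld_stats(50, 0, 0, 50))
# All four haplotypes equally common: no linkage disequilibrium
print(ld_stats(25, 25, 25, 25))
```

An r^2 near 1 means a measured SNP is a good stand-in for an unmeasured neighbour, which is why a lead SNP may only be marking the causal variant.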
What does Big Data usually comprise?
Large volumes of data, often comprising multiple data types, meaning substantial variation within the data
What does it mean that Big Data experiments tend to be hypothesis-generating instead of hypothesis-driven?
You look at the data and come up with a hypothesis afterwards, so you discover new things
What are the 2 types of “OMICs” Big Data collecting techniques?
Two types of OMICs: Genomics (DNA), transcriptomics (RNA)
What are the 3 types of Big Data collecting techniques?
1) OMICs
2) Microscopy
3) Human Physiology and health
What are 2 of the most common uses of Big Data in biology?
Epidemiology (identifies relationships between exposures/predispositions and disease risk) and disease pathobiology (understanding how interactions between exposures and predispositions affect health)
What are the 4 types of OMICs used?
Genomics, Transcriptomics, Proteomics, Epigenomics