Genomics Flashcards
what is high throughput sequencing
technology that quickly generates large volumes of sequence data
quantitative and qualitative
what are sequence census assays
use quantitative data from high throughput sequencing to understand genome function
- RNA-seq for transcriptome
- PCR- seq for metagenomics
define genomics
the study of whole sets of genes, their products and their interactions
encompasses a bunch of -omics
define bioinformatics
application of computational methods of storage and analysis of bio data
define gene annotation
identifying protein encoding genes and their functions
T/F: majority of our DNA consists of repetative DNA
T
what % of human genes are conserved in other organisms
50%
what do genomes vary in
size, density, and number of genes
diff between euk and prok genomes
1) prok genomes have higher density
2) euk larger, more genes
what are the 3 kinds of variation
1) SNPs
2) Various types of chormosomal structural rearrangements
3) CNVs
explain SNPs
a single bp substitution at a particular site, must occur in pop at least > 1%
most do not have ptypes or cause disease
can be used as bio markers associated with disease causing genes
some directly causes disease
-ex FGFR3 gene in achondroplasia
can vary in freq
minor allele freq- freq at which least common allele occurs in a pop
common/freq variants >5%
rare variants < 1%
various chromosomal structural rearrangements
genetic rearranement between diff chromosome or same one
balanced translocation are harmless
unbalanced translocations have lots of health affects
CNVs
due to segmental duplication and deletion
loci w/ diff numbers of copies between indiv in pop; some indiv have >=1 copies of a particular gene rather than standard 2 copies
can be 50 bp to whole chromosome
may or may not cause phenotype or disease
cover large part of genome
4 types of repetitive DNA?
1) STR
2) TR
3) Interspersed repeats (DNA transposons and retretransposons)
4) segmental duplications (large blocks of 10,000-300,000 bp that have been copied to another region of genome)
STR
duplications of simple sets of 1-5 bp
directly involved in ptype variation