Genomics Flashcards
Lesson 6
What has been the trend with cost of sequencing 1 human genome over the years?
It has dropped dramatically from $100,000,000 in 2001 to around $1,000 in 2021, with a massive drop-off in 2007-2010
What is high throughput sequencing?
It is qualitative and quantitative
How are assays used in sequencing?
Numerous assays that harness the quantitative power of sequencing technologies are used to understand genome function
Genomics
the study of whole sets of genes, their products, and their interactions
Bioinformatics
the application of computational methods to the storage and analysis of biological data
Gene annotation
the process of identifying all protein-coding genes in a genome sequence and ultimately their functions
What are some centralized resources for analyzing genome sequences?
NCBI, GenBank (database of sequences), BLAST (allows searching GenBank - the google of genomics)
Is all of our DNA necessary
No, most of the DNA in our genomes in junk!!!
What are the different types of variation of different genomes?
Single Nucleotide Polymorphisms (SNPs)
various types of chromosomal structural rearrangements (deletions, transversions etc)
Copy number variants (CNVs)
Repetitive DNA
What is a Single Nucleotide Polymorphism (SNP)
a substitution of a single nucleotide that occurs at a specific position in the genome; NOT a mutation (substitutions occur at an appreciable degree within a population)
What is Minor Allele Frequency (MAF)
the frequency at which the least common allele occurs in a given population
Common variants (MAF > 5%)
Rare variants (MAF < 1%)
What is Copy Number Variants (CNVs)?
Loci where some individuals have one or multiple copies of a particular gene or genetic region, rather than the standard two copies
Caused by duplications and deletions
Like SNPs, may or may not have phenotypes or cause disease
Where does CNVs stem from?
Many CNVs stem from segmental duplications (large blocks of 10,000 to 300,000 bp that have been copied to another region of the genome)
CNVs vs SNPs
Total number: 38000 vs 14 million
Size: 100-3000 bp vs 1 bp
Type: deletions, duplication, complex vs transition, transversion,
Effects on genes: gene dosage, interruption vs missense, nonsense, frameshift
Percentage of reference genome covered: 30% vs 1%
What are the four categories of repetitive DNA?
Short tandem repeats: Duplications of sample sets of DNA bases (1-5 bp)
Tandem repeats: Duplications found at the centromeres and telomeres of chromosomes that are 100-200bp long
Interspersed Repeats- DNA transposons and retrotransposons
Segmental Duplications