Genomics Flashcards
Explain how a single sequence is turned into a genome (the steps)
1) Split the genome into manageable chunks
2) Work out which order they are in
3) Get the DNA sequence of each chunk
4) Put them together
Describe what ‘shotgun’ sequencing is
1) Fragment genomic DNA
2) Clone into sequencing vector
3) Pick colonies and sequence
Describe the steps of ‘shotgun’ sequencing for high-throughput (next-generation) sequencing
1) Fragment genomic DNA, select sizes
2) Ligate adaptor oligos
3) Hybridise to complementary oligos on sequencing plate
4) Amplify
5) Sequence
What is the HTS cycle?
1) A labelled nucleotide complementary to the template (amplified on the sequencing plate) is added to the growing chain 5’-3’
2) Detect label with a camera
3) Chemically cleave the label revealing the 3’ OH
What is meant by the term contig?
A ‘contiguous’ (continuous) consensus sequence from an assembly
What is meant by the term scaffold?
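A minimal sketch of how overlapping reads can be merged into a contig. It assumes error-free reads from one strand and uses a simple greedy merge; real assemblers use more robust graph-based methods and build a consensus from many reads. The names `overlap` and `greedy_assemble` and the example reads are hypothetical.

```python
def overlap(a, b, min_len):
    """Length of the longest suffix of a that equals a prefix of b.

    Returns 0 if the best overlap is shorter than min_len.
    """
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads, min_len=3):
    """Repeatedly merge the pair of reads with the longest overlap."""
    reads = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while True:
        best_n, best_i, best_j = 0, None, None
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        if best_n == 0:
            # No remaining overlaps: what is left are the contigs.
            return reads
        merged = reads[best_i] + reads[best_j][best_n:]
        reads = [r for k, r in enumerate(reads) if k not in (best_i, best_j)]
        reads.append(merged)


# Hypothetical reads sampled from the toy sequence ATGGCGTGCAATCC
toy_reads = ["ATGGCGTG", "GCGTGCAA", "TGCAATCC"]
print(greedy_assemble(toy_reads))
```

If two groups of reads share no overlap of at least `min_len` bases, the function returns more than one contig, which is the situation a scaffold then tries to resolve.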
A series of contigs where we have additional information to place them together in the right order and orientation but the sequence between the contigs is not complete
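A small sketch of the scaffold idea: contigs are placed in order and orientation, and the unsequenced stretch between them is written as a run of N characters (a common convention in assembly files). The function name `build_scaffold`, the tuple layout and the gap sizes are assumptions for illustration.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def build_scaffold(placements):
    """Join contigs into one scaffold string.

    placements is a list of (contig_sequence, orientation, gap_after)
    tuples, where orientation is "+" or "-" and gap_after is the
    estimated number of unknown bases before the next contig.
    """
    parts = []
    for seq, orientation, gap_after in placements:
        parts.append(seq if orientation == "+" else reverse_complement(seq))
        parts.append("N" * gap_after)  # unknown sequence between contigs
    return "".join(parts)


# Two hypothetical contigs: the second is placed in reverse orientation.
scaffold = build_scaffold([("ACGT", "+", 3), ("GGAT", "-", 0)])
print(scaffold)
```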
What is meant by the term assembly?
The set of scaffolds for one genome
What is meant by the term N50?
The contig/scaffold length such that 50% of the assembled bases are in contigs/scaffolds of that length or longer
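A minimal sketch of an N50 calculation, assuming the assembly is given simply as a list of contig or scaffold lengths. The function name `n50` and the example lengths are illustrative, not taken from any particular tool.

```python
def n50(lengths):
    """Return the N50 of a list of contig/scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop once at least half of all assembled bases are included.
        if running * 2 >= total:
            return length


contig_lengths = [80, 70, 50, 40, 30, 20]  # hypothetical lengths in kb
print(n50(contig_lengths))  # 80 + 70 = 150 is at least half of 290, so 70
```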
Describe protein coding genes
Generally not repetitive, but there are some exceptions, e.g. filaggrin and high copy number genes
What are the repetitive regions?
Microsatellites
Telomeres
Intron sequences
What are transposons?
Mobile genetic elements: sequences of a few kb that can move about the genome. Eukaryotic genomes can contain thousands of copies
Describe what the read length is
The number of bases in a single read. A single read cannot span a repetitive region that is longer than the read length, which prevents long contigs from forming. The longer the read length, the larger the repeat region that can be assembled
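A toy illustration of why read length matters for repeats. It counts how many read start positions give a read that also occurs elsewhere in the genome, so its placement is ambiguous. The genome string, its 6-base repeat and the function name `ambiguous_positions` are made up for the example.

```python
from collections import Counter


def ambiguous_positions(genome, read_length):
    """Count start positions whose read occurs more than once in the genome."""
    reads = [genome[i:i + read_length]
             for i in range(len(genome) - read_length + 1)]
    counts = Counter(reads)
    return sum(1 for read in reads if counts[read] > 1)


# Hypothetical genome: two copies of the repeat TTAGGC in unique sequence.
genome = "ACGT" + "TTAGGC" + "CATG" + "TTAGGC" + "GACT"

for length in (4, 6, 7):
    print(length, ambiguous_positions(genome, length))
```

Reads shorter than or equal to the repeat can fall entirely inside it and match both copies; once reads are longer than the repeat, every read includes some unique flanking sequence and can be placed.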
Describe read depth/coverage
The average number of times each base in the final assembly appears in the reads. A coverage of 10x means that each base is, on average, found in 10 reads. The deeper the coverage, the more clearly genuine sequence or structural changes can be distinguished from sequencing errors
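A short sketch of the coverage arithmetic, assuming the usual estimate of average coverage as total sequenced bases divided by genome size. The function names, read counts and genome size are hypothetical.

```python
def mean_coverage(num_reads, read_length, genome_size):
    """Expected average coverage: total sequenced bases / genome size."""
    return num_reads * read_length / genome_size


def per_base_depth(genome_size, read_starts, read_length):
    """Count how many reads cover each position of a toy genome."""
    depth = [0] * genome_size
    for start in read_starts:
        # A read covers positions start .. start + read_length - 1.
        for pos in range(start, min(start + read_length, genome_size)):
            depth[pos] += 1
    return depth


# One million 150-base reads over a 15 Mb genome give 10x on average.
print(mean_coverage(1_000_000, 150, 15_000_000))

# Per-base depth is uneven even when the average looks adequate.
depth = per_base_depth(10, [0, 2, 4], 4)
print(depth, sum(depth) / len(depth))
```

The second example shows that an average hides positions with little or no coverage, which is where errors and true variants are hardest to tell apart.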
Describe what ploidy is
The number of copies of the genome in the organism