Genomes and Sequencing Flashcards

1
Q

What do you do before you can start to sequence?

A

Extract DNA and fragment it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is shotgun sequencing?

A

Splitting up DNA into fragments randomly to be sequenced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How is shotgun sequencing achieved?

A

Sonifaication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the best sequencing method to use in reality?

A

Illumina

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What type of PCR goes along with Illumina?

A

Bridge PCR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is Illumina the best to use?

A

454 (pyrosequencing) has a homopolymer problem i.e. can’t distinguish between strings of same nucleotide and can’t get volume
Ion torrent is mostly for specialised sequencing
Sanger is old
Nanopore not consistent, high error rate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why shouldn’t you discuss all of them?

A

Because that would make it too fragmented and it wouldn’t work in real life

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an amplicon?

A

A PCR product

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What should your target coverage be and why?

A

30x because above that you probably can’t improve the error rate anymore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What should your length be?

A

At least 15 - 20 base pairs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does Illumina work?

A

After PCR, fluorescently label all bases with different colour (reversible terminator bases) and synthesise complementary strand of DNA to already acquired strand. Bases compete to form a regular second strand of DNA by matching up with base pair, other 3 are washed away and the camera can detect the fluro colour so allowing us to work out the original base.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Outline your methodology for sequencing a genome

A

DNA extracted
Fragmented by sonification shotgun sequencing
Adaptors added to both ends of a fragment
Adaptors attach to inside of flow cell
Bridge PCR performed
After amplification, nucleotides labelled with own fluorescent colour
Bases compete to form regular complementary strand, opposite nucleotide binds and the other 3 are washed away
Observe colour under light to work out original base
Modified dNTP inhibits extension at 3’ end so only 1 base added at a time
Resulting contigs (overlapping bits of DNA) are scaffolded by filling in the gaps and linking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe the assembly process

A

Ordering sequenced DNA fragments into genome using paired end reads
Contigs are consensus read of fragments so they are grouped to create a contig
Contigs lined up and joined using paired end data resulting in scaffolds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is meant by coverage?

A

The number of times a section of the genome is represented in the sequenced fragment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is meant by identity?

A

The percentage of the reads that agree with the same nucleotide

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are identity and coverage used for?

A

To discount errors and differentiate between errors and heterozygosity. 50% or close indicates heterozygosity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is bin size?

A

When boundaries are set to distinguish between statistical variation and error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is genomics?

A

The study of all the genes of a cell or tissue at the DNA, mRNA or protein level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is comparitive genomics?

A

The study of the relationship of genome structure and function across different biological species or strains

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is functional genomics?

A

The study of gene and protein functions and interactions utilising the data produced by genome projects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the 2 methods of gene duplication?

A

Nonhomologous recombination and retrotransposition

22
Q

What is unequal crossing over?

A

Two non homologous DNA double helices align and undergo recombination which is greatly facilitated if the two strands already contain repeat units

23
Q

What are the 4 levels of duplication?

A

Exons duplicate/shuffle to change size of function of genes
Entire genes duplicate to make multigene families
Multigene families duplicate to produce gene superfamilies
Genomes duplicate to double number of copies of every gene and gene family

24
Q

What is a multigene family?

A

A set of genes descended by duplication and diversification from one ancestral gene
Can be tandem or dispersed

25
Q

What are pseudogenes?

A

The decomposing remnants of either failed duplication events, transposition genes or disabled genes
Unnecessary copies of genes

26
Q

What are tandem arrays?

A

The same sequence repeated over and over

27
Q

What is the crossover fixation model?

A

Duplication of a gene is likely to result in an immediate relaxation of selection on the new members of the gene family but there are instances where all of the duplicated genes retain the same sequence and function e.g. rRNA genes

28
Q

What are Hox genes?

A

A family of genes that specify the anterior/posterior axis and segment identity of metazoan organisms during early embryonic development which produce regulatory transcription factors

29
Q

How did Hox genes arise?

A

Gene duplication

30
Q

What is the immunoglobulin superfamily?

A

A large and functionally diverse group of proteins that share a common structural feature, the immunoglobulin fold
The fold can interact with other Ig folds to form dimeric molecules, display an enormous diversity of recognition sites and resist proteolysis in the blood

31
Q

What is site-specific recombination?

A

Rag1 and Rag2 enzymes catalyse reactions that result in a single V, D and J segment. They are combined and the others are excised. Different combinations lead to increased diversity

32
Q

What are transposable elements?

A

DNA segments that transpose themselves

33
Q

Describe what happens when transposition occurs to an already replicated recipient site

A

Transposition occurs to the opposite strand leaving a vacant donor site
Replication is completed
Vacant donor strand remains on one strand
No net increase in number of Ac elements

34
Q

Describe what happens when transposition occurs to an unreplicated recipient site

A

Transposition occurs to parental strand before replication fork
Completion of replication
Net increase in number of Ac elements

35
Q

Name the 4 classes of transposons

A

LINEs
SINEs
Long Terminal Repeat retrotransposons
DNA transposons

36
Q

What is simple transposition?

A

Cut and paste

Bacteria and eukarya

37
Q

What is replicative transposition?

A

Copy and paste
Uncommon
Bacteria

38
Q

What is retrotransposition?

A

Common in eukarya

39
Q

Where are direct repeats?

A

Found within the host DNA

40
Q

Where are inverted repeats?

A

At the ends of most transposable elements

41
Q

Describe the structure of a transposon

A

Inverted repeats at the ends

Encode transposase to recognise inverted repeats

42
Q

How does transposase work?

A

Cuts at the borders between the transposon and the adjacent genomic DNA and also helps the excised transposon integrate at a new site

43
Q

How are direct repeats created?

A

Transposase enzyme recognises inverted repeats and catalyses removal of the TE from it’s original site
Transposase brings together the inverted repeats
Transposase cleaves target DNA sequence at staggered recognition sites
Repeats created in the same direction and repeated at both ends of element therefore direct repeats

44
Q

How does replicative transposition work?

A

Transposase catalyses movement of DNA strand carrying the TE to a new recipient site
Gap repair synthesis at the previous and new sites produces two double stranded elements with circular molecules known as cointergrant
Resolvase separates cointergrant into two separate structures each containing a TE

45
Q

What is the selfish DNA hypothesis?

A

TE’s exist because they contain the characteristics that allow them to multiply within the host cell DNA therefore resembling parasites because they offer no selective advantage to the host

46
Q

What are some possible advantages to transposition?

A
Promote exon shuffling
Promote gene duplication
New gene combinations
New innovative gene functions
Success of evolutionary lineage
47
Q

How is methylation releated to transposition?

A

When DNA is methylated, transposition is inhibited meaning just after fertilisation is when most transposition occurs giving way to ‘spontaneous’ mutations

48
Q

Describe LINEs

A

1-5 kb in length
20 - 30k copies per genome
Probably the source of LTR transposons and retroviruses
Reverse transcriptase encoding region on the transcript is translated into an enzyme that preferentially associates with and uses the transcript it came from as a template to produce LINE cDNA
RT often stops before it has made a full length DNA copy of the RNA transcript. cDNA is incomplete and forms a second strand which is integrated back into the genome but is dormant

49
Q

Describe SINEs

A
Less than 500 bp length
Nonautonomous
Can disperse throughtout genome
Evolved from small cellular RNA species
Form hairpin loops, RT recognises loop as a primer for elongation
50
Q

How are human diseases related to transposons?

A

Retrotransposons can induce mutations by inserting near or within genes. A retrotransposon induced mutation is relatively stable because the sequence at the site of insertion is retained
As multiple copies of the elements build up the potential for unequal crossing over increases leading to disease