Genome sequencing + DNA sequence variants Flashcards
How are types of DNA variants described with respect to?
mRNA sequence
Proteins amino acid sequence
Reading frame
Exon and intron boundaries.
Whats exon
Regions that code for protein
Will be part of the final mature RNA produced
Whats Intron
Non-coding
Removed by RNA splicing during maturation of the final RNA product
Start codon
Triplet of nucleotides which serves as the initiation point of transcription of the gene.
This is always an ATG sequence,
WHats splicing?
Removal of introns from mRNA molecule
Whats a codon
3 nucleotides together (codon) code for an amino acid.
How does a reading frame start and end?
The Start ATG codon begins the reading frame = the sequence of codons required for correct amino acid sequence for the protein
Stop codon ends the reading frame
State the types of DNA variants
STOP and START variants
Missense
Nonsense
Synonymous – does not alter amino acid
Splice site
duplications/deletions (frameshift)
May be single nucleotide change
May be multiple nucleotide changes
Stop and start variants
What happens if variant in start codon?
WHat happens if variant in stop codon?
What does this produce?
DNA changes occuring in the start and stop codons.
If there is a variant in the start codon, in this examples and G to C change, transcription will not be initiated and there will be no protein product.
If there is a variant in the Stop codon, transcription will continue into non-coding DNA 5’ of the gene.
This will produce a protein with amino acids at the 5’ end that may interfere with structure and function and is likely disease causing.
Missense variant
Give example
Cause a change in the amino acid that the DNA codon codes for.
Example, a G in a GGC codon is change to a A, producing a GAC codon
This causes a change of the glycine amino acid for an Aspartic acid.
This may or may not be pathogenic,
Nonsense
DNA variants that alter the amino acid that the codon codes for.
Instead of a different amino acid, it can encode a stop codon - introducing premature stop codon.
Translation stops prematuraely
Shortened/incomplete protein.
Protein may lose function
Nonsense mediated decay
Deletion variants
Loss of nucleotides - range from one to whole gene
Example - 2 nucleotides from a GTA codon are deleted result in altered reading frame. - frameshift
Frameshift often result in truncated protein
Any varient that causes a fraemshift is almost always disease causing
Duplication variant
The addition of nucleotides alters the reading frame, causing a frameshift and an altered amino acid sequence. (new amino acid sequence)
And like deletions a premature stop codon often causes premature truncation of the protein.
Example - an additional T nucleotide is added.
You can see the triplet GTA is changed to a GTTA, therefore the reading frame is disrupted.
The GTA triplet is replaced by a GTT triplet follow by an ACC, then CGC.
Amino acid sequence altered compared to WT
RNA splicing
Splicing, the removal of introns from the mRNA, is controlled by the splices sites that flank the intron.
The Acceptor splice site, is at the 5’ end of the intron and is always an A G dinucleotide.
The donor splice site, is at the 3’ end of the intron and is always a G T dinucleotide.
The splicing machinery identifies the donor splice site and begins cutting out the DNA from that point on.
This will only stop once the splice machinery identifies the acceptor splice site which is then follow by the exonic sequence.
The resulting mRNA strand contains sequences only derived from exons.
Note that in RNA, uracil replaces thymine.
Donor splice site variants
Mutations that occur in introns
Example, the G of the GT donor splice is changed to an A.
This will mean that the splice machinery will not recognise the donor splice site.
Therefore it will not begin the removal of the intronic DNA.
The resulting mRNA will include the entire intron.
This will alter the proteins structure and possibly function.
It can also cause an alteration in the reading frame, causing a frameshift and as discussed with deletions and duplications leading to an inevitable prematurely truncated protein.
Such splice changes are often disease causing.
Alternative splicing of genes
This process uses different splice sites within one gene to generate related but different protein products.
i.e one gene can produce a variety of different protein products dependent on which splice sites are used.
Acceptor splice site varients
A mutation in the AG of an acceptor splice site has a similarly damaging effect.
However, rather than including the intronic DNA, the result it is that the exon is excluded.
The splice machinery recognises the donor splice and beings removed nucleotides,
But as the acceptor splice site is no longer recognised, it does not stop once it reaches the exon and continues to remove all DNA until the next active acceptor site is reached.
Removing an entire exon would be most likely disease causing.
The exon may encode vital parts of the protein, such as active sites, DNA or protein binding sites.
Why is DNA variant Nomenclature important?
To describe variant so that others can understand
Share genetic knowledge
Chaos otherwise!
Similar to ISCN for chromosome karyotype description
What p.Gly4Asp
Aminoacid position 4
Amino acid change Glycine to Aspartic
c.11G>A
I misssense change
a G to A change at position 11.
11 nucleotides from the A of the ATG start codon
> indicates a chnage of nucleotide, therefore G>A