human and viral genetics Flashcards

Question

how are twin studies used to investigate heritability?

Answer 1

twin studies help estimate the relative genetic and environmental contributions to a complex trait, i.e. how much of the variation is down to genetics. You compare monozygotic twins, who are 100% genetically identical, with dizygotic twins, who share on average 50% of their alleles. Both kinds of twin pair are assumed to share their environment to the same degree, so any extra similarity between monozygotic twins is attributed to genetics.

Answer 2

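A = additive genetic variance (fully shared by identical twins, only half shared on average by non-identical twins)
C = common environment (shared within both kinds of twin pair)
E = specific (non-shared) environment (differs between the twins in any pair)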
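
One common way to turn the A, C and E components into numbers is Falconer's formula, which uses the trait correlation within monozygotic pairs (r_mz) and within dizygotic pairs (r_dz). This is a minimal sketch: the function name and the example correlations are hypothetical, chosen only to show the arithmetic.

```python
def falconer_ace(r_mz: float, r_dz: float) -> dict:
    """Estimate A, C and E variance shares from twin correlations."""
    a = 2 * (r_mz - r_dz)  # additive genetic share (heritability)
    c = 2 * r_dz - r_mz    # common environment share
    e = 1 - r_mz           # specific environment share
    return {"A": a, "C": c, "E": e}


# Hypothetical correlations, not taken from any real study.
estimates = falconer_ace(r_mz=0.8, r_dz=0.5)
print(estimates)
```

The three shares always sum to 1, which is a quick sanity check on any pair of correlations fed in.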

Answer 3

it's the idea that a complex disease can show a wide range of severity and symptoms: many loci contribute to the disease, and different combinations of risk alleles give rise to different phenotypes. An example is major depression, which has 44 significant risk loci; each person carries a different combination of these alleles, resulting in different phenotypes.

Answer 4

once the number of risk alleles a person carries passes a threshold, they get the disease: you either have it or you don't (the opposite of a continuous phenotype). An example is pyloric stenosis, which causes vomiting in infants and is more likely to affect males. This is not because the alleles are X-linked: females have a higher threshold, so an affected female must carry more risk alleles, and her relatives therefore have a far greater risk than the relatives of an affected male.

Answer 5

no, changes to non-coding regions can affect expression and regulation of the associated genes

Answer 6

a synonymous change doesn't alter the amino acid coded for; a non-synonymous change does, giving either a missense mutation (a different amino acid) or a nonsense mutation (a premature stop codon)
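
The distinction can be sketched as a lookup in the standard genetic code. The function name `classify` and the example codons are illustrative assumptions; the table is built from the standard code written in TCAG order, and the "stop-loss" label is an extra case added here for completeness.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify(ref_codon: str, alt_codon: str) -> str:
    """Label a single-codon substitution by its effect on the protein."""
    ref_aa, alt_aa = CODON_TABLE[ref_codon], CODON_TABLE[alt_codon]
    if ref_aa == alt_aa:
        return "synonymous"
    if alt_aa == "*":
        return "nonsense"   # non-synonymous: premature stop codon
    if ref_aa == "*":
        return "stop-loss"  # non-synonymous: a stop codon is removed
    return "missense"       # non-synonymous: different amino acid


for ref, alt in [("GAG", "GAA"), ("GAG", "GTG"), ("GAG", "TAG")]:
    print(ref, "->", alt, classify(ref, alt))
```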

Answer 7

conducts disease-specific and population-wide genetic studies, sequencing the exomes of unrelated individuals. It has records of 7.4 million mapped variants, includes the frequency of alleles in a population, and documents rare mutations.

Answer 8

they take a population and study those suffering from a disease alongside a large matched control group. A panel of SNPs is investigated to see whether the disease group has a higher frequency of particular alleles than the control group; a significant difference constitutes an 'association' with the disease. Many risk loci have been identified, with many still to be found.
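
For a single SNP, the case versus control comparison can be sketched as a chi-square test on a 2x2 table of allele counts. This is only an illustration of the idea: the counts are hypothetical, the function name is made up, and real GWAS pipelines typically use regression models with covariates rather than this bare test.

```python
import math


def allele_chi_square(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Chi-square statistic (1 degree of freedom) and p value for a 2x2 table."""
    n = case_alt + case_ref + ctrl_alt + ctrl_ref
    numerator = n * (case_alt * ctrl_ref - case_ref * ctrl_alt) ** 2
    denominator = (
        (case_alt + case_ref) * (ctrl_alt + ctrl_ref)
        * (case_alt + ctrl_alt) * (case_ref + ctrl_ref)
    )
    chi2 = numerator / denominator
    # Upper tail of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical allele counts: risk allele seen 300 times in cases, 200 in controls.
chi2, p = allele_chi_square(case_alt=300, case_ref=700, ctrl_alt=200, ctrl_ref=800)
print(f"chi2 = {chi2:.2f}, p = {p:.2e}")
```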

Answer 9

the SNP itself can increase the risk, OR the SNP correlates with the real risk allele due to 'linkage disequilibrium': the non-random association of alleles at different genomic sites. It depends on the distance between the alleles and the recombination rate; the alleles occur together more often than can be accounted for by chance because of their physical proximity on a chromosome.
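
"More often than can be accounted for by chance" can be made concrete with two standard measures: D, the gap between the observed haplotype frequency and the frequency expected if the alleles were independent, and r squared, a normalised version. A minimal sketch, with hypothetical frequencies and a made-up function name:

```python
def ld_stats(p_ab: float, p_a: float, p_b: float):
    """Return (D, r_squared) for allele A at one site and allele B at another.

    p_ab is the frequency of haplotypes carrying both A and B;
    p_a and p_b are the individual allele frequencies.
    """
    d = p_ab - p_a * p_b  # zero when the alleles are independent
    r_squared = d ** 2 / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, r_squared


# Hypothetical frequencies: A and B co-occur more often than 0.5 * 0.5 = 0.25.
d, r2 = ld_stats(p_ab=0.4, p_a=0.5, p_b=0.5)
print(d, r2)
```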

Answer 10

the genome is split into blocks based on proximity and patterns of linkage disequilibrium, i.e. regions of high linkage disequilibrium whose alleles are likely to be inherited together. A SNP that appears to be a risk allele might be flagging a different allele it is linked to that wasn't on the original GWAS panel, and haplotype blocks are useful in identifying this (the SNP used to find the real risk allele is the tag SNP). So SNPs narrow down the area of the genome to investigate: they may not be the risk allele themselves (though they can be) but might be close to a variant that is.

Answer 11

the odds ratio (OR)
OR = 1 means the allele and the disease are independent
OR > 1 means they are positively correlated
OR < 1 means they are negatively correlated
CDCV (common disease, common variant) - multiple alleles, each with a small OR (< 1.2), showing weak association with the disease phenotype
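
As a rough sketch, the odds ratio for one allele is the odds of carrying it among cases divided by the odds among controls. The counts and the function name below are hypothetical.

```python
def odds_ratio(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Odds of carrying the allele in cases divided by the odds in controls."""
    return (case_alt / case_ref) / (ctrl_alt / ctrl_ref)


# Hypothetical allele counts.
or_risk = odds_ratio(case_alt=300, case_ref=700, ctrl_alt=200, ctrl_ref=800)
or_null = odds_ratio(case_alt=250, case_ref=750, ctrl_alt=250, ctrl_ref=750)
print(or_risk, or_null)
```

With equal allele frequencies in both groups the ratio is exactly 1, matching the "independent" case above.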

Answer 12

stats are needed to differentiate between true and false positives: at p < 0.05, about 1 in 20 tests of a SNP with no real effect will look significant by chance. This means large groups and very strict cut-offs for p values are needed. Genome-wide significance must be attained: the p value must be < 5 x 10^-8.
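
The 5 x 10^-8 cut-off is conventionally explained as a Bonferroni correction: the usual 0.05 level divided by roughly one million independent tests across the genome. The sketch below assumes that figure of one million; it is a convention, not something stated on this card.

```python
ALPHA = 0.05         # conventional single-test significance level
N_TESTS = 1_000_000  # assumed number of independent common-variant tests

# False positives expected if every SNP were tested at p < 0.05 with no real effects.
expected_false_positives = ALPHA * N_TESTS

# Bonferroni-corrected threshold: divide alpha by the number of tests.
genome_wide_threshold = ALPHA / N_TESTS

print(expected_false_positives, genome_wide_threshold)
```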

Answer 13

a Manhattan plot, with the cut-off of 5 x 10^-8 drawn on for clarity; this is necessary as GWAS is very susceptible to false positives
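
A Manhattan plot shows -log10(p) for each SNP, so smaller p values give taller points and the cut-off becomes a horizontal line. A minimal sketch of that transformation, leaving out the plotting itself; the SNP names and p values are made up.

```python
import math

GENOME_WIDE_P = 5e-8
threshold_height = -math.log10(GENOME_WIDE_P)  # height of the cut-off line

# Hypothetical SNP p values.
p_values = {"snp_a": 3e-9, "snp_b": 2e-5, "snp_c": 0.4}

# Height of each point on the plot.
heights = {snp: -math.log10(p) for snp, p in p_values.items()}

# SNPs whose points rise above the cut-off line.
hits = [snp for snp, h in heights.items() if h > threshold_height]
print(round(threshold_height, 2), hits)
```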

Answer 14

The greatest risk allele is intronic, affecting a transcription factor required for pancreatic development. Another is intronic and influences body weight regulation. It's CDCV, so many 'novel' loci were identified, each with a correlating but low odds ratio. Of course, environmental factors also play a role.

Answer 15

lifetime risk is 8-12% in females, and risk increases if first-degree relatives suffer from the condition. This is an example of the intermediate model: rare coding mutations give a significant increase in risk, alongside other small contributing risk alleles (66 in this case, to be precise). BRCA1 and BRCA2 mutations are autosomal dominant and cause 5% of breast cancer cases (these genes were mapped by linkage analysis).

Answer 16

the idea that even though risk alleles have been identified for certain complex diseases, they only explain a certain percentage of the heritability of a disease, e.g. in Crohn's disease the 32 loci identified explain 20% of heritability

Answer 17

false negatives in a GWAS
rare variant alleles with an MAF of 1-5%
structural alteration of the genome
epigenetics
3D genome organisation
none of these are detected in a GWAS

Answer 18

1) an estimated 50% of conceptions are affected by chromosomal disorders: 8% of clinically recognised conceptions terminate as a result, but the estimate rises to 50% once conceptions that are not clinically recognised are included
2) 5% (chromosomal, mitochondrial, single gene, complex etc.)
3) 2 out of 3 diseases have a genetic component

Answer 19

aneuploidy, specifically trisomy of chromosome 21

Answer 20

2-10 homoplasmic (identical) copies of a circular 16.6 kb genome. However, mutations can produce heteroplasmic (mixed) copies that can cause mitochondrial disorders. These do not follow a Mendelian pattern of inheritance, as all mitochondria are inherited from the cytoplasm of the egg, i.e. maternally.

Answer 21

single: single gene; Mendelian pattern; individual conditions are rare, but collectively common; high penetrance - mutations are deterministic; tests are predictive
complex: risk alleles at multiple genes; no simple pattern of inheritance, but conditions run in families; conditions are common; mutations aren't deterministic; no reliable tests; influenced by environment

Answer 22

3.2 Gb (3.2 x 10^9 bp); 1.1% is coding; codes for 20,500 genes. Other species tend to have a higher proportion of coding sequence in their genome for the number of genes they code for.
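
A back-of-envelope calculation from the figures on this card shows how little sequence is coding. The per-gene average is only a rough illustration derived from those three numbers.

```python
GENOME_BP = 3.2e9        # total genome size in base pairs
CODING_FRACTION = 0.011  # 1.1% of the genome is coding
N_GENES = 20_500

coding_bp = GENOME_BP * CODING_FRACTION  # total coding sequence
coding_bp_per_gene = coding_bp / N_GENES  # average coding sequence per gene

print(f"{coding_bp / 1e6:.1f} Mb coding, about {coding_bp_per_gene:.0f} bp per gene")
```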

Answer 23

no, these non-coding regions still have a role, e.g. promoter regions and regulatory elements like enhancers; within genes there are introns as well

Answer 24

a lot of small fragments are sequenced, unlike Sanger sequencing
whole genome sequencing - loads of data, lots of time and money needed, though it can now be done in 24 hrs and costs about £1000
whole exome sequencing - only the protein-coding regions

Answer 25

in simple (single-gene) conditions, whole exome, as it is faster and cheaper and the mutations usually occur in protein-coding regions - from around 2011, conditions are almost always identified with NGS approaches. In complex conditions, whole genome sequencing, as more often than not the mutation occurs outside the coding region.

Answer 26

genetic elements that cannot replicate without a host, but can exist outside a host. They are very small, which is partly why they need a host cell: they cannot carry enough genetic material to replicate alone. Polio is around 28 nm, which is small; smallpox is around 200 nm, which is large. Genomes range from 0.5 kb to 1000 kb, stored as RNA or DNA, single- or double-stranded, linear or circular.

Answer 27

the virus, or virion when it’s outside the cell, must find a host that it recognises (can be only one species, can be many). Genomic material is injected into the host cell while the protein coat remains outside. The injected DNA/RNA enters the replication process.

Answer 28

its ability to infect C. difficile - a major issue in hospitals - in order to potentially engineer phage-like particles to attack C. difficile

Answer 29

lack of space - some viruses like TMV minimise the proteins they need to code for, e.g. only one protein for the coat/capsid, capable of self-assembly, so only one gene is required. This reduces the number of proteins needed, with less machinery required for assembly.

Answer 30

T4 phage as an example
1st, the viral DNA is read to make EARLY mRNA and early proteins such as nucleases, polymerases, sigma factors etc., which are involved in DNA replication and so are needed to make loads of the viral DNA
Switch to middle mRNA and middle proteins at around 7 min
Late mRNA = late proteins: capsid and structural proteins, and finally a lysozyme to break out
Switching to late proteins - T4 has an earlier-produced sigma factor for these late proteins
From start to finish, 25 minutes

Answer 31

to ensure this is the order it occurs in, sigma factors are used. A sigma factor works with RNA polymerase to bind to the promoter region and get transcription going. Host sigma factors are used for early proteins; some of these early proteins modify or bind to the host RNA polymerase, targeting the alpha subunit and altering its specificity so that it recognises the phage's middle promoters. The early protein MotA recognises a sequence in middle promoters to guide RNA polymerase. The phage codes for an anti-sigma factor to take out the host's sigma-70 and prevent host transcription. T4 has a sigma factor produced earlier for the late proteins.

Answer 32

lysogenic viruses, often termed temperate bacteriophages, can integrate into the host rather than escaping from it. They can end in typical lysis: lots of replication and protein production, then breaking out of the cell. Or they can go down the lysogeny route instead and maintain their genome in the host, either essentially as a plasmid or inserted into the host chromosome, where it is called a prophage and is replicated in sync with the host chromosome (though viral genes aren't really expressed). It is possible to switch from lysogeny to the lytic pathway via induction.

Answer 33

proteins that suppress the lytic pathway; if they get inactivated, induction occurs and the phage switches to the lytic pathway

Answer 34

The viral genome, in the case of lambda phage, needs the enzyme 'lambda integrase' to attach at the site 'att-lambda'. Viral DNA has sticky ends that come together to form a ring once in the host. A site-specific nuclease makes staggered cuts in the phage and host DNA so they can join, and a DNA ligase seals the gaps.

Answer 35

the rolling circle (also used for plasmids): DNA forms a loop once in the host. A nick is made in the outer strand, and the inner strand is used as a template while a new outer strand is made for it, displacing the original outer strand. Once complete and sealed with DNA ligase, the original outer strand can be used as a template to make another inner strand, forming two copies of the DNA (that can then be replicated again).

Answer 36

in eukaryotes transcription and translation are discrete processes, so a virus has to get its genetic material into the nucleus for transcription then back out for translation (usually)
eukaryotic viruses often have a membrane envelope they cannot make, so they must steal it from the host upon leaving
RNA genomes require a virus-encoded polymerase (an RNA-dependent RNA polymerase, or reverse transcriptase in retroviruses)
eukaryotic mRNA undergoes more processing than bacterial mRNA (splicing, 5' cap, polyA tail etc.)

Answer 37

(+) strand RNA virus, very small, so genome = ssRNA. This mimics eukaryotic mRNA, with a polyA tail and a fake cap, and even forms stem-loops. The host starts to translate it, forming a polyprotein that is cleaved into all the individual proteins required, including more protease for cleaving and its own RNA replicase to replicate the RNA in the cytoplasm. It inhibits host RNA and protein synthesis by destroying the host cap-binding protein.

Answer 38

sense describes how the genome relates to viral mRNA. A negative sense RNA virus has genomic RNA that is complementary to viral mRNA, so it can only act as the genome. A positive sense RNA virus has genomic RNA with the same sense as viral mRNA, so it can be translated into proteins straight away, acting as both genome and mRNA.
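
The relationship between the two senses can be sketched as a reverse complement: copying a (-) strand genome yields the (+) strand mRNA, and vice versa. The sequence and function name below are hypothetical, chosen only for illustration.

```python
# RNA base pairing: A with U, G with C.
COMPLEMENT = str.maketrans("AUGC", "UACG")


def reverse_complement(rna: str) -> str:
    """Return the complementary RNA strand, written 5' to 3'."""
    return rna.translate(COMPLEMENT)[::-1]


# Hypothetical (+) sense sequence: a start codon, one codon, a stop codon.
positive_mrna = "AUGGCUUAA"

# A (-) sense genome carries the complement and cannot be translated directly.
negative_genome = reverse_complement(positive_mrna)

# The viral polymerase copies the (-) genome back into translatable (+) mRNA.
recovered_mrna = reverse_complement(negative_genome)
print(negative_genome, recovered_mrna)
```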

Answer 39

eukaryotic hosts do not convert RNA to RNA

Answer 40

To replicate the genome, the virus must provide an RNA polymerase because the host cannot convert RNA to RNA. The virus must make the (+) strand from its parental negative one before it can copy that over and over (so the end result is loads of copies of the (-) strand). The same enzyme is used to make (+) mRNA from the parental (-) strand. From here translation occurs to produce viral proteins; upon assembly they steal some host membrane.

Answer 41

Similar to rabies, except the RNA is in pieces, not one long strand: there are 8 linear ssRNA molecules. Two membrane-bound proteins bind to sugars on host cells: neuraminidase and hemagglutinin. The virion carries the RNA polymerase but also an RNA endonuclease. First, the viral genomic RNA replicates in the host nucleus, using the RNA replicase as rabies does. In transcription, influenza mRNA does get a 5’ cap, but the virus doesn't make it: it steals capped 5’ ends from host mRNA using its viral endonuclease and uses them as primers. A polyA tail is added and the viral mRNA moves to the cytoplasm, just as host mRNA would.

Answer 42

antigenic drift = slight and gradual change in surface proteins like hemagglutinin and neuraminidase in influenza
antigenic shift = rapid and large change in surface protein genes, when different strains meet in the same host and their RNA segments get reassorted - the origin of pandemics and epidemics
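
To get a feel for why reassortment can produce such large changes, the sketch below counts the segment combinations possible when two strains co-infect one cell. It assumes each of the 8 genome segments is drawn independently from either parent, which is a simplification: not every combination is equally likely or viable. The strain labels "A" and "B" are placeholders.

```python
from itertools import product

N_SEGMENTS = 8        # influenza genome segments
PARENTS = ("A", "B")  # two hypothetical co-infecting strains

# Every way of picking each segment from one parent or the other.
genotypes = list(product(PARENTS, repeat=N_SEGMENTS))

# Exclude the two combinations that simply recreate a parental strain.
novel = [g for g in genotypes if len(set(g)) > 1]

print(len(genotypes), len(novel))
```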

Answer 43

2 identical (+) ssRNA copies, replicated through a cDNA intermediate, using reverse transcriptase to make this cDNA from the viral RNA. On the RNA, the gag region encodes structural proteins, pol encodes reverse transcriptase and integrase, and env encodes the envelope proteins of the membrane. Once you've got the dsDNA, it's integrated into the host cell DNA using the integrase enzyme the virus brought with it. From here it can just be transcribed with host DNA. Unless latent, promoters in the LTR region of the DNA cause transcription to produce capped and polyadenylated mRNA. Similar to polio, it produces polyproteins.

Answer 44

polyomaviruses and pox viruses (except pox viruses replicate in the host cell cytoplasm, not the nucleus); some polyomaviruses such as SV40 can induce tumours in animals

Answer 45

has a circle of dsDNA, replicated in both directions using host cell machinery; no viral enzymes are required (this is often used as a vector for moving genes into eukaryotic cells). It only encodes large and early proteins. Genes overlap, so only a short section of DNA is used to code for multiple proteins.

Answer 46

it is a (+) ssRNA virus that replicates in the cytoplasm. It causes respiratory infections in humans: about 15% of common colds, but can be fatal. Largest known RNA viruses. Glycoprotein spikes look like a crown, hence ‘corona’. The genome is (+) ssRNA and already has a 5’ cap, so it can act directly as an mRNA. The only gene translated at first is for a replicase. This replicase is used to produce a (-) strand of RNA. From here, the (-) strand is used to make monocistronic mRNAs for viral proteins, or to make many copies of (+) RNA.


(71 cards)