Understanding our genome Flashcards

1
Q

What is the difference between a genomic and a cDNA library?

A

A genomic library contains DNA fragments that represent the entire genome of an organism, whereas a cDNA library includes clones that correspond to the mRNA sequences from an organism or from specific cells of an organism.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the function of oligo dT chromatography?

A

to separate mRNA from the other RNA in the cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What percentage of RNA in a eukaryotic cell is rRNA?

A

~ 90%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What percentage of RNA in a eukaryotic cell is tRNA?

A

~ 6%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What percentage of RNA in a eukaryotic cell is mRNA?

A

~ 2-4%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is found at the 3’ end of eukaryotic mRNAs?

A

a poly A tail (added after the mRNA is formed)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How does oligo dT chromatography work?

A
  • oligo dT affinity column
  • mRNA A tail hydrogen bonds to oligo dT
  • rRNA and tRNA cannot bind to the column
  • mRNA is eluted using high salt to break A=T bonds
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is double-stranded cDNA formed from mRNA?

A
  • mRNA is copied to cDNA using reverse transcriptase, dNTPs, oligo dT primer
  • RNA phosphodiester bond cleavage by ribonuclease H
  • RNA is replaced by DNA by DNA polymerase
  • DNA ligase is used to repair the phosphodiester backbone
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the ‘H’ in ribonuclease H stand for?

A

hybrid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

In sequencing of the human genome, why is only a small amount of enzyme used?

A

so that the genome is not cut at every restriction site

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a BAC?

A

Bacterial artificial chromosome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are BACs used for?

A

to create libraries with large fragments (insert size ~ 100 000 bp)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Outline the method of using BACs

A
  • white blood cells are mixed with agarose and placed in a mould
  • cell wall is ruptured in the agarose
  • restriction enzyme is added to digest DNA in the agarose mold
  • each mould is placed in a well of agarose gel
  • gel is run and viewed under UV light and DNA of 100 000 bp is excised from the gel
  • DNA is eluted from the excised agarose
  • DNA is ligated to a plasmid vector excised with the same restriction enzyme
  • treated with DNA ligase
  • bacteria are transformed
  • transformed bacteria are picked into 384-well plates
  • bacterial DNA is isolated for sequencing reactions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why are white blood cells used?

A
  • easy to take a blood sample (non-invasive)

- no associated moral concerns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the function of agarose?

A

to protect the BACs from mechanical shear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the size of the mitochondrial genome?

A

16.6 kb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How many genes does the mitochondrial genome encode?

A

37

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What percentage of the cell’s DNA is made up of mitochondrial DNA?

A

up to 0.5% due to the hundreds of mitochondrial genomes found in the cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What genes does the mitochondrial genome encode?

A

2 rRNA genes
22 tRNA genes
13 polypeptide-encoding genes for oxidative phosphorylation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the H strand?

A

the heavy stand; G-rich

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the L strand?

A

the light strand; C-rich

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How many genes does the H strand encode?

A

28

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How many genes does the L strand encode?

A

9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Describe some features of the mitochondrial genome

A
  • circular genome
  • genes contain no introns
  • genes do not overlap
  • the whole strand is transcribed and then cleaved
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

How much of the genome encodes RNA?

A

10%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What types of RNA are encoded?

A

mRNA, rRNA, tRNA, snRNA, snoRNA, other RNAs eg. telomere RNA, micro RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is snRNA?

A

small nuclear RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is snoRNA?

A

small nucleolar RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Which protein-coding genes have no introns?

A

tRNA, histones, α-interferons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is the advantage of histones having no introns?

A

During the S-phase of the cell cycle, a vast quantity of histones is needed for the formation of the newly synthesis chromatin. The intronless organisation of histone genes may facilitate a highly efficient organisation of histone synthesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is the longest human gene?

A

dystrophin (2.6 kb)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

How long does it take to transcribe dystrophin?

A

16 hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is the implication of the long transcription time of dystrophin?

A

There is a large amount of time in which mutation can occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What percentage of dystrophin DNA is protein-coding?

A

0.6%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Which genes is the globin family comprised of?

A

alpha globin gene cluster on chromosome 16
(embryo, fetus and adult)
beta globin gene cluster on chromosome 11
(embryo, fetus, adult)

36
Q

How do gene families arise?

A

due to gene duplication

37
Q

Why is the number of genes important?

A

in order that the right amount of protein is synthesised

38
Q

Give an example of the importance of gene number

A
  • α-thalassemia is caused by a deficiency of α-globin genes
  • the alpha globin cluster is found on chromosome 16 and encodes two alpha globins for fetus and adult
  • this makes a total of four alpha globins for fetus and adult in each cell, since an individual inherits one copy from each parent
    3α = α-thalassemia trait
    2α = mild anaemia
    0α = hydrops fetalis
39
Q

Why does the baby die at birth?

A

The intact genes for embryonic alpha globin means that the embryo can survive in the womb, but the baby dies soon after birth

40
Q

How many histone clusters are there in humans?

A

11 cluster

41
Q

How many histone genes are there in humans?

A

60 genes

42
Q

Over how many chromosomes are the histone genes spread in humans?

A

over 7 chromosomes

43
Q

What is the relationship between members of a gene family?

A

each member encodes the identical protein (highly conserved)

44
Q

What would happen without histones?

A

Without histones, DNA could not be compacted into the cell nucleus and would not fit into the cell.The compacted molecule is 40 000 times shorter than the unpacked molecule.

45
Q

What percentage of the genome is made up of non-coding DNA?

A

90%

46
Q

What does ‘non-coding’ mean?

A

does not code for RNA (except micro RNA)

47
Q

What types of non-coding DNA exist in the genome?

A
  • introns, regulatory regions
  • pseudogenes - redundant, produced as part of the evolution of genes
  • gene fragments - also produced as part of the evolutionary process, micro RNAs - regulate gene expression
48
Q

Give an example of a pseudogene

A

two Ψ genes on the alpha globin cluster on chromosome 16

one Ψ gene on the beta globin cluster on chromsome 11

49
Q

Why are pseudogenes non-coding?

A

Pseudogenes are genes that have picked up mutation and lost their function, since they are now unable to bind RNA polymerase to be transcribed.

50
Q

What are processed pseudogenes?

A
  • DNA underwent transcription, splicing and polyadenylation to form mRNA
  • mRNA was converted back into DNA by reverse transcription
  • the DNA was re-integrated into the host genome to form a pseudogene
  • due to viral infection during evolutionary history
51
Q

Where are processed pseudogenes found?

A

often found on a different chromosome to the functional gene

52
Q

What is a transposable element?

A

also known as a transposon; a DNA sequence that can change its position within the genome

53
Q

What are Alu elements?

A
  • the most abundant transposable elements in the human genome
  • primate-specific
  • do not occur in exon sequence
54
Q

How many Alu elements are estimated to be in interspersed throughout the human genome?

A

over one million

55
Q

What percentage of the human genome is estimated to consist of Alu elements?

A

10.7%

56
Q

How big are Alu elements?

A

~ 280-300 bp in length

57
Q

How can Alu elements be detected?

A

by digestion with restriction endonuclease Alu1

one main resulting band = one prevalent sequence

58
Q

What is the relationship between Alu elements and mutation?

A

Alu elements are a common source of mutation in humans, but these are often confined to non-coding regions

59
Q

What is the structure of an Alu element?

A
  • identical target site duplication (TSD) sites on either side of the Alu dimer
  • dimer comprised of two similar but distinct monomers (left and right arms) joined by an A-rich linker
60
Q

How is the structure of the Alu element believed to have come about?

A

dimer emerged from the fusion of two distinct monomers over 100 million years ago

61
Q

How long is the polyA tail?

A

the length of the polyA tail varies between Alu families

62
Q

Where are Alu elements thought to be derived from?

A

Alu elements are thought to be derived from the small cytoplasmic 7SL RNA (the signal recognition particle RNA), a universally conserved ribonucleoprotein that directs the traffic of proteins within the cell and allows them to be secreted.

63
Q

Is the Alu repeat a SINE or a LINE?

A

SINE

64
Q

What does SINE stand for?

A

short interspersed nuclear element

65
Q

What does LINE stand for?

A

long interspersed nuclear element

66
Q

What does TSD stand for?

A

target site duplication

67
Q

Why are the TSD sequences identical?

A
  • the DNA was originally circular, with the TSD sequences hydrogen bonded to one another
  • circular sequence cut with restriction enzyme and overhang blunted by the addition of free nucleotides to form two identical sequences, one on each end
68
Q

What percentage of our genome is made up of LINEs?

A

17-20%

69
Q

What is the structure of a typical LINE?

A

consists of two non-overlapping open reading frames (ORF), which are flanked by UTR and target site duplications

70
Q

What is encoded by the first open reading frame?

A

a RNA-binding protein of 500 amino acids that functions as a chaperone

71
Q

What is encoded by the second open reading frame?

A

a protein-complex that has endonuclease and reverse transcriptase activity

72
Q

How do LINEs promote their own transcription?

A
  • promoter for transcription
  • reverse transcriptase to copy RNA into DNA
  • endonuclease to cleave target DNA
  • RNase H for RNA removal (can digest RNA hydrogen bonded to DNA)
73
Q

Outline the insertion of LINEs into the genome

A
  • transcription of LINE mRNA
  • translation of ORF1 and ORF2 proteins, which bind to LINE mRNA at the polyA tail
  • mRNA binds to target DNA at AT-rich sequences to form RNA-DNA hybrid
  • LINE endonuclease cleaves target DNA
  • LINE reverse transcriptase copies LINE mRNA into cDNA
  • LINE-encoded RNase H degrades the RNA strand
  • second-strand synthesis and repair occurs
74
Q

What happens to LINEs over time?

A
  • LINEs shorten as they ‘age’
  • most are truncated at the 5’ end to remove the promoter
  • RNA polymerase cannot bind and truncated LINEs cannot be transcribed
75
Q

What are some consequences of LINEs being inserted into the human genome?

A
  • disruption of gene transcription
  • insertion into a promoter can silence a gene
  • insertion into an intron can slow down transcription
76
Q

What is the consequence of LINEs being inserted into an intron?

A
  • the slowed rate of transcription enables a higher frequency of mutation
  • proteins may be synthesised too slowly to meet the demands of the cell
77
Q

How does LINE insertion differ in humans?

A

in number and position in the genome

78
Q

What is the frequency of LINEs in somatic cells and in the germline?

A

LINEs are rare in somatic cells and more abundant in the germline

79
Q

How does the cell protect against LINEs?

A

heavy methylation of LINEs to silence them by preventing the binding of RNA polymerase and other proteins required for transcription

80
Q

What percentage of cells in our body are microbes?

A

90%

81
Q

What percentage of functional genes in our body are microbial?

A

99%

82
Q

How do commensal microbes help the body?

A
  • form us eg. help immune system development
  • feed us eg. process food, provide vitamins
  • protect us eg. fight undesirable pathogenic bacteria
83
Q

What genetic information is targeted in sequencing the microbiome?

A

16S rRNA

  • must specifically target bacterial RNA
  • small subunit of the ribosome
  • common to all bacteria
  • present in one or more copies
84
Q

What regions do the bacterial 16S rRNA genes have?

A

variable (v) regions and conserved regions (common between species

85
Q

Outline one method of bacteria identification

A
  1. isolate DNA from faecal sample
  2. amplify bacterial 16S rRNA gene with primers that encompass variable regions
  3. sequence 16S rRNA gene amplified product of variable gene
  4. data analysis and processing of gene sequence
  5. taxonomix classification using reference databases
  6. relative abundance of species within sample
86
Q

Outline another method of bacteria identification

A
  1. isolate DNA from faecal sample
    2 sequence all DNA using next generation sequencing method
  2. data analysis and processing of bacterial rRNA gene sequence