Lecture 1 Organisation of the Human Genome Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the different types of DNA?

A

Nuclear, mitochondrial, bacterial and viral

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the dimensions of nuclear DNA?

A

10bp per turn, with 3.4 nm per turn and 2.37 diameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How many genomes are present in a cell?

A

2 genomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How many base pairs are there in the nuclear genome and corresponding genes?

A

3 x 10^9, (3-3.3 billion) corresponding to 30,000-20,000 genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How much does mitochondrial DNA account for in the genome?

A

0.001%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How many base pairs are there in the mitochondrial genome? How many genes and how many proteins?

A

16,569 bp,
37 genes
13 involved in respiratory chain (protein coding)
24 non-coding RNAs (2 rRNAs, 22 tRNAs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Does the mitochondrial genome contain introns?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a karyotype?

A

The characteristic number and size of chromosomes examined through staining chromosomes during metaphase.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the composition of the human genome?

A

3200 Mb, 2000Mb is intergenic, 1200 is gene relates sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are LINES?

A

Long INterspersed repeat Elements;
6-8kb in length, can copy themselves into the genome. They encode proteins required for their integration into the genome: 3 Line families (LIINE1/2/3) however only LINE1 is transcriptionally active.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are SINES?

A

Repeat elements shorter than 6kb in length that have lost the ability for retrotransposition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the SINE families?

A

ALUs and MIRs; only ALUs are active.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Can ALUs be found throughout the animal kingdom?

A

No, they are only present in primates, with 1 approx every 3 kb. They also have their own subfamilies and are over 80 million years old - molecular clocks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What percentage of the human genome is transposable?

A

Approx 50%; however they do not move much anymore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are LTRs?

A

Long Terminal Repeat elements, related to viruses. Originate from retrotransposons, related to retroviruses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What percentage of the genome is composed of retroviral insertions?

A

8.3%

17
Q

What are microsatellites?

A

Short sequence (1-15bp) repeated in tandem many times (2-50). Dinucleotide repeats are the most frequent (unineucleotide repeats are found). Most common is 3-4 bp repeats.

18
Q

How do microsatellites affect genomic complexity?

A

The reduce the complexity (ACACACACAC or GGGGGGG) for example.
They are prone to expansion and contraction during replication due to polymerase slippage.

19
Q

Can microsatellite sequences be found in coding sequences?

A

Yes, but not very often, 3bp in length if found.

20
Q

What are the major classes of RNA sequences that do not contain open reading frames?

A
tRNA (25% in cluster on Chr6) 
rRNA (28S, 18S, 5.8S, 150-200 copies)
snoRNA (RNA processing/base modification, 97 different snoRNA)
snRNA (22snRNA, multiple copies of some)
Unknown function (mostly single copy)
21
Q

What are pseudogenes?

A

sequences related to coding or non-coding sequences that have become mutationally saturated so that the function is lost. They are evolutionary relics, derived from genes (both coding and non-coding) by duplication or retrotransposition.

22
Q

What are the different types of pseudogenes?

A

Gene fragments: single/ multiple exons.
Whole genes that include introns, mutated splice sites
Processed pseudogenes: mature mRNA from expressed gene reverse-transcribed and integrated into the genome: 4/5 exons that are inserted into the genome.

23
Q

What are the features of coding genes?

A

They make up 1-5% of the genome, producing proteins which perform cellular activities. Can be single copy or multiple copy. Can be grouped into families based upon sequence familiarity; often evolved by duplication and divergence. Superfamilies also exist

24
Q

What is the significance of gene overlap?

A

Numerous genes can be located on the same section of DNA. For example the BRCA1 (5’-3’) and Rho7 (3’-5)
Opposite strands can also code different genes

25
Q

Describe the general structure of coding genes.

A

Promoter: RNA pol binding site, TF sites, TATA vs TATA-less
Introns and exons (coding must have exons - mitochondrial genes do no have introns)
Must have a 5’-UTR, drives translation for tissue specific elements. Has regulatory functions
Start codon (ATG); multiple ATGs present
Polyadenylation sequences to end transcription
Splice sites (AG/GT vs AT/AC)
Splice enhancers; exonic and intronic
Stop codon (TAA, TAG, TGA)
Once transcribed, mature mRNA can also have polyA tails added to them.

26
Q

Describe the structure of the beta-globin gene.

A

It has three exons; at the beginning of the introns, GT at the beginning and AG at the end; this pattern can be found throughout the genome – splice sites.

27
Q

Describe the alternative splicing methods.

A

Exon skipping.
Intron retention
Alternative 3’ site
Alternative initiation

28
Q

Describe the Average Human Gene.

A

It covers 10-15kb of genomic DNA
Encodes a cDNA 1500-2000 bp, with the majority being intronic sequences (>85%)
There is a large variation in gene size, dystrophin is 2Mb whereas some are

29
Q

Describe the genome’s content.

A

50’000-100’000 genes were predicted in the genome; there are 30’000-35’000 genes: alternative splicing however can account for some differences in number.

30
Q

Describe the common features that can be identified in genes with a common function.

A

The leader sequence (which directs the protein to the cell surface) are approx 20 amino acids in length.
Extracellular domains
Stalk region
Membrane anchoring/ transmembrane sequence
intracellular domains (of different families) to deliver different sequences.

31
Q

Which processes account for the largest number of genes?

A

Metabolism, reproductive and expression processes. Gene number reflects this (metabolism has the greatest number of genes)