11 - The human genome project Flashcards

1
Q

What is the size range of human chromosomes?

A

55x10^6 bp to 250x10^6 bp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What were the 4 goals of the human genome project?

A
  1. Determine the sequence of the 3 billion chemical base pairs in human DNA.
  2. Identify all genes in human DNA to their position on chromosomes.
  3. Attempt to predict the function of all genes.

4 . Utilise this information for understanding disease, developing medicines, understanding human variability and how humans compare to other species.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What were the 3 phases of the human genome project?

A

Phase 1 - Produce high resolution chromosomal maps
- position genetic markers and genes
- Create libraries of BAC clones for sequencing.

Phase 2 - Sequence each BAC DNA

Phase 3 - Assemble all sequences to produce final draft and annotate to identify genes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the differences between the old sanger sequencing and the new sanger sequencing?

A

Old sequencing:
- 4 separate dideoxy reactions (one for each base).
- Very slow.
- Manual reading of results off X-ray film.

New sequencing:
- Like PCR
- Uses fluorescent terminators
- Products run on a gel, separated by size, laser scans bands to read them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What approach did IHGSC use for sequencing the human genome and what were the advantages and disadvantages of their approach?

A

Clone-by-clone approach:

Advantages:
- Very effective at getting over regions of highly repetitive DNA sequences

Disadvantages:
- Slow approach
- Expensive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What approach did Celera use for sequencing the human genome?

A

Shotgun sequencing:

Blast genome into small fragments sequence each one and then use the power of computers to reassemble to sequence.

Had to rely on public databases of sequence and mapping information in order to assemble the sequence that was generated by this method.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When was the human genome sequence 80 and 99.9% finished?

A

80% - June 2000

99.9% - July 2003

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 2 major types of global human genetic variation?

A
  1. Single nucleotide polymorphisms (SNPs)
  2. Copy number variants (CNVs)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What was the goal following the human genome project?

A

SNP identification - technologies were developed for rapid, large-scale identification of SNPs and other DNA sequence variants.

A public database of DNA differences was created dbSNP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What was the international HapMap project?

A

Aimed to find the common SNP variants in the world’s population.

Studied 270 people in total from 4 populations around the world.

Found >1 million human SNPs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Once the gaps in the human genome project were filled what was found?

A

After 20 years the final 8% of the sequence was finished:

  • 3,054,815,472 bp of nuclear DNA
  • 16,569 bp mitochondrial genome
  • 63,494 genes
  • 233,615 transcripts
  • 19,969 genes and 86,245 transcripts are predicted to be protein coding

Many non-coding genes:
- tRNAs and rRNAs (involved in translation).
- microRNAs and long non-coding RNAs (involved in transcription regulation).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What causes an increase in complexity of protein structure as the evolutionary tree is climbed?

A

Alternative mRNA splicing complexity increases - more protein isoforms per gene.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly