Lecture 5: The Human Genome Flashcards
What is the human genome made up of and how many bp are in the haploid genome?
22 autosome pairs and 2 sex chromosomes. 3.2 billion
What was the human genome project?
International Human Genome Sequence Consortium aimed to obtain the entire DNA sequence of the hapoid human genome in 15 years. Launched 1990
How does hierarchical shotgun sequencing work?
- Create a library of segments of the genome using bacterial artificial chromosomes (BAC library)
- All the BACs are screened for markers and classified by their location on the chromosome
- A set of minimally overlapping BACs are selected for sequencing
- Individual BACs divided into smaller fragments and sequenced using sanger sequencing
- sequenced fragments are assembled based on overlapping segments as many bacteria’s DNA is used.
What had the hierachical shotgun sequencing achieved after 10 years?
only sequenced the smallest chromosome
What was Celera? Who started and it and what were their goals?
Celera Human Genome sequencing was started by Craig Venter who thought using shotgun sequencing would be faster. He used shotgun sequencing to sequence the first bacterial genome. Funded Celera in 1998 with private funds. Goal to sequence the entire genome in 3 years
What is the random shotgun strategy?
the whole genome is shredded into smaller fragments of a few kilobases. each fragment is sequenced at both ends to create read pairs. Based on the overlap, the reads are assembled into contigs which are used to build scaffolds. Read pair mates can be used to determine the size of any gaps between contigs.
How many genomes were used in Celera?
5 individuals and BAC libraries
Why did both methods need each other?
Celera method cheaper and faster. Public effort used this method to finish sequencing. Celera method used the physical map from the public effort.
How are the gaps filled in?
Using PCR to amplify the unknown segments which are sequenced. For gaps greater than 20kb the BAC libraries are screened to identify segments containing the edge of the gap and they are shotgun sequenced.
Whose genome was used for the public effort?
10-20 people from across different racial and ethnic backgrounds
Why were their still gaps in the sequence in 2001?
Due to 2% hard to clone heterochromatin.
What main points did the human genome sequence reveal?
Thought there would be 50-100,000 genes but only 20-22K. Made sense that the complexity of eukaryotes would mean more genes. There is a large variation in gene size. most genes sequenced were blow 10 kb but the first intron tends to be very large as lots of regulatory sequences in it.
Where have many genes derived from?
Horizontal transfer from bacteria or from transposons.
Describe how genes and chromosomes are organised in our genome.
Genes are not evenly distributed in the genome. Higher expressed genes tend to be in high GC content areas. Some chromosomes are positioned in the nucleus in order to have access to particular TFs so they are more likely to be transcribed. Transcription complexes are located in the centre of the nucleus where most of the open chromatin with high GC content is located.
What is the genome made up of and in what proportions?
Exons = 1.5%
Introns =25%
the rest is composed of repetitive sequences. Transposons make up a large proportion of introns and a small proportion of exons.