Week 1.1: Introduction, the landscape of the human genome Flashcards
When was the human genome project published?
The human genome was first published in 2001
Who led the private, and public HGP? Where were they published?
The private project led by J. Craig Venter published in Science. Eric S. Lander led the public project that was mainly funded by the US government and published in Nature, during the same time.
Human genomics timeline
1866
1866 Mendel publishes laws of inheritance, using pea plants, this is when we first began to understand how genetics works.
Human genomics timeline
1869
1869 Miescher isolates “nuclein” from cells, at the time we did not know about the link (DNA) between the nuclein discovered and the genetics of Mendel’s discovery.
Human genomics timeline
1912
1912 Chromosome counts 47 male, 48 in female, at the time chromosomes were not understood. Today we know that the counts were wrong, because the male Y chromosome, which is smaller than the X chromosome, was too small to be seen by the microscopes.
Human genomics timeline
1944-1952
1944-1952 DNA shown to be genetic material
Human genomics timeline
1953
1953 Crick and Watson structure of DNA
Human genomics timeline
1961
1961 Nirenberg cracks genetic code
Human genomics timeline
1996
1996 Yeast genome sequenced
Human genomics timeline
2001
2001 Human genome sequence published
Human genomics timeline
2012
2012 ENCODE project published, telling us a lot more about the functionality of the human genome
What is the latest version of the human genome called? when was it released?
Latest version of the human genome is called GRCh38, and was released in December 24th 2013.
How many bases does the human genome have? What percentage of bases are unknown? what percentage of bases are unplaced?
Current best human genome we have is 3.2 billion bases long, and about 4.98% of those bases are unknown, we do not know if they are A,G,C, or T. To this day, we do not have a perfectly complete human genome. 0.14% of bases are unplaced, meaning we do not know where on the chromosome they are.
How many bases long is the human genome? What is the equivalent to in bibles, Qu’rans?
The size of the human genome, it is 3.2 billion letters long, the equivalent to 1,000 Bibles or 10,000 Qu’rans, in (almost) every cell in our bodies we carry 2 copies. We have 100 trillion cells, therefore we carry about 6.5x10^23 bases of DNA.
In the context of computers, one computer byte can represent four base pairs, meaning a haploid human genome can be coded in 0.8Gb of information
How much information does the human body contain?
Thus, the human body contains approximately 161 billion terabytes of information in DNA
= 161 exabytes
= 322 billion of average laptop hard-drives
= 32 trillion DVDs at 5Gb
= 26 years of internet traffic
As you walk around more information is moving then all the information on the internet over 26 years!
Is it cheaper to generate DNA than to store it? What could DNA be used as?
Generating big data, in sequencing DNA is relatively cheap and easier, we are getting to a point where generating the data is cheaper than storing the information. As DNA is an incredibly efficient way of storing and moving information, we are thinking about storing our scientific results in a DNA file in a tube at -80degrees rather than on a hard drive.
Nick Goldman demonstrated this.