Human Genome and Genomics Flashcards

Question

How were the marker sequences beneficial to the map-based approach?

Answer 1

- This process produced huge amounts of data that have been used to virtually reassemble our genome. However, gaps remain: repeat sequences are common in the human genome, so repeats from entirely different chromosome regions may be wrongly joined together. It will take many years to identify the mismatches caused by repeat sequences. Some regions, especially near the centromeres, may never be fully finished.

Answer 2

- The chromosome is partially digested with restriction enzymes and the fragments are ligated into YACs or BACs to create libraries of contigs (large-insert clones)
- The large-insert clones are placed in their correct order using a pre-existing genetic map of markers or by generating a restriction enzyme cleavage map
- A subset of overlapping clones is selected and further broken into smaller fragments, which are then cloned (small-insert clones) and sequenced

Answer 3

- Both the Human Genome Sequencing Consortium (map-based approach) and Celera (shotgun approach) moved forward simultaneously
- Both parties published rough drafts in the same week of February 2001, and the drafts were found to be remarkably consistent

Answer 4

- An intergenic region is a stretch of DNA sequence located between genes. Intergenic regions are a subset of noncoding DNA. Occasionally some intergenic DNA acts to control nearby genes, but most of it has no currently known function.

Answer 5

- On average, there is 1 gene per 145 kb (but there are gene clusters)
- The average human gene is 27,000 bp in length and contains 9 exons
- Exons make up 1.1% of the genome, introns make up 24% of the genome, and 75% of the genome is intergenic DNA
- 44% of the intergenic DNA is derived from transposable elements
- There are 22,287 protein-coding genes
- Additional genes specify RNA products (rRNA, tRNA, snRNA, miRNA)
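As a quick consistency check on these figures, the gene count and the average gene spacing can be multiplied to see what genome size they imply. This is only an arithmetic sketch using the two numbers quoted on the card; the variable names are illustrative.

```python
# Figures quoted on the card.
PROTEIN_CODING_GENES = 22_287
BP_PER_GENE = 145_000  # one gene per 145 kb on average

# Genome size implied by those two averages.
implied_genome_size_bp = PROTEIN_CODING_GENES * BP_PER_GENE
print(f"{implied_genome_size_bp:,} bp")  # 3,231,615,000 bp
```

The result, roughly 3.2 billion bp, is in line with the commonly quoted human genome size of about 3 billion bp.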

Answer 6

- Google Doc

Answer 7

Map-based sequencing:
- Organizes contigs from a restriction map before sequencing
- More time-consuming, cumbersome, and expensive

Shotgun sequencing:
- Random sequencing followed by assembly
- Faster and cheaper; it has now become the most common method of assembling first drafts of genomes
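The "random sequencing, then assembly" idea can be illustrated with a toy greedy assembler that repeatedly merges the two reads with the longest suffix-to-prefix overlap. This is a minimal sketch under simplifying assumptions (error-free reads, one strand, no repeats); the function names and the example reads are hypothetical, and real assemblers are far more elaborate.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` that equals a prefix of `b`."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads: list[str], min_len: int = 3) -> list[str]:
    """Merge the best-overlapping pair of reads until no overlaps remain."""
    reads = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while True:
        best_len, best_i, best_j = 0, None, None
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_len:
                        best_len, best_i, best_j = n, i, j
        if best_len == 0:
            return reads  # whatever is left are the assembled contigs
        merged = reads[best_i] + reads[best_j][best_len:]
        reads = [r for k, r in enumerate(reads) if k not in (best_i, best_j)]
        reads.append(merged)


# Three overlapping reads sampled from a made-up 15 bp sequence.
contigs = greedy_assemble(["ATGGCGTA", "GCGTACGTT", "ACGTTAGC"])
print(contigs)  # ['ATGGCGTACGTTAGC']
```

Because the merge step relies only on sequence overlap, two reads that end in the same repeat would be joined even if they came from different chromosome regions, which is the misassembly risk that a map-based ordering helps to avoid.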

Answer 8

The map-based approach is now used to resolve problems encountered during the shotgun approach. For example:
- Assembling highly repetitive sequences
- Resolving gaps in the sequence

The shotgun and map-based approaches are usually combined to assemble complete or near-complete genome sequences.

Answer 9

- Next-generation DNA Sequencing

Answer 10

There are several NGS systems, developed by different companies. However, all of these systems share at least three fundamental steps:
- DNA preparation and immobilization onto a solid support
- Amplification (PCR)
- Sequencing

Answer 11

- Construction of a sequencing library -> clonal amplification to generate sequencing features
- No in vivo cloning, transformation, colony picking, etc.
- Array-based sequencing
- Higher degree of parallelism than capillary-based sequencing

Answer 12

- There is no single genome that represents the human species; instead, there is a reference genome based on DNA donated by several individuals.
- In reality there are billions of human genomes (every person's genome is likely to be unique), and there is not even a single genome for one person (cells within the same individual vary due to somatic mutations).

Answer 13

- The available human genome is not the ancestral sequence for all humans, but instead an arbitrary sequence based on the idiosyncrasies of those individuals whose DNA was used in the human genome projects

Answer 14

- Genome sequencing has identified a number of sites where humans differ in their sequence
- There is 99.9% sequence identity between any 2 individuals. This might not seem like much of a difference, but given that the human genome consists of 3 billion bp, it means there are about 3 million bp differences between any 2 individuals.
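The 3 million figure follows directly from the two numbers on the card. A short arithmetic sketch (variable names are illustrative):

```python
GENOME_SIZE_BP = 3_000_000_000  # genome size quoted on the card
SEQUENCE_IDENTITY = 0.999       # 99.9% identity between two individuals

# The remaining 0.1% of positions differ.
differing_fraction = 1 - SEQUENCE_IDENTITY
expected_differences = round(GENOME_SIZE_BP * differing_fraction)
print(f"{expected_differences:,} bp")  # 3,000,000 bp
```

Put another way, 0.1% divergence corresponds to one difference per 1000 bp on average.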

Answer 15

- A single-nucleotide polymorphism (SNP) is a substitution of a single nucleotide at a specific position in the genome, where each variant is present to some appreciable degree within a population
- In other words, it is a site in the genome where individuals differ by a single base pair

Answer 16

- SNPs are inherited as allelic variants in the same way as alleles that produce phenotypic differences.
  • They can spread throughout a population over time.
- SNPs are numerous and present throughout genomes.
  • When the same chromosome from two different people is compared, a SNP can be found approximately every 1000 bp.

Answer 17

- The specific set of SNPs and other genetic variants observed on a single chromosome or part of a chromosome is called a haplotype
- Because the rate of crossing over is proportional to the physical distance between loci, SNPs located close to each other are rarely separated by recombination and are therefore strongly associated as haplotypes

Answer 18

- The variability of SNPs and their widespread occurrence throughout the genome make them valuable markers in linkage studies

Answer 19

- SNPs that are physically close to a disease-causing locus tend to be inherited along with the disease-causing allele.
- People with the disease tend to have different SNPs than healthy people.
- Comparing SNP haplotypes between people with a disease and healthy people can help determine the location of the disease-causing gene.

Answer 20

- Provides a saliva-based direct-to-consumer personal genome test
- Companies such as Ancestry, MyHeritage, etc. use SNP genotyping to assess the genetic variation between members of the same species (e.g., humans)

Answer 21

- DNA profiling is the process of determining an individual's DNA characteristics, which are as unique as fingerprints

Answer 22

- Using restriction enzymes to cut DNA into fragments (restriction fragment length polymorphism, RFLP)
- Because of sequence differences within the genome, the fragments from two people will be of different sizes
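A toy digest shows why a sequence difference changes fragment sizes. The sketch below assumes the enzyme EcoRI, which recognizes GAATTC and cuts after the G; the two short sequences and the function name are made up for illustration, and only one strand is modeled.

```python
def digest(dna: str, site: str = "GAATTC", cut_offset: int = 1) -> list[int]:
    """Return fragment lengths after cutting `dna` at every occurrence of `site`.

    `cut_offset` is the cut position within the site (1 = after the first base).
    """
    cuts = []
    start = dna.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = dna.find(site, start + 1)
    bounds = [0] + cuts + [len(dna)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Hypothetical sequences: person B carries a single-base change (A -> C)
# that destroys the second recognition site.
person_a = "CCCC" + "GAATTC" + "TTTTTTTT" + "GAATTC" + "AA"
person_b = "CCCC" + "GAATTC" + "TTTTTTTT" + "GACTTC" + "AA"

print(digest(person_a))  # [5, 14, 7]
print(digest(person_b))  # [5, 21]
```

Person A yields three fragments and person B two, so the same region gives a different banding pattern for each person.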

Answer 23

- Variable number tandem repeats (VNTRs)
  • Certain DNA motifs are repeated
  • The motif does not change, but the number of times it repeats does
  • Distributed throughout the genome

Answer 24

- Short tandem repeats (STRs)
  • Very short DNA sequences repeated in tandem (adjacent to one another)
  • The power of STR analysis comes from simultaneously examining multiple STR loci that assort independently
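Independent assortment is what makes multi-locus STR profiles so discriminating: the chance of matching at every locus is the product of the per-locus genotype frequencies. The sketch below uses invented frequencies and locus names purely to show the multiplication; they are not real population data.

```python
import math


def profile_frequency(genotype_frequencies) -> float:
    """Combined frequency of a profile across independently assorting loci."""
    return math.prod(genotype_frequencies)


# Hypothetical genotype frequencies at three independent STR loci.
hypothetical_loci = {"locus_A": 0.10, "locus_B": 0.05, "locus_C": 0.20}

combined = profile_frequency(hypothetical_loci.values())
print(f"about 1 in {round(1 / combined):,}")  # about 1 in 1,000
```

Each additional independent locus multiplies the result again, so examining many loci at once quickly drives the expected frequency of a full profile toward vanishingly small values.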

Answer 25

- STRs are detected using PCR with primers that flank the microsatellite repeats
- The higher the number of repeats, the larger the PCR fragment and the more slowly it migrates during electrophoresis
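The relationship between repeat number and fragment size can be written as a one-line formula: amplicon length = flanking sequence + repeat count × motif length. The sketch below assumes a 4 bp motif and 100 bp of combined flanking sequence (including primers); both values are hypothetical.

```python
def amplicon_length(repeat_count: int, motif: str = "GATA", flank_bp: int = 100) -> int:
    """PCR product size for an STR allele with `repeat_count` copies of `motif`.

    `flank_bp` is the total flanking sequence amplified along with the repeats
    (an assumed value for illustration).
    """
    return flank_bp + repeat_count * len(motif)


alleles = [12, 8]  # repeat counts of two hypothetical alleles
sizes = {n: amplicon_length(n) for n in alleles}
print(sizes)  # {12: 148, 8: 132}

# Smaller fragments migrate faster, so sort by size to get the migration order.
fastest_first = sorted(sizes, key=sizes.get)
print(fastest_first)  # [8, 12]
```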

Answer 26

- Google doc

Answer 27

- Using fluorescent detection, the fragments are represented as peaks on a graph
- Homozygotes for an STR allele have a single tall peak
- Heterozygotes have two shorter peaks
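The peak-reading rule can be sketched as a small function that counts the peaks above a noise threshold at one STR locus. The threshold value, the peak heights, and the function name are all assumptions made for illustration, not values from any real instrument or software.

```python
def call_genotype(peaks: dict[int, float], min_height: float = 50.0) -> str:
    """Classify one STR locus from its peaks.

    `peaks` maps fragment size (bp) to peak height (arbitrary units).
    Peaks below `min_height` (an assumed noise threshold) are ignored.
    """
    alleles = sorted(size for size, height in peaks.items() if height >= min_height)
    if len(alleles) == 1:
        return f"homozygous ({alleles[0]} bp)"
    if len(alleles) == 2:
        return f"heterozygous ({alleles[0]} bp / {alleles[1]} bp)"
    return "inconclusive"


# One tall peak: both copies of the locus give the same fragment size.
print(call_genotype({132: 1800.0}))  # homozygous (132 bp)

# Two shorter peaks plus a small noise peak that falls below the threshold.
print(call_genotype({132: 900.0, 140: 20.0, 148: 850.0}))  # heterozygous (132 bp / 148 bp)
```

The homozygote's single peak is taller because both chromosomes contribute fragments of the same size, while the heterozygote's signal is split between two sizes.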