1.1 The Genome and Epigenome Flashcards
What are the five major classes of non-coding DNA?
- Promoter and enhancer regions
- Binding sites for factors that maintain higher order chromatin structures
- Non-coding regulatory RNA (e.g. microRNAs and long non-coding RNAs)
- Mobile genetic elements (e.g. transposons)
- Special structural regions of DNA (in particular telomeres and centromeres)
What is the function of a promoter or enhancer region in DNA?
Provide binding sites for transcription factors. Promoters initiate gene transcription; they are on the same strand and upstream of their associated gene. Enhancers modulate gene expression by looping back onto promoters and recruiting additional factors that drive the expression of pre-messenger RNA species
What are non-coding regulatory RNAs?
RNAs that are transcribed, but never translated. They regulate gene expression through a variety of mechanisms (e.g. microRNA/miRNA or long-non-coding RNA/lncRNA)
What are mobile genetic elements?
“Jumping genes” (e.g. transposons) - genes which move around the genome during evolution, resulting in variable copy number and positioning. They are implicated in gene regulation and chromatin organisation, but their function is not well established.
What is a major component of telomeres? Describe this component and the function that it serves.
Satellite DNA.
These are large repetitive sequences of DNA (from 5bp to 5kbp) that are associated with the spindle apparatus, but also maintain the dense, tightly packed organisation of heterochromatin
What are the two most common forms of DNA polymorphisms (variation)?
- Single nucleotide polymorphisms
- Copy number variations
What percentage of single nucleotide polymorphisms (SNPs) occur within coding regions?
What percentage of copy number variations (CNVs) occur within coding regions?
1% of single nucleotide polymorphisms occur within coding regions
50% of copy number variations occur within coding regions
Define a single nucleotide polymorphism
Single nucleotide polymorphisms are a variation at a single nucleotide. They are almost always biallelic, occurring across the genome (within exons, introns, intergenic regions, and coding regions).
Define copy number variations
Copy number variations are a form of genetic variation consisting of different numbers of large contiguous stretches of DNA.
FILL IN THE BLANKS
1___________________ occur across the genome - within exons, introns, intergenic regions, and coding regions.
1___________________ occurring in 2___________________ can occur within genomic regulatory elements, thus altering 3_________________
Some 1__________________ are termed “4__________” variants and are thought to have 5__ effect on 6_____________ or 7________________________________
However, even 4____________ 1__________________ may be useful markers if they happen to be coinherited with a disease-associated polymorphism as a result of physical proximity. In other words, the 1_________________________ and the causative genetic factor are in 8___________________________________.
- Single nucleotide polymorphisms
- Non-coding regions
- Gene expression
- Neutral
- No
- Gene function
- Individual phenotype
- Linkage disequilibrium
Fill in the blanks
- Heterochromatin (dense, inactive)
- Nucleolus
- Nucleus
- Euchromatin (sparse, active)
- Cell
- p arm
- q arm
- Telomeres
- Centromere
- Chromosome
- Nucleosome
- DNA
- Transcription
- Pre-mRNA
- Promoter
- Exon
- Enhancer
- Intron
- Splicing
- 5’ UTR
- Open reading frame
- 3’ UTR
- Translation
- Protein
- mRNA
What are copy number variations?
Copy number consist of different numbers of large contiguous stretches of DNA. They can be biallelic and simply duplicated, or alternatively deleted in some individuals. At other sites, there are complex rearrangements of the genomic material, with multiple variants in the human population.
What percentage of single nucleotide polymorphisms involve gene coding areas?
1%
What percentage of copy number variations involve gene coding areas?
50% (thus underlying a large portion of human phenotypic diversity)
Describe the structure of chromatin
A segment of DNA 147 base pairs long is wrapped 1.8 times around an octameric low-molecular weight histone proteins (consistening of 8 subunit histone proteins - 2 x H2A, 2 x H2B, 2 x H3, and 2 x H4), resembling spools of thread. These complexes are called nucleosomes and joined together by 20-80 base nucleotide stretches of linker DNA and H1 histone proteins.
Histone units are positively charged, and DNA is negatively charged, thus allowing compaction.
This allows the genome to be packed into the nucleus. Chromatin is not wound uniformly; it can be dense and inactive (heterochromatin) or disperse and active (euchromatin).