Genomics And Human Variation Flashcards

1
Q

Describe nuclear genes

A

High gene density in euchromatin and centromeric regions are generally non-coding

Genes are variable in size
-By far the largest human gene is the dystrophin gene ~ 2.5 Mb -2.7Mb depending on how it is defined

  • Other genes may be very (very) small
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are single copy genes?

A
  • Unique sequences in the genome that code for a protein:

- Receptors, enzymes, hormones, structural cellular elements etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are multiple gene families?

A

(Can be clustered together or dispersed in different genomic locations): genes with similar functions that have arisen by gene duplication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the types of gene families?

A

Classic gene families
-multi copy genes that show a high degree of homology (e.g. HOX genes, globin genes, and genes for rRNAs, tRNAs)

Gene superfamilies
-multi copy genes with similar function but limited gene homology (e.g. HLA genes, T-cell receptors)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is extragenic DNA?

A
  • Extragenic DNA constitutes the majority of the human genome
  • Extragenic DNA is predominantly transcriptionally active
  • Extragenic DNA might play a role in regulation of gene expression
  • Tandem repeated DNA sequences consists of blocks of tandem repeats of non-coding DNA. (Length of repeat sequence determines how it is named)
  • The shorter the repeat length, the more polymorphic that sequence is
  • Tandem repeats are inherited in a co-dominant fashion; one from each parent
  • Simple sequence repeat variations are used as the basis of DNA fingerprinting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a Single nucleotide polymorphism?

A
  • formally defined as a variant that is found at least in 1% of the population
  • Single base pair change between individuals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a simple sequence repeat (SSR)?

A

These is the most simple type of repetitive sequence; and most polymorphic.

  • These are tandem repeats of 2,3 or 4 bp, repeated many times
  • Some texts will call these simple tandem repeats (STR), but this may lead to confusion with VNTR si we with use that terminology
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a VNTR?

A

(Sometimes called short tandem repeats)

-These repeats are a bit longer then the SSR, maybe 5 bp, 10 bp, or hundred of bp; repeating many times

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe single nucleotide polymorphism

A

Human genome is about 3x10^9 bp for haploid

  • Approximately 1 in each thousand bp there is a common variation (common means that below 1% of the population has the change; polymorphism)
  • Therefore there are about 3 million common single nucleotide bp changes- SNP
  • Most SNP do not have associated phenotype
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Explain the application of SSR analysis of a CA repeat linked to an autosomal-dominant disease

A

This is tracking an SSR, since it is a variation in tandem repeats of a dinucleotide sequence

-Dad’s genotype is most likely B,D. the 6 tandem CA repeat ‘allele D’ is most likely linked to the disease in his family

CA repeat ‘allele D’ linked to the disease

  • SSR are vary useful in forensics and paternity testing
  • Easy to detect by PCR. Based methods
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Explain what is a low copy repeat

A

These may be thousands (kB) to many hundreds of thousands of bp long (that is, they can be big)

  • May be repeated just twice in the genome, or many times
  • May cause mispairing during meiosis or mitosis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the long interspersed nuclear elements(LINES)?

A

( LINES ~600 base pairs)

  • found in large numbers in eukaryotic genomes)
    • They are able to make an RNA
    • LINES include a gene that encodes the enzyme reverse transcriptase
  • Reverse transcriptase makes a DNA copy of the LINE or SINE mRNA that can be integrated into the genome at a new site
  • LINEs are therefore capable of copying themselves and may enlarge the genome. The human genome contains 100,000s of LINES
  • LINE and SINE sequence repeats may contribute to mutation by leading to unequal crossover during meiosis
    • (same idea as for low copy repeat sequences)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

explain what are short interspersed nuclear elements (SINES)

A
  • Short sequences of under 500 bp (maybe 300 bp) that are found up to 1.5 million times in the genome - approximately ~10% of the human genome
  • Appear to be “normal” RNAs that were converted to DNA reverse transcriptase and were reinserted into the genome. Reverse transcriptase is hijacked from a LINE.
  • Most common SINE in humans is the Alu sequence. Called Alu because the SINEs contain a sequence that is recognized by the Alu restriction enzyme
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the rare variants of SNPs?

A

Rare variants are “rare” in that they aren’t frequently found in the population, but there may be billions of them
-Another way of saying this : there are many more rare variants than there are common variants- they just aren’t frequently found

-We don’t know all of these rare variants, and there will always be a new one to discover

Rare variants are thought to contribute to human disease more than SNP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are pseudogenes?

A

Sequences that look like real genes but aren’t functional (no protein product). Most probably arose during evolution by:

  • Gene duplication and subsequent mutation
  • Copying of RNA back to DNA (by viral reverse transcriptases) and reinsertion into genome
  • The gene is turned off for some reason
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Contract expressed and pseudogenes

A

Expressed- transcribed and functional

Pseudogenes- nonfunctional copies of a gene

-Conventional pseudogenes aren’t expressed & they often occur in gene families. Others can be expressed (transcribed) but their transcripts are nit functional

17
Q

Describe mitochondrial DNA (non-nuclear DNA)

A
  • Mitichondria thought to have arisen by by endosymbiosis
  • Single circular DNA of 16.6 KB
  • Codes for some mitichondrial tRNA, and other factors needed for mito gene expression
  • Codes for a few mitichondria proteins
  • Most mitochondrial proteins are encoded by genes in the nucleus
  • Maternally inherited in humans (all mitochondria come from the cytoplasm of the oocyte)
  • Mitovhondria disorders are only passed through maternal line
  • Mitochondrial genes are more prone for mutation
18
Q

Summarize chromosomes

A

Made up of a long chain of DNA, DNA wound around a specific (histone) proteins, to form a compact structure:chromosome

Each chromosome carries a different subset of genes that are arranged linearly along its DNA

  • Each chromosome is made up of many contiguous genes
  • Chromosome has a constricted region: centromere
  • Centromere divides the chromosome into two arms p arm (short) and a arm (long)
19
Q

Describe the chromosome at metaphase

A
  • Appearance of a chromosome during metaphase when they are maximally condensed and most visible
  • The microtubules attach to the centromere kinetochores during
  • The ends of a chromosome are ‘telomeres’
  • Genes are usually absent in the telomeres and the centromere
20
Q

Describe chromosome nomenclature

A

Metacentric - p and a arms are equal length

Sub metacentric- p arm is shorter than q arm

Acrocentric- p arm contain little genetic information

21
Q

Describe karyotyping

A

visualization of chromosomes during metaphase (maximally condensed)

  • Stained with specific stains to form a banded pattern, that is specific for a chromosome (G-banded karyotype)
  • 22 pairs of autosomes and one pair of sex chromosomes (Total 23 pairs of chromosomes)
  • Chromosomes are ordered according to size: Chromosome 1 is the largest and Ch 21 is the smallest of the chromosomes
  • One chromosome of the pair is derived from the mother and the other chromosome is derived from the father (when both the chromosomes of the pair are derived from the same parent, it is said to be uniparental disomy- UPD)
22
Q

Explain the lyonization of X-inactivation

A
  • Occurs during the early stages of development of female embryos
  • One of the X-chromosome is inactivated and condensed to form the Barr body in females
  • X-inactivation is random (paternal X chromosome active in approximately 50% cells and maternal X chromosome is active in approximately 50% of cells )(Assymmetric X-inactivation is a phenomenon observed when in more than 50% of the cells the paternal/maternal is active)
  • X-inactivation is fixed: the same X chromosome is inactivated in all the descendants of the cell
  • X-inactivation is regulated by a region called X-inactivation center (Xic) that has the gene, Xist gene involved in inactivation
  • XIST RNA coat one of the X chromosomes, that results in transcriptional interference
23
Q

How much X-chromosomes are inactivated?

A

Paternal X-Chromosone active in approximately 50% of cells and maternal X chromosome is active in approximately 50% of cells

Pseudoautosomal regions of the X chromosome are not inactivated

24
Q

Summarize how X-inactivation works

A

-Xist gene produces an RNA that coats the X-chromosome

  • Causes X-chromosome condensation to heterochromatin
    • cytosine methylation of specific regions occurs (epigenetic mechanism)
    • Only specific regions of the X chromosome are methylated
    • Some of the X chromosome escapes inactivation
  • Methylation of cytosine bases (CG islands) and histone deacetylation, result in transcriptional repression
  • The process of imprinting is similar to X-inactivation, in that the imprinted gene is methylated and transcriptionally silenced
25
Q

explain the concept of gene alleles

A

Members of a pair of chromosones(homologous chromosomes) carry matching genetic information:
-Same genes in the same order. At any specific locus, however, they may have either identical (homozygous) or slightly different (heterozygous) forms of the same gene, called alleles

  • Alleles are different versions of the same gene; they have slight differences in the base pair DNA sequence
  • If the homologous chromosomes have the same allele of a gene, then the person is homozygous for that allele
  • If the homologous chromosomes have different allelic forms of a gene, then the person is heterozygous for that allele
26
Q

Summarize Mendelian inheritance in man

A
  • Victor McKusick and colleagues at John Hopkins university developed a cataloging system for human genetic traits and genes
  • Began as a library project (JHU medical library
  • Advances outpaced ability to update library entries, so online Mendelian inheritance in MAN (OMIM) project created
    - Now maintained by NCBI

-Each genetic trait has a unique sux-digit MIM number

27
Q

What is an MIM number?

A

Autosomal dominant disorders begin with 1
-example: Marfan syndrome (MIM 154700)

Autosomal recessive disorders begin with 2
-example: cystic fibrosis (MIM: 219700)

X-linked disorders begin with 3
-example: Duchenne muscular dystrophy (MIM 310200)

Mitichondrial disirders begin with 5
-example: lever hereditary optic neuropathy (MIM 535000)