Genome organization Flashcards

1
Q

number of human chromosome pairs

A

23; 22 autosomal, 1 pair sex

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

base pairs in haploid genome

A

3X10^9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what percent of the genome codes proteins

A

<1.5%, only about 5% contains regulatory information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

point of attachment for cell microtubules for cell division

A

centromere

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

centromeric DNA sequence

A

alpha satellite family, and appx 171-bp unit critical for the attachment of microtubules of the spindle apparatus

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

the Alu family and the LINE (L1) family

A

two salient examples of repetitive DNA sequences that have implications in disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

define: single nucleotide polymorphism

A

occurs when a single nucleotide differs in a sequence that ultimately accounts for two alleles.
average of 1 SNP for every 1000 bp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

the three gene poor chromosomes

A

13, 18, 21; these are the only ones that can have viable life

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

unstable dynamic regions

A

common mutations, so common sites for disease

eg. SMA (Chr 5q13); DiGeorge syndrome (Chr 22q); 12 diseases (1q21)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

the dynamic nature of the human genome

A

~30 new mutations for each new individual, Shuffling of regions at each meiosis due to recombination
Can produce somatic DNA changes as well as germ-line DNA changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Gene-rich regions/chromosomes

A

chromosome 19

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

the non-random distribution) of GC-rich (38% of genome) and AT-rich (54%) regions is called:

A

clustering. this is the basis for chromosomal banding patterns (cytogenetics, karyotype analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

euchromatic regions

A

more relaxed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

heterochromatic regions

A

more condensed; repeat rich

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

percentage of DNA represented by genes (exons, introns, flanking sequences involved in regulating gene expression)

A

20-25%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

percentage of DNA that is “single copy” sequences

A

50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

percentage of DNA that are classes of “repetitive DNA”

A

40-50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

how big is the dystrophin gene (on the X chromosome)

A

big, over 2 million base pairs, less than 1% are coding exons. mutations lead to Duchenne muscular dystrophy

19
Q

very large arrays of tandemly repeating, non coding DNA

A

satellite DNA; the main component of functional centromeres (alpha satellite), and heterochromatin, or even in human-specific heterochromatic regions on the long arms of Chr 1, 9, 16 and Y (hotspots for human-specific evolutionary changes)

20
Q

of SINEs: Short Interspersed repetitive Elements

A

Alu family, - ~300 bp related members

- 500,000 copies in genome

21
Q

of LINES: Long Interspersed repetitive Elements

A

L1 family,- ~6 kb related members

- 100,000 copies in genome

22
Q

Repeats may facilitate aberrant ______________ between different copies of dispersed repeats leading to diseases

A

Non-allelic homologous recombination (NAHR)

23
Q

Retrotransposition may cause __________ inactivation of genes

A

insertional

24
Q

giemsa banding

A

A,T rich regions (gene poor) take up the dark stain,

25
Q

Insertion-deletion polymorphisms (indels):

A

Minisatellites and microsatellites

26
Q

Minisatellites

A

tandemly repeated 10-100 bp blocks of DNA
VNTR (variable number of tandem repeats)
dozens in the genome

27
Q

Microsatellites

A

-di-, tri-, tetra-nucleotide repeats
->5 x 104 per genome
STRPs (Short Tandem Repeat Polymorphisms)
thousands in the genome

28
Q

Copy number variations (CNVs)

A
  • variation in segments of genome from 200 bp – 2 Mb
  • can range from one additional copy to many
  • array comparative genomic hybridization (array CGH)
  • implicated in lots of diseases
29
Q

typical gene

A

intron containing

30
Q

retroposed gene

A

intronless

31
Q

evolutionary hotspots on chr 1, 9, 16, Y

A

a particular pentanucleotide sequence on human specific heterochromatic regions

32
Q

SNPs can easily be detected by

A

PCR

33
Q

describe gene families

A

Gene family is composed of genes with high sequence similarity (e.g. >85-90%) that may carry out similar but distinct functions

34
Q

how do gene families arise?

A

through gene duplication, thus is a major mechanism for evolutionary change

35
Q

beta globin # of exons? mutations can cause?

A

3 exons; hemoglobin deficiencies

36
Q

BRCA1 # of exons? mutations can cause?

A

24 exons; inherited breast and ovarian cancer

37
Q

MYH7 # of exons? mutations can cause

A

40 exons; inhereted cardiomyopathy

38
Q

CNV loci may cover ________% of the genome

A

12

39
Q

examples of CNV implicated in human disease

A

1q21.1; 9p13.3-9q21.12, 5q13.3

40
Q

CNV example of a Link between evolutionarily adaptive copy number increases and increase in human disease

A

1q21.1 macrocephaly, autism

41
Q

1q21.1 deletions

A

microcephaly, schizophrenia

42
Q

genome wide association studies

A

focuses on SNPs

43
Q

nextgen sequencing limitations

A

relies on short read sequences; Complex, highly duplicated regions are typically unexamined. Such regions are implicated in numerous diseases, e.g. 1q21

44
Q

explain “missing heritability”

A

Many large-scale studies implicate loci (e.g. SNPs) that account for only a small fraction of the expected genetic contribution. answer may lie in unexamined DNA