Lecture 3: Linkage Analysis Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Why do we care about variations?

A
  1. Allow tracking ancestral human history
  2. Underlie phenotypic differences
  3. Cause inherited diseases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

If you are interested in STUDYING A HUMAN DISEASE, HOW do
you FIND out WHICH GENE, when MUTATED, CAUSES THAT DISEASE?

A

You can find that POSITION by GENETIC MAPPING.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the EVIDENCE that a DISEASE OR TRAIT is GENETIC?

A
  1. TWIN STUDIES
  2. FAMILY SEGREGATION
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the EVIDENCE that a DISEASE OR TRAIT is GENETIC?

twin studies…

MS EXAMPLE

A

TWIN STUDIES:
COMPARE MONOZYGOTIC AND DIZYGOTIC TWINS
—- COMPARE THE CONCORDANCE RATES OF MZ AND DZ TWINS.

  1. Monozygotic Twins : Genetically identical
  2. Dizygotic Twins : Like siblings (1/2 GENOME shared)

***MS Example:
- Monozygotic Twins : Concordance rate 25-30%

  • Dizygotic Twins : Concordance rate 2-5%
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the EVIDENCE that a DISEASE OR TRAIT is GENETIC?

family segregation …

MS EXAMPLE

A

Family segregation: INCREASED RISK for DISEASE AMONG FAMILY MEMBERS OF AN AFFECTED INDIVIDUAL.

HOW?
COMPARE FREQUENCY OF DISEASE AMONG FIRST DEGREE RELATIVES OF AFFECTED INDIVIDUALS WITH THE FREQUENCY OF THE DISEASE IN THE GENERAL POPULATION.

MS Example:
- Risk to 1st-degree relatives (Parents, siblings, children): 2-5%
- General Population: 0.1-0.2%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the goal of a genetic study?

A

IDENTIFY GENETIC RISK FACTORS (genetic locations) causing human diseases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Types of Genetic (INHERTIABLE) Disease:

A
  1. SINGLE GENE (MENDELIAN) DISORDERS
  2. COMPLEX DISEASES/COMPLEX GENETIC DISORDERS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain single gene mendelian disorders: 3

A
  1. rare
  2. gene segregation within pedigrees (families)
  3. obvious they are genetic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

explain complex disease/complex genetic disorders: 4

A
  1. may be NO RECOGNISABLE PATTERN OF INHERITANCE
  2. likely due to ACTION OF MULTIPLE GENES
  3. GENES may be INTERACTING WITH EACH OTHER TO RESULT IN DISEASE PHENOTYPES
  4. ENVIRONMENTAL FACTORS/GENE-ENVIRONMENTAL INTERACTIONS may affect disease
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a complex disease?

A

complex disease: phenotypes, symptoms, endotypes
- disease features and nomenclature may change overtime

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some factors which induce or affect complex diseases: 5

A
  1. Environmental drivers
  2. Epigenetics
  3. Social ‘determinants of health’
  4. Genetics
  5. other HOST factors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain/list Environmental Drivers: 6

A
  1. chemicals
  2. smoking
  3. sun exposure
  4. infection/infectious diseases
  5. diet
  6. alcohol
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

List/explain Genetics: 4

A
  1. molecular mechanisms of disease
  2. susceptibility (risk) or modifier gene/variants
  3. genomic and somatic variants
  4. SNPs, CNV, INdels, mode of inheritance, monogenic or polygenic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Host factors : 9

A
  1. time
  2. age
  3. microbiome
  4. obesity
  5. nutrition
  6. inflammation
  7. immune system
  8. stage of development at exposure
  9. immune response
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Epigenetics: 6

A
  1. methylation
  2. miRNA
  3. retroviruses
  4. regulatory events
  5. gene rearrangements
  6. gene expression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

social determinants of health: 5

A
  1. social setting
  2. communication barriers
  3. access to healthcare
  4. cultural and or economic status
  5. adverse childhood experiences
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Explain the MONOGENIC DISORDER PATHWAY (SLIDE 10): 4

A
  1. Gene; MUTATION present
  2. Inheritance pattern (in family members) – DOMINANT OR RECESSIVE
  3. IMPACT of MUTATIONS in a SINGLE GENE on a DISEASE PHENOTYPE
    ( genetic risk in population… in percentage)
  4. Genetic risk in different families (in percent)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Explain the COMPLEX DISORDER PATHWAY (SLIDE 10): 4

A
  1. GENE VARIATIONS
    eg. gene A, gene B, gene C, gene D.
  2. Inheritance pattern is COMPLEX
  3. IMPACT OF VARIATIONS IN DIFFERENT GENES ON DISEASE PHENOTYPE
    - Genetic risk in population (epidemiological evidence..in percent)
  4. Affecting families (in percent).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are the 3 GENE IDENTIFICATION APPROACHES

A
  1. POSITIONAL CLONING
  2. CANDIDATE GENE APPROACH
  3. WHOLE GENOME SCREENING
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

GENE IDENTIFCATION APPROACH: POSITIONAL CLONING

WHAT?
EXAMPLE?
HOW?

A
  1. POSITIONAL CLONING (REVERSE GENETICS)
  2. DISEASE —> (not) FUNCTION <— GENE <— MAP

EXAMPLE: Cystic Fibrosis (CFTR)

HOW: identification of a gene BASED SOLELY on its POSITION IN THE GENOME

MOST WIDESPREAD STRATEGY IN HUMAN GENETICS IN THE LAST 20-35 YEARS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

STRENGTHS AND WEAKNESSES OF POSITIONAL CLONING: 4

A

STRENGTH:
1. No knowledge of gene product required

  1. very strong track record in single-gene disorders

WEAKNESSES
3. Understanding of FUNCTION not a CERTAIN OUTCOME

  1. POOR TRACK RECORD WITH MULTIFACTORIAL TRAITS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Gene Identification Approaches

WHAT IS “CANDIDATE GENE APPROACH”: 3

A
  1. CANDIDATE GENES are genes LOCATED in a CHROMOSOME REGION SUSPECTED OF BEING INVOLVED IN THE EXPRESSION OF DISEASE TRAITS.

2.Limit to the known biological functions of a particular disease.

  1. Can be IDENTIFIED BY ASSOCIATION AAND LINKAGE WITH PHENOTYPES
23
Q

Gene Identification Approaches

WHAT IS “WHOLE GENOME SCREEN APPROACH”: 4

LINKAGE VS ASSOCIATION

A
  1. SCAN the WHOLE GENOME WITHOUT ANY PRIOR INFORMATION
  2. CAN DISCOVER POTENTIAL GENES PLAYING ROLES IN DISEASES
  3. LINKAGE: 6 or 10k SNPs enough
  4. ASSOCIATION: 500K OR 1 MILLION SNPSs are recently available
24
Q

SUMMARY OF GENE IDENTIFICATION APPROACHES

FOR “POSITIONAL CLONING”
- STARTING POINT
- KEY METHOD
- USE CASE

A

1.STARTING POINT:
genetic marker/location

  1. KEY METHOD:
    linkage analysis, chromosomal mapping
  2. USE CASE:
    identifying genes linked to specific chromosomal regions
25
Q

SUMMARY OF GENE IDENTIFICATION APPROACHES

FOR “CANDIDATE GENE APPROACH”
- STARTING POINT
- KEY METHOD
- USE CASE

A
  1. STARTING POINT
    suspected gene based on function
  2. KEY METHOD
    genetic association studies
  3. USE CASE
    investigation specific genes based on prior knowledge
26
Q

SUMMARY OF GENE IDENTIFICATION APPROACHES

FOR “WHOLE GENOME SCREENING”
- STARTING POINT
- KEY METHOD
- USE CASE

A
  1. STARTING POINT
    Entire genome
  2. KEY METHOD
    GWAS, whole genome sequencing
  3. USE CASE
    unbiased discovery of genetic factors for complex traits
27
Q

Gene Mapping technique: 3

A

Disease genotype <—> Disease phenotype = BIOLOGY

Disease genotype <—> Marker genotype = PROXIMITY

Marker Genotype <—> Marker Genotype = GENE MAPPING; LINKAGE/ASSOCIATION ANALYSIS

28
Q

GENE MAPPING TECHNIQUE PRINCIPLE:

A

“People who have similar phenotypic values (ie DISEASE) should have
higher chance of SHARING OF GENETIC MATERIAL near the genes that influence those traits.”

29
Q

How to identify genes contributing to disease? = 2

A
  1. LINKAGE MAPPING
  2. LINKAGE DISEQUILIBRIUM (ASSOCIATION STUDY)
30
Q

Explain LINKAGE MAPPING: 5

A
  1. MEASURES THE SEGREGATION OF ALLELES AND A PHENOTYPE WITHIN A FAMILY
  2. USE CROSSOVER OCCURRING DURING MEIOSIS II

—> 3. Genes that are PHYSICALLY CLOSE TOGETHER ARE MORE LIKELY TO BE CO-INHERITED

—> 4. * Genes that are PHYSICALLY FAR APART ARE LESS LIKELY TO BE CO-INHERITED

  1. DETECT OVER BOARD CHROMOSOMAL REGIONS ON THE GENOME
31
Q

EXPLAIN Linkage disequilibrium (Association Study): 2

A
  1. EVALUATE the EVIDENCE of a DIRECT CORRELATION BETWEEN A MARKER ALLELE and a DISEASE RISK ALLELE
  2. SHARING of GENETIC MATERIAL: ACTUAL SHARING OF THE SAME ALLELE (LINKAGE DISEQUILIBRIUM = LD)
32
Q

What is a GENETIC MARKER?

TYPES?

A

A genetic marker is a POLYMORPHIC DNA SEQUENCE with a KNOWN LOCATION ON A
CHROMOSOME THAT CAN BE USED TO IDENTIFY INDIVIDUALS that can be used to identify individuals

TYPES:
1. SNP
2. SSR (Short sequence repeat/micro satellite)

33
Q

SNP:
No of Loci:
Advantage:
Disadvantage:

A
  1. NO. OF LOCI:
    High, occur ~1 in every 100-300 bp
  2. ADVANTAGE:
    …1. Abundant in the genome
    …2. Provide high-resolution
    mapping
    …3. Low mutation rate (stable)
    …4. Many technologies available
  3. DISADVANTAGE
    Often Bi-allelic (two
    possible alleles) which
    LIMITS INFORMATION FROM ONE LOCATION
34
Q

SSR (short sequence repeat/micro satellite):

No of Loci:
Advantage:
Disadvantage

A
  1. NO. OF LOCI:
    High, occurs ~ every 2-30kb
  2. ADVANTAGE:
    Multi-allelic: each SSR can
    have multiple alleles
  3. DISADVANTAGE
    …1. Lower abundance
    …2. more labour-intensive
    …3. Higher mutation rate
35
Q

Meiosis and Recombination: 3

A
  1. During MEIOSIS, the CHROMOSOMES
    DUPLICATE, then CROSS OVER
    (‘rRECOMBINE) to PRODUCE A HAPLOID GAMETE (sperm/ egg)
  2. The gamete DERIVES GENETIC VARIANTS FROM BOTH PARENTS
  3. MEIOSIS IS THE “BASIS” FOR “HEREDITY”
36
Q

BIVALENTS IN PROPHASE OF MEIOSIS I:

A

SINGLE CROSSOVER —> BIVALENTS IN PROPHASE OF MEIOSIS I: “A1 and A2 CROSSOVER” —> CHROMATIDS IN GAMETE: NRRN

DOUBLE CROSSOVER —>BIVALENTS IN PROPHASE OF MEIOSIS I: “A1 AND A2 CROSS OVER, B1 AND B2 CROSSOVER” —> CHROMATIDS IN GAMETE:
1. 2 STRAND: NNNN
2. 3 STRAND: RNRN
3. FOUR STRAND: RRRR

NEED TO LOOK AT SLIDE 18

37
Q

Understanding Markers and Inheritance = 3

A
  1. Polymorphic loci whose locations are known
  2. Most often SNPs or micro-satellites
  3. Inherited within the Chromosomes
38
Q

For linkage analysis, we need informative meiosis

A

NEED NOT A WANT

UNDERSTAND SLIDE 20 PEDIGREE

39
Q

Markers and Inheritance

A

NEED NOT A WANT
UNDERSTAND SLIDE 21 PEDIGREE 1 And 2

40
Q

What is LINKAGE? = 4
“HOW DO WE MEASURE IT?”

A
  1. Only ~1 recombination per chromosome/meiosis
    “→ Loci that are close together on the same chromosome tend to be
    inherited together (‘linked’ or ‘in LD’ = linkage disequilibrium)”
  2. The closer the loci, the more linkage
    “→ Degree of linkage is a measure of genetic distance”
    • Linkage is measured by the recombination fraction, θ = proportion of
      recombinants
      “θ = 0 complete linkage
      θ = 0.5: no linkage”
  3. Two loci on the same chromosome, only a cross-over event will separate them.
41
Q

What is recombinant fraction? = 5

A
  1. θ is a measure of genetic distance
  2. Further apart two loci, the more likely a crossover event will occur (θ value will increase)
  3. Centimorgan (cM) is a genetic distance unit
  4. 1 cM = 1% chance of recombination
  5. 1 cM approx. 1000 kb = 1 m
42
Q

Obtaining enough family material to test multiple
meiosis is difficult for rare diseases

A
  1. Higher RF (RECOMBINATION FRACTION) between two loci, MORE MEIOSIS needed to OBTAIN EVIDENCE that THEY ARE LINKED.
  2. Scoring recombinants in human pedigrees not always simple
43
Q

Linkage mapping: is a marker “linked” to the
disease gene

A
  1. Collect families with affected individuals
  2. Genome Scan - Test markers evenly spaced across the entire genome
    (~every 10cM, ~400 markers)
  3. Lod score (“log of the odds”)
44
Q

WHAT IS LOD SCORE?

A

what are the odds of observing the family
marker data if the marker is linked to the disease (less recombination than
expected) compared to if the marker is not linked to the disease

45
Q

Test to estimate whether
the likelihood that TWO LOCI ARE LINKED is greater than
likelihood that THE TWO LOCI BEING UNLINKED

A

Test to estimate whether
the likelihood that TWO LOCI ARE LINKED is greater than
likelihood that THE TWO LOCI BEING UNLINKED

Z = LOG 10

46
Q

Z = LOG10 FORMULA

A

Z = LOG10 * (LIKELIHOOD OF LINKAGE ( THETA <0.5)/ LIKLIHOOD LOCI ARE UNLINKED ( THETA = 0.5))

47
Q

linkage mapping

A
  1. parametric method
  2. non-parametric method
48
Q

parametric method

A

Estimate recombination fraction between a marker locus and an unobserved trait locus

Out of 4 informative meioses,
2 are recombinants => 1/2

slide 26

49
Q

NON PARAMETRIC METHOD

A

Count the number of alleles
two affected sibs share
identical by descent (IBD).

“If the marker is linked to the disease locus, the affected sibs will tend to share the disease allele more often than they would at a marker unlinked to the disease locus.”

50
Q

Statistical significance of Lod Scores:

A

Z>3.0
EVIDENCE OF LINKAGE
(with 5% chance of error)

2.0<Z<3.0
SUGGESTIVE EVIDENCE OF LINKAGE

-2.0<Z<2.0
UNINFORMATIVE LINKAGE ANALYSIS

<-2.0 –> EXCLUSION OF LINKAGE

51
Q

Lod Scores: 4

A
  1. Human families only produce small numbers of children.
  2. To get statistically significant evidence for linkage, combine evidence
    from many families
  3. A complex mathematical procedure, implemented by computer software, is
    used to generate “Lod scores”.
  4. Lod score is a statistic that describes the STRENGTH OF EVIDENCE for linkage,
    at any chosen value of the RF, given the family data available.
52
Q

The task of linkage analysis is to:

A

find markers that are linked to the hypothetical disease locus

53
Q

The task of linkage analysis is to find markers that are linked to the hypothetical
disease locus HOW? = 7

A
  1. Determine the approximate location of disease predisposing genes
  2. Linkage is caused by loci (e.g. the risk gene and a genetic marker) being close to each other on a chromosome.
  3. The recombination fraction is the probability that, in any meiosis,
    there will be a recombination between them between two markers.

…….4. θ = 0.5 Mendelian segregation – no linkage, = recombination.

…….5. θ = 0.0 very tight linkage – no recombination.

  1. Great success in identifying genes for simple Mendelian diseases
  2. Few successes in identifying genes contributing to complex diseases