GWAS in prokaryotes Flashcards

1
Q

GWAS

- define

A

association analysis performed w/ panel of polymorphic markers adequately spaced to capture most of linkage disequilibrium info in entire genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Study designs

A

Family based
OR
Case control

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Linkage disequilibrium

A

Genes close together in genome
- closer they are = higher the linkage
= more likely to be inherited together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Study design

  • cases
  • controls
A

> those with the phenotype of interest e.g. disease
presumed to have high prevalence of susceptibility alleles

> those w/out phenotype
presumed to have lower prevalence of such susceptibility alleles

case + controls ideally similar in majority of other factors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Misclassification bias

A

study participant categorised into incorrect category

-> alters the observed association or research outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what can identification of susceptibility variants lead to?

A

*novel biological insights
-> clinical advances
-> therapeutic targets
OR
biomarkers
OR prevention

*improved measure of individual aetiological processes
-> personalised medicine
–> diagnostics
OR
prognostics
OR
therapeutic optimisation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Genotyping

- what is it?

A

Looking for a nt variation associated w/ a given phenotype

e.g. GC change associated with a disease
AT means you don’t have disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Genotyping

- process

A
  1. Extract, amplify + fragment DNA
  2. Either:
    Microarray
    OR
    Sequencer
  3. Genotype calling
  4. SNP genotype
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Genotype calling

A

Determining genotype for each individual

- typically only done for positions in which a SNP or a ‘variant’ has already been called (=estimated)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Significance of hits

- Contigency tables (Fisher’s Exact Test)

A

Gives a p values for the significance of the SNP being associated w/ disease

Sum all probabilities for observed + all more extreme values with same marginal totals to compute probability of null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Does the affected or control group exhibit Population Stratification?

  • what is this?
  • what can it cause?
  • how is this controlled?
A

When subpopulations exhibit allelic variation because of ancestry

Can cause false +ves if there are SNP differences in the case + control population structures

Control for this by testing control SNPs for general elevation in X^2 distribution between cases + controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Associated haploblocks

A

Linkage disequilibrium organises genome into haplotype blocks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Haplotype block

A

region of genome where there’s little evidence of a history of genetic recombination

contain only a small number of distinct haplotypes (group of alleles inherited together from 1 parent)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Bottom-up approach

A

Starts w/ DNA sequence

-> tests effect on phenotype

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Top-down approach

A

Starts w/ phenotype + associates it w/ particular genomic elements

(by 2010 had large bacterial collections)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Campylobacter

- causes

A

most common cause of bacterial food poisoning in developing countries

meat gets contaminated

  • > people eat meat
  • > get ill
17
Q

Campylobacter

- who carries it?

A

humans don’t
- get ill and then bacteria goes away

chickens, sheep, cows + birds carry the bacteria

18
Q

Old signals vs New signals of adaptation

A

Old
= host-associated clonal complexes

New
= host-associated mobile elements

19
Q

GWAS study method

A
  1. sample lots of isolates from cows + chickens
  2. sequence genomes
  3. divide into 30bp fragments
    ( have 1 for every position in genome)
  4. sort by cow or chicken origin by:
    looking for fragments overrepresented in 1 org
20
Q

Difference in overrepresentation of a gene fragment in 1 species

  • compare to…?
  • shows…?
A

compare observed vs expected by descent

which bit of DNA might be adaptive

21
Q

ST-45 complex

- how many host-associated words?

A

9034

22
Q

Vitamin B5 synthesis gene

A

Present in cow campylobacter

NOT in chicken campylobacter

B5 found in grain not grass
Cows eat grass
-> need to synthesise own B5

23
Q

Campylobacter survival through the food chain

A

ST-45 most prevalent in chicken carcass or meat

ST-21 increasing prevalence from farm -> meat -> clinical

ST-45 meat + ST-21 clinical have same prevalence of 8%

24
Q

Campylobacter survival through the food chain

- deletion mutants have altered functions

A

nuoK
(associated in ST-21)
- involved in NADH activity
+ switching from anaerobic to O2 enviro

mutants grow better in enhanced O2

25
Q

Staphylococcus epidermidis

A

Coagulase-negative staphylococci
= commensal on human skin

major cause of nosocomial infection associated with surgery

26
Q

Evolutionary models of infection

A

Pathogenic clones
- only pathogenic sub-population can cause disease

True opportunistic pathogenicity
- all strains can cause disease if they get into blood
(same virulence elements in both strains)

Divided genome
- any clone can cause disease (pathogenicity elements can move between strains)

27
Q

Evolutionary models of infection

- associated variation in genomic data

A

Pathogenic clones
- all clones related

True opportunistic pathogenicity
+ divided genome
- identical evolutionary tree

28
Q

GWAS method 1 = matching isolate pairs

- findings

A

61 genes
containing infection-associated genetic elements that correlate with in vitro variation in known pathogenicity traits
e.g. biofilm formation

29
Q

pan genome

A

set of all strains of a species

30
Q

GWAS method 2 = correlating with in vitro phenotype

- what is it?

A

Infection associated SNP prevalence correlated w/ pathogenicity phenotypes