Screening And GWAS Flashcards

1
Q

What is new born screening ?

A

Screening is there to detect potentially fatal or disabling conditions as early as possible before signs or symptoms. It allows treatment to begin immediately reducing or eliminating the effect of the condition preventing devastating outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Ins why genomic sequencing should be part of newborn screening

A
  1. Conditions could be screened for simultaneously
  2. Allow for interventions early in life for children with treatable or preventable diseases
  3. As soon as a genetic condition met the criteria for the newborn screening it could be added the next day
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is babyseq?

A
  1. The idea of babyseq is to explore the impact of incorporating genomic sequencing into newborn screening they looked at: 
    - The medical impact on the individual and public health
    - The behavioural impact on positions and patient behaviour
    - The economic impact in terms of cost for the health care system
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Barriers to implementation of poly genic risk scores into clinical use

A
  1. Confrontation of terminology
  2. inadequate description of applications
  3. Failure to define key elements of test strategy
  4. uncertainty as to evaluation approach
  5. lack of consensus on nature, quality and quantity of evidence
  6. concerns about the performance of applications in specific populations.
  7. imprecise descriptions of polygenic scores
  8. lack of evidence for their perceived value
  9. Concerns that they may exacerbate health inequalities
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is necessary for the successful clinical implementation of polygenic score based applications

A

That’s for implementation requires establishing evidence for the performance of each component of the test and how the test performed in a given healthcare pathway 

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the importance of context specific evaluation in the use of polygenic scores

A

Context- specific evaluation is important because it defines the intended purpose,role and population for the test which is essential for developing evidence of clinical validity and utility

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Give a definition for screening

A

Screening is a process used to detect early signs of a particular condition or disease in individuals who may not yet show symptoms. It aims to identify those at risk so that further diagnostic tests or interventions can be offered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the key principles of screening

A
  1. Validity
  2. Reliability
  3. Acceptability
  4. Feasibility
  5. Early Detection
  6. Natural history
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe why validity is a key principle of screening

A

Screening tests should accurately identified individuals with the condition and exclude those without it (High specificity)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe why reliability is a key principle of screening

A

Consistent results should be obtained when the same test is repeated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe why Acceptability is a key principle of screening

A

Screening should be acceptable to the target population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe why feasibility is a key principle of screening

A

The screening program should be practical, cost -effective, and feasible to implement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe why early detection is a key principle of screening

A

Screening aims to detect conditions early, when intervention can improve outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe why natural history is a key principle of screening

A

UNderstanding the natural history of the condition helps determine the optimal timing for screening

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What do you need to consider when putting together population genetic screening programmes

A
  1. Define the purpose of population genetic screening
  2. Explain how it differed from targeted or individualised testing
  3. Discuss the potential impact on public health
  4. What types of screening programs are you doing ? Universal screening, selective screening or cascade screening
  5. What condition are you screening for? Hereditary disorders or carrier screening or pharmacogenomics
  6. Implementation and logistics
  7. Ethical considerations ; address informed consent, privacy, and data sharing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is stratified screening

A

Screening individuals based on risk assessment results. Based no stratification each group is given a different levels of screening.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Impact of population stratification vs screening the whole population.

A
  1. Fewer people are screened
  2. Reduction in harm associated with screening
  3. Reduced anxiety and inconvenience associated with testing
  4. Fewer false positives and unnecessary biopsies
  5. Less overdiagnosis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Common features of complex diseases

A
  1. Variation in multiple genes influence the trait, and have an additive, multiplicative or interactive effect
  2. Familial clustering with no clear pattern of inheritance
  3. Presence of a high-risk genetic variant doesn’t imply that disease will develop
  4. Often affects homeostasis gradually, resulting in onset later in life
  5. Often common in the general population.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How to find complex disease gene

A
  1. Linkage vs association
  2. Candidate gene vs Genome -wide
  3. Case control vs Quantitative trait
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What’s the difference between linkage analysis vs association analysis

A

co-occurrence of a genetic variant with a disease trait, more frequently than can be readily explained by chance

Linkage analysis: looks at the cosegregation of alleles within affected FAMILY MEMBERS.
Association analysis: Difference in allele frequency between UNRELATED groups of affected and unaffected individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

When choosing which test to do to find genetic factors of a common disorder what factors do you have to consider:

A
  1. Phenotype: Clinical definition
  2. Participant: Population based - will you use case-control or cohort
  3. Approach: Genome-wide or candidate gene
  4. Analysis: Lab analysis or Data analysis
  5. Verification: Replication or functional studies or Meta analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Define endophenotype also known as intermediate phenotype

A

A phenotype that you can quantify in disease cohort and control cohort eg for diabetes the enodo phenotype would be insulin levels
Or

It’s a measurable biological trait that reliably shows how a specific biological system works, is somewhat inherited, and is more directly linked to the underlying cause of a disease than just the general symptoms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

You need a large enough group to have statistical power to prove your hypothesis. What influences statistical power.

A
  1. Allele frequency
  2. Effect size
  3. Number of markers genotyped
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How does allele frequency affect statistical power of a GWAS

A

-Higher Allele Frequency: Increases statistical power. When the allele of interest is more common in the population, it’s easier to detect an association with the trait or disease.
-Lower Allele Frequency: Decreases statistical power. Rare alleles are harder to detect, as fewer individuals carry them, making it more difficult to observe a significant association.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

How does effect size affect the statistical power of GWAS

A

-Larger Effect Size: Increases statistical power. If the allele has a strong impact on the trait or disease, it’s easier to detect, even with a smaller sample size.
-Smaller Effect Size: Decreases statistical power. Alleles with subtle effects require larger sample sizes to detect significant associations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How does number of markers genotyped affect statistical power

A

-More Markers: Generally decreases statistical power for individual markers due to the multiple testing burden. When more genetic markers are tested, the threshold for significance becomes stricter (to account for multiple comparisons), making it harder to detect associations unless the sample size is large enough to compensate.
-Fewer Markers: Increases statistical power per marker but may miss important associations not covered by the limited set of markers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is population stratification what affect does it have

A

Population stratification in GWAS refers to differences in allele frequencies between subgroups within a population that arise from ancestry rather than an association with the trait or disease being studied. These differences can lead to false-positive associations, where a genetic variant appears to be linked to the trait simply because it is more common in one subgroup than another, rather than because it is truly related to the trait. This confounding effect can distort the results of a GWAS,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What do you use to identify population stratification

A

A PCA plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Define confounding factors

A

A confounding factor (or confounder) is a variable that influences both the independent variable (the factor being studied) and the dependent variable (the outcome of interest), creating a false association or masking a real one between them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Why does failure to adjust for multiple testing appropriately lead to excessive false positives or overlook true positive signals.

A
  1. Excessive False Positives: When multiple tests are performed simultaneously (e.g., testing thousands of genetic variants for association with a trait), the likelihood of finding a statistically significant result by chance alone increases. If no adjustment is made, many of these “significant” results may actually be false positives, meaning they appear significant due to random variation rather than a true association.
  2. Overlooking True Positives: On the other hand, overly strict adjustments for multiple testing (e.g., using very conservative methods) can make it more difficult to detect real associations. The stringent criteria may reduce the power of the study, causing true positive signals to be missed because they don’t meet the adjusted significance threshold.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Most frequent phenotypes observed in infants

A
  1. Congenital and multiple congenital abnormalities
  2. Developmental delay
  3. Dysmorphism
32
Q

Between WGS, WES & Gene panel which has ben shown to have a better yield (%) and changes in managment

A

WES has the greatest yield in infant diagnosis and leads to the most changes in management.

33
Q

Why do a panel for infant diagnosis instead of a WGS if the yield is higher for WGS

A

The turn around time is shorter 100h vs 149hrs

34
Q

Why is WGS more cost effective than the current standard of care methods

A

WGS is more comprehensive which means there is a higher probability of diagnosis and higher diagnostic yield which is more cost effective

35
Q

List the pilot genome screening projects out there

A
  1. BeginNGS (USA)- WGS
  2. Generation study (England)- 490 genes associated with >200 conditions
  3. BabyScreen+ (Australia)- >500 treatable childhood -onset conditions
36
Q

Screening vs diagnostic tests

A
  1. A proper screening program is that it is applied to a whole population
  2. Screening is a top-down approach offered to a large cohort
  3. Most screening tests do not results in a definite diagnosis with a higher rate of false positives and false negative outcomes vs diagnostic tests
  4. Screening tests need to be cheap and simple
  5. Diagnostic testing is aimed at an individual with a particular problem
37
Q

Points to consider when asking when might screening be done?

A
  1. Can the subjects give properly informed consent?
  2. How easy is it to access the group to be screened?
  3. How relevant will the information be at that time to the person screened ?
  4. What are the practical and ethical implications of a positives results?
  5. What will the program cost, and do the benefits justify the cost?
38
Q

Characterise population attributable risk

A

The proportion of the total disease risk in the population that is attributable to the factor in question.
Population attributable risk = proportion of the population that gave a variant (p) x the additional risk that variant gives (R) divided by overall risk in the population.

39
Q

Common types of screening in healthcare

A
  1. Newborn screening
  2. Cancer screening
  3. Cardiovascular risk screening
  4. Genetic screening
  5. Eye screening checks in diabetics
40
Q

Potential applications of polygenic risk score in cancer

A
  1. Risk prediction
    - Tumour subtypes
    -Contralateral breast cancer
    -Familial or hereditary cancers
  2. Screening
    -risk stratified screening
    -improving performance of current screening tests.
  3. Guiding medical intervention
    -information risk prediction of subtypes and selection of therapeutic interventions
    -Health promotion
41
Q

What needs to be considered about the implications of statisfied screening.

A

The size of the programme needs to be considered, for example under normal breast screening 7,4 million women are screened with stratified (Age) screening only 5.7 million women are screened. This means that there is a reduction in the number of cases screened which means there may be an impact on the number of potentially detectable cases.

42
Q

Other the sample size what else has to be considered for stratified screening, what else is there.

A
  1. Which screening strategy i.e age or family history … or
  2. Which percentile or threshold should be used (example if its 70th percentile it means 70% of the population is within the risk category, So this means that if we stratify for age and end with 100 people and then only look at the 70th percentile we end up with 70 people of that 100)
43
Q

Key questions when we ask if it is clinically useful to incorporate polygenic core information into healthcare for common disease.

A

1.how good are existing models to calculate a polygenic score
2. Does incorporation of this bio marker into risk prediction tools did value?
3. Does using this information lead to improvements in current prevention programme/pathway.
4. If this is better what are the organisational, ethical and social implications

44
Q

Disagreements and challenges of using polygenic scores for screening

A

-Prediction accuracy: Disagreements on how well PGS predicts complex traits like disease risk.
-Research validity: Concerns about whether PGS research applies broadly, especially across different populations.
-Clinical usefulness: Uncertainty over whether PGS improves medical decision-making or patient care.
-Effective interventions: Questionable whether high-risk PGS results lead to actionable prevention or treatment.
-Evidence required: Lack of agreement on how much evidence is needed to prove PGS is reliable for clinical use.
-Purpose of PGS: Differing views on whether PGS should be used for broad screening or only in specific cases.

45
Q

Why is transferability so poor for polygenic risk score

A
  1. True heterogeneity in allelic effect frequency
  2. Population specific variants
  3. Apparent heterogeneity in effect size
  4. Admixture effects
  5. Residual population stratification
    6 Impact of rare variants missing from the PRS
  6. Effect on gene-gene and gene environment interactions
46
Q

Challenges associated with using genomic sequence data

A
  1. Interpretation is difficult and requires access to multiple data types
  2. Multidisciplinary expertise is often required
  3. Data storage is expensive
  4. Precarious funding reduced funds and increasing demand.
  5. Accurate diagnosis often relies on finding an unrelated genotype /phenotype match
    6.knowledge is constantly evolving existing resources may be unreliable
  6. Incidental findings of uncertain clinical relevance
  7. Lack of designated infrastructure
47
Q

Accurate and early identification of disorders in newborns for diagnosing is difficult why?

A
  1. Atypical disease presentation
  2. Non-specific and overlapping phenotype (Facial features are easier to distinguish in adult faces not so much in newborn faces)
  3. Common phenotypic markers immeasurable in neonatal and newborn period
  4. Rapid phenotype and disease progression
  5. Obscured by comorbidities (sepsis, premature, birth trauma )
48
Q

Clinical utility (measured by changes in clinical management and turn-around times) of an early diagnosis.

A
  1. Accurate prognoses and risk assessment
  2. Medication changes
  3. Initiation of palliative care
  4. Additional screening and monitoring
  5. Specialist consuls
  6. Supportive procedures and surgery
  7. Genetic counselling services
  8. Access to clinical trials for emerging therapies
49
Q

Personal utility (subjective outcomes assessed across different domains) of an early diagnosis

A
  1. Affected utility -emotional state, acceptance and coping mechanisms, mental preparation, relief and responsibility
  2. Cognitive - information itself, knowledge condition and health status
  3. Behavioural - tactical utilisation, future planning, reproductive planning
  4. Social- social support, access to services
50
Q

Characterise phenotype-first approach

A
  1. Guided by differential clinical diagnosis from a medical geneticist.
  2. This is a targeted testing and analysis which means decreased cost of testing /analysis but increased time to a result
  3. It’s reliant on strong phenotype-genotype relationships,
  4. limitations: genetic heterogeneity, limited phenotypic heterogeneity, requires doctors to have expertise about mild and non-classic phenotypes that most don’t have
51
Q

Characterise genotype-first approach

A
  1. Examine genomic variation regardless of phenotype
  2. Analysis more comprehensive
  3. Useful for clinical heterogeneity
    -atypical presentation
    -varying severity
    -conditions not fully penetrant
    Facilitates new genotype-phenotype discovery
    -new associations
    -expand current phenotypic associations
    -better understanding disease mechanism
52
Q

Pros and cons Panel vs WGS

A
  1. Panels have a lower diagnostic yield that genome sequencing (research of 400 iII infants showed a yield of 27% for panel and 49%)
  2. Panels however have a lower turn around time
53
Q

WES for infants, characterise

A
  1. Around 50% yield
  2. Only around 30% have been diagnosed by standard of care
  3. Around 20% of the entire cohort changed medical management
54
Q

Pros and cons WES vs WGS

A
  1. They have similar diagnostic yields and turn around times
  2. WGS has better analytical performance, better nucleotide coverage which means 85x more rare variants called
55
Q

WGS vs standard of care

A

Standard of care testing has a higher cost and longer hospitalisation and lower diagnostic yield

56
Q

Characterise Rapid testing

A
  1. Faster results = earlier intervention (Fastest is 13.5 hours, decrease TAT by 35% )
  2. Involves automatic data transfer from electronic health records once test ordered, (Electronic health records are scanned for natural language/ terms that have known disease association ).
  3. Reduced library prep time, Faster sequencing , Pipeline used results in faster alignment and variant calling.
    4.
57
Q

New Born screening thats done in South Africa (not that important just read)

A
  1. Biotinidase deficiency
  2. Isovanic acidemia
  3. Propanic acidemia
  4. Galactosemia
  5. Maple syrup urine disease
  6. Glutamic acidemia type 1
  7. Vitamins b responsive methylmalonic acidemia
58
Q

Controversy of newborn screening

A
  1. Interpretation genomic data constantly evolving
  2. Reporting on conditions later onset, psychological impact
  3. Associated costs - testing and counselling
    4.access to therapies - most conditions are rare thus treatment expensive and inaccessible
  4. Data handling and storage
59
Q

What are beta values

A

SNP effect size estimates used to calculate PRS

60
Q

DEFINE PRS

A

A score indicating an individuals genetic predisposition to a given phenotype based on a cumulative effect of many SNPs across the genome.

61
Q

How do you calculate PRS

A

It’s the sum of risk allele an individual carries,weighted by the effect size of the allele.

62
Q

Why estimate genetic risk

A

Personalised medicine
- Tailor prevention and treatment strategies based on genetic profile
- predict a persons prognosis or an individuals response to treatment

Research
-estimate genetic overlap between outcomes
-control for genetic effects
-investigate genetic architecture

63
Q

Benefits of polygenic risk scores

A
  1. Easy to calculate
  2. Easy to store
  3. Stable across lifetime trajectory, inform a baseline risk
  4. Largely independent of traditional risk factors
  5. A single genotyping test to develop multiple PRS for different disease
64
Q

Explain clumping

A

Clumping creates a list of approximately independent SNPs, identified in a manner that preferentially selects the SNPs most associated with the outcomes

65
Q

Explain thresholding-

A

these independent SNPs are tested against a threshold to determine if they are included in the polygenic risk score or not

66
Q

How does linkage disequilibrium affect the clumping algorithm ?

A
  1. We must account for linkage disequilibrium when polygenic scoring to avoid doubling counting highly correlated variants
  2. LD-based clumping removes variants highly correlated with the lead variant (SNP with lowest p value) within each locus
67
Q

Limitation of the clumping method what is the way to fix this

A

Clumping may also remove non-overlapping information.
Joint Effect size- Rather than remove SNPs, estimate the joint effect of all the overlapping SNPs
Shrinkage - Keep all SNPs but effect size is shrunk toward 0 to make them more realistic and generalisable to all populations

68
Q

The predictive utility of polygenic risk score depended on what ?

A
  1. Max variance explained by PRS depends on phenotypic heterogeneity ( example trait has high heritability, genetics plays a bigger role prs might be more useful and visa versa)
  2. Power/ GWAS sample size: polygenic scores explain more variance as GWAS sample size increase and genetic effects are estimated more accurately
69
Q

What is the biological target of NIPT

A

Cell free fetal DNA which is short segments of DNA in maternal plasma primarily placental origin

70
Q

detection rate and false positive rate of NIPT vs other screening techniques

A
  1. Detection rate of NIPT is Around a 60% higher than maternal age.
  2. Detection rate of NIPT is Around a 30% higher than biochemical screening
  3. Detection rate of NIPT is Around a 10% higher than nuchal translucency.
  4. False positive rate of other screening is 5% and NIPT is 0.1%
71
Q

Biological challenges of cell free DNA NIPT

A
  1. False positives:
    -unrecognised or vanishing twin (more DNA)
    -placental mosaicism
    - low level maternal mosaicism
    - maternal genetic variation
    -maternal malignancy
  2. False negatives:
    -low level of fetal DNA
    -placental mosaicism
    -maternal genetic variation
  3. Failed results
    -increased BMI
    -low level of fetal DNA
    -fetal aneuploidy
72
Q

After NIPT gives Pos result, explain the diagnostic test that has to happen.

A
  1. Extract baby DNA - Amniocentesis (amniotic fluid taken 16 and 22 weeks ), chronic villus sampling (placenta taken 11 and 14 weeks)
  2. Perform QF-PCR or karyotype on the sample
73
Q

Should we be implementing NIPT in South Africa

A

Implementing NIPT in South Africa could improve prenatal care by offering a safe, accurate, and non-invasive option for detecting genetic conditions, but factors like cost, accessibility, and healthcare infrastructure must be addressed for it to be practical and equitable.

74
Q

Other possible uses of NIPT

A
  1. Other aneuploidies
  2. Gender
  3. Specific microdeletions e.g 22q11.2del
  4. Larger gains or loss of chromosomal material across the genome
  5. Single gene disorder
75
Q

What are the cons of prenatal Chromosomal microarray compared to karyotype or QF-PCR

A
  1. Will not detect all genetic abnormalities
  2. Cannot detect balanced chromosome rearrangements such as reciprocal translocations and inversions (karyotype can)
  3. Cannnot detect low level mosaicism
  4. Anomalies may occur which cannot always be recognised or properly interpreted
  5. Cannot detect mutations within a single gene
76
Q

When to Offer carrier screening

A
  1. Ethnic specific disorders
  2. For disorders cystic fibrosis and spinal muscular atrophy
  3. Women with family history
  4. Couples with consanguinity