Genomic Analysis/GWAS Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

___% of the genome is genic (exons and introns)

A

40%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

___% of the genome encodes for proteins

A

1%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

___% of an individuals genome consists of genetic diversity

A

1%
99% shared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

______ is one of two or more alternative forms of a DNA sequence

A

allele

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

_______ is a rare change to the DNA sequence that severely disrupts function and typically leads to disease
Occurs in «1% of the population

A

mutation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

_______ – a common change to the DNA sequence in which there are two or more different alleles. The minor allele occurs in >1% of the population

A

polymorphism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Polymorphisms occur roughly every ____ to _________bp (~10-15 million polymorphisms in the genome)

A

300 to 1000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

________ is a single nucleotide variant (SNV) which occurs >1% in the population

A

SNP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

_____ and ________ are the two major types of common genetic polymorphisms

A

SNPs
indels (< 50 bp)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

___________ are repeats of short sequences of DNA and are a type of InDel

A

Short tandem repeats (STR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

________ _____________ Includes deletions, duplications, insertions, inversions, and translocations

A

structural variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Copy number variations (CNVs) are a subset of structural variations that lead to a change in copy number (loss or gain) of a DNA fragment >______ bps (typically >___kb)

A

> 50 bps
typically > 1000 kb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

_______________ affect entire chromosomes (e.g., Trisomy 21)

A

aneuploidies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

SNPs occur every _____ to _________ bp

A

100 to 300

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

There are ______ SNPs in the human genome

A

10 million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

____________ = a unique and stable identifier for every SNP. It links the alleles/genotypes to a chromosomal location (which is updated with each genome build)

A

rsID

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

_____% of the genome is noncoding

A

80%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

___________ disease = Mendelian disease

A

Monogenic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

___________ - the range of disease presentation (e.g., mild vs. severe)

A

Expressivity

20
Q

Heritability varies by ________ and _____________

A

population
environment

21
Q

________: a physical location in the genome. Typically used to refer to a region of the genome associated with your trait of interest.

A

Locus/loci

22
Q

___________ Use family pedigrees of multiple affected individuals as genetic variants close together are linked

A

linkage analysis

23
Q

________ studies test for the statistical relationship between genotype and phenotype

A

association

24
Q

______________ analysis is best for variants with modest to high effect size

A

linkage

25
Q

Two genetic loci are ______ if they are transmitted together from parent to offspring more often than expected under independent assortment (50%)

A

linked

26
Q

Two genetic loci are in linkage _________ if this holds true (transmitted more than 50% together) at the population-level

A

disequilibrium

27
Q

______________ analysis – identifies the chromosomal location of a disease gene by looking for genetic markers that co-segregate with the disease phenotype
Simply put: Do affected individuals have a common haplotype seen more often than unaffected individuals?
We use genome-wide genetic markers (e.g., SNPs, microsatellites [STRs]) to tag loci

A

linkage

28
Q

Linkage studies report likelihood of linkage in terms of ______ _______ __________ scores

A

logarithm of the odds (LOD)

29
Q

Linkage analysis does not work well for variants with ______effect sizes typically seen in polygenic diseases

A

small effect sizes

30
Q

Complex traits are known as ____________ traits

A

polygenic

31
Q

With _____________ traits environment plays a role in etiology

A

polygenic

32
Q

______________ studies test for the statistical association between genotype and phenotype. Typically, under an additive genetic model, where each additional minor allele changes the phenotype by the same amount

A

association

33
Q

___________ gene studies test a limited number of variants/genes based on a priori information (e.g., biological hypotheses or focus on a linkage peak)
Very high false positive rate

A

candidate

34
Q

_____________ test millions of common genetic variants across the genome for a statistical relationship with your phenotype of interest

A

GWAS

35
Q

___________ regression is used to analyze GWAS

A

linear ( or logistic)

36
Q

GWAS ___________ plot is a tool to visualize our loci significantly associated with BMI

A

Manhattan
X - chromosomal location of gene
Y = statistical strength of association

37
Q

Approximately only ~_____million independent common genetic variants in the genome (in individuals of European ancestry)

A

1 million

38
Q

GWAS identify __________ of correlated SNPs in loci associated with your trait of interest. These loci may overlap genes, suggesting their importance. However, GWAS do not directly identify the causal genes in the locus.

A

clusters

39
Q

GWAS tend to name loci by their nearest ________, but this does not mean we are confident it is the causal gene

A

gene

40
Q

Binary traits/case-control outcomes (e.g., Obese vs. non-obese) are fit using a ________ regression and the beta is transformed into an ______ _______

A

logistic regression
Odds ratio (OR)

41
Q

Do GWAS studies translate well to other more diverse populations

A

No, not necessarily

42
Q

_________ ___________is a measure of an individual’s overall genetic risk or propensity for a given disease or trait

A

Polygenic Scores (PGS)

43
Q

How do you calculate a simple PRS?

A

The simplest model:
For each individual, and for each SNP, multiply the number of effect alleles (0, 1, or 2) by the effect (beta) on the trait of interest
PRS = Sum of these values (the aggregate risk for each individual)

44
Q

The higher your PRS the higher your ____________ genetic risk for the disease of interest

A

aggregated

45
Q

PRS = effect __________ x the effect (or _______) + A2E2 + A3E3

A

effect allele (0, 1, 2)
effect (beta)
Individual 1 PRS = 02.2 + 21.4 + 0*1.6 = 2.8

46
Q

More complex PRS models take into account _________-______ genetic effects and correlation between SNPs

A

genome-wide