Twins Flashcards

0
Q

When extracting the influence of D on a phenotype, what are the MZ and DZ coefficients? I.e. what do you need to multiply the non-additive genetic effects by in a path diagram?

A

MZ = 1

DZ = 1/4 (0.25)

1
Q

What does concordance mean?

A

The probability that a pair of individuals will both have a certain trait (i.e. both twins have the same psychopathology)

2
Q

What is the rule for model identification?

A

The number of unknown parameters must not exceed the number of predictive statistics (covariance MZ, covariance DZ, Vp)

3
Q

What does Vp stand for?

A

Variance of phenotype

4
Q

When would you decide to run an ADE model over an ACE model?

A

When the difference between the MZ and DZ correlations is more than half the MZ correlation, i.e. rMZ is more than twice rDZ (the heritability estimate would then be greater than the MZ correlation, which is not possible)

(This suggests that the twin resemblance cannot be due to additive genetic influences alone and that non-additive genetic influences are involved.)

E.g. rMZ = .6, rDZ = .25
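
The rule above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the heritability estimate is assumed to be twice the difference between the MZ and DZ correlations.

```python
def falconer_heritability(r_mz, r_dz):
    """Heritability estimate: twice the MZ-DZ correlation difference (assumed form)."""
    return 2 * (r_mz - r_dz)


def suggest_model(r_mz, r_dz):
    """Suggest 'ADE' when rMZ is more than twice rDZ, otherwise 'ACE'."""
    return "ADE" if r_mz > 2 * r_dz else "ACE"


# Values from the card: rMZ = .6, rDZ = .25
print(suggest_model(0.6, 0.25))                     # ADE
print(round(falconer_heritability(0.6, 0.25), 2))   # 0.7, which exceeds rMZ = 0.6
```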

5
Q

What does ‘free’ mean in Mx language?

A

It is set to TRUE or FALSE and specifies whether the parameters are being estimated (TRUE) or not (FALSE).

6
Q

What is a saturated model?

A

Most basic model to describe the data. It’s a perfectly fitting model as the covariances are treated as free parameters, so that their maximum likelihood estimates will be the sample covariances.

ACE model is compared to the saturated model (is it a better fit of the data or not?)

But there is a lot of noise!

Also used to check assumptions of an ACE model (equality of means, equality of variances, twin specific environment (twin-sib data), sex differences)

To obtain a baseline fit statistic (-2LL)

7
Q

Explain a variance covariance matrix

A

The numbers on the diagonal are the variances of each trait, and the off-diagonal numbers (which are symmetrical) are the covariances between traits.

8
Q

What is an identity matrix?

A

It has 1 on the diagonals and 0 on the off diagonals

Can be used to multiply with other matrices

9
Q

What can the -2 log-likelihood (-2LL) do?

A

It can quantify differences in fit between one model and another

10
Q

How could you tell a saturated model from an ACE model?

A

In an ACE model the variance for twin 1 and twin 2 will be the same but in a saturated model they are free to vary.

11
Q

What is R?

A

R is a programming language and a software environment

12
Q

What is OpenMx?

A

It is a package in R

13
Q

What does the library command do?

A

Makes packages available

14
Q

The recycling rule means?

A

Refers to a situation where an operation is applied to two vectors of different lengths and the shorter vector is repeated until it matches the length of the longer vector.

15
Q

What is a vector?

A

A sequence of data elements of the same basic type. Members in a vector are officially called components.

16
Q

What is a matrix?

A

A matrix is a collection of data elements arranged in a two-dimensional rectangular layout. The data must all be of the same type.

17
Q

What is a data frame?

A

A table, or two-dimensional array-like structure, in which each column contains measurements on one variable and each row contains one case. (A case is not necessarily the same as an experimental subject.) Data can be of different types.

18
Q

“a” “b” “c” “d” “e” “f” “g” “h” “i” “j”

What letters does

c(2:5,8) refer to?

What about (4:8)?

A

b c d e and h

d e f g h

19
Q

When indexing how do you drop an element from the analysis? (E.g item 1 in a vector)

A

Z[-1]

20
Q

How do you select elements from a vector? (E.g 1 & 4 & 7)

A

Z[c(1,4,7)]

21
Q

What does c stand for in the R programming language?

A

Concatenation (combine)

22
Q

When indexing a matrix, what is the mnemonic to help remember the order in which indices are written or read?

A

Roman Catholic

rows then columns!

23
Q

How do you define a string variable in R?

A

Quotation marks

24
Q

Quantitative genetics is based on what theory?

A

Biometrical genetic theory

25
Q

What does quantitative genetics aim to do?

A

Infer a direct relationship between the observed variance of a trait and unobserved genetic, shared environmental and unique environmental factors

26
Q

How does quantitative genetics make predictions of the effects of underlying latent factors?

A

By using the relatedness between individuals (e.g. MZ twins are clones of each other and DZ twins share on average half, .5, of their segregating genes)

27
Q

What three groups of participants do you get from an adoption design?

A

Genetic relatives
Environmental relatives
And genetic and environmental relatives

28
Q

Define non-additive genetic effects (dominance and epistasis)

A

Dominance is an interaction between alleles at the same locus

Epistasis is an interaction between alleles at different loci.

(Dominance does not contribute to the genetic covariance between parents and offspring)

29
Q

Which part of ACE contains error?

A

E

30
Q

In what two ways can phenotypic variance be decomposed?

A

ACE

ADE

31
Q

What is Falconer's formula for the correlations for MZ and DZ twins?

A

rMZ = A + C

rDZ = .5A + C
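
As a rough sketch, the two equations above can be solved for A and C once rMZ and rDZ are known. The function name is hypothetical, and the E estimate assumes a standardised total variance (A + C + E = 1), which is not stated on this card.

```python
def falconer_estimates(r_mz, r_dz):
    """Solve rMZ = A + C and rDZ = .5A + C for A and C; E assumes A + C + E = 1."""
    a = 2 * (r_mz - r_dz)   # rMZ - rDZ = .5A
    c = r_mz - a            # from rMZ = A + C
    e = 1 - r_mz            # assumption: standardised variance
    return a, c, e


# Hypothetical correlations, for illustration only
a, c, e = falconer_estimates(0.8, 0.5)
print(round(a, 2), round(c, 2), round(e, 2))  # 0.6 0.2 0.2
```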

32
Q

What are three assumptions of the twin model?

A

Equal environments

Genotype-environment effects (random mating, no G×E interactions and no G-E correlations)

Generalisability

33
Q

What does path analysis allow us to do?

A

Represent linear models for the relationship between variables in a diagram.

Makes it easy to derive expectations for the variances and covariances of variables in terms of the parameters of the proposed linear model.

Also permits easy translation into matrix formulation as used by programs such as Mx and OpenMx

34
Q

In path analysis what do squares or rectangles represent?

A

Observed variables

35
Q

In path analysis what do circles or ellipses denote?

A

Latent (unmeasured) variables

36
Q

In path analysis what do upper-case letters denote?

A

Variables

37
Q

In path analysis what do lower-case letters or numeric values denote?

A

Covariances or path coefficients

38
Q

In path analysis what do single-headed arrows or paths (->) represent?

A

Hypothesised causal relationships where the variable at the tail is hypothesised to have a direct causal influence on the variable at the head.

39
Q

In path analysis what do double headed arrows represent?

A

Covariance between two variables, which may arise from a common cause not represented in the model.

Double headed arrows may also be used to represent the variance of a variable.

40
Q

What is meant by the term causal in path analysis?

A

The meaning of causal is the assumption that change in the variable at the tail of the arrow will result in change in the variable at the head of the arrow, with all other variables in the diagram held constant. The causal relationships represented by straight arrows are assumed to be linear.

41
Q

In path analysis what are variables that do not receive causal input from any other variable called?

A

Independent, source or predictor variables. Or exogenous

In general, only independent variables are connected by double-headed arrows

42
Q

In path analysis what are variables that do receive causal input from another variable called?

A

Dependent variables or endogenous variables

43
Q

In path analysis where can single headed arrows be drawn from?

A

From independent to dependent variables and from dependent to dependent variables.

44
Q

In path analysis what does omission of a two-headed arrow between two independent variables imply?

A

That the covariance of those variables is zero

45
Q

In path analysis what does omission of a direct path from an independent or dependent variable to a dependent variable imply?

A

That there is no direct causal effect of the former on the latter variable.

46
Q

In path tracing the covariance between any two variables can be calculated by?

A

Summing all legitimate chains connecting the variables

The numerical value of a chain is the product of all the traced path coefficients in it.

47
Q

In path tracing what are the three rules?

A

Trace backwards then forwards, or simply forwards, from one variable to another. NEVER forwards then backwards. (Include double-headed arrows from an independent variable to itself; these variances will be 1 for latent variables.)

Loops are not allowed, i.e. we cannot trace twice through the same variable.

There is a maximum of one curved arrow per path, so the double-headed arrow from an independent variable to itself is included unless the chain includes another double-headed arrow (e.g. a correlation path).
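
A minimal sketch of what these rules give for a univariate ACE model, assuming latent variances of 1, a correlation of 1 (MZ) or 0.5 (DZ) between the twins' A factors, and a correlation of 1 between their C factors. The function name and the numeric path values are hypothetical.

```python
def ace_expectations(a, c, e):
    """Expected variance and twin covariances from path coefficients a, c, e.

    Each chain is: path up to the latent factor, the double-headed arrow
    (variance or correlation), then the path down to the phenotype.
    """
    variance = a * 1.0 * a + c * 1.0 * c + e * 1.0 * e
    cov_mz = a * 1.0 * a + c * 1.0 * c   # A factors correlate 1.0 in MZ pairs
    cov_dz = a * 0.5 * a + c * 1.0 * c   # A factors correlate 0.5 in DZ pairs
    return variance, cov_mz, cov_dz


variance, cov_mz, cov_dz = ace_expectations(0.8, 0.4, 0.4)
print(round(variance, 2), round(cov_mz, 2), round(cov_dz, 2))  # 0.96 0.8 0.48
```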

48
Q

In path tracing what is the variance?

A

Since the variance of a variable is the covariance of the variable with itself, the expected variance will be the sum of all paths from the variable to itself that follow the path tracing rules.

49
Q

A, C, D and E are all what? (2 points)

A

Independent variables and parameter estimates.

50
Q

Why in the classical twin model can only 3 model parameters be estimated at a time? (ACE OR ADE)

A

The number of parameters estimated cannot exceed the number of predictive statistics (covariance of MZ, covariance of DZ and Vp). When 3 parameters are unknown the model is just identified.

Note that E must always be in the model as it contains error.

51
Q

How can we estimate A C E AND D in one model?

A

By combining twin and adoption data it is possible to work them out:

Cov(MZ) = a² + d² + c²

Cov(DZ) = 1/2 a² + 1/4 d² + c²

Cov(adoptive sibs) = c²

Vp = a² + d² + c² + e²

4 unknown parameters (a, c, d and e) and 4 predictive statistics.
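
As an illustration, the four equations can be solved by simple substitution. The function name and the input statistics below are hypothetical; the algebra follows directly from the equations on this card.

```python
def solve_acde(cov_mz, cov_dz, cov_adoptive_sibs, v_p):
    """Solve the four expectations for a2, c2, d2 and e2 (squared path coefficients)."""
    c2 = cov_adoptive_sibs        # Cov(adoptive sibs) = c2
    s = cov_mz - c2               # a2 + d2
    t = cov_dz - c2               # a2/2 + d2/4
    a2 = 4 * t - s                # 4t = 2*a2 + d2, so 4t - s = a2
    d2 = s - a2
    e2 = v_p - cov_mz             # Vp - Cov(MZ) = e2
    return {"a2": a2, "c2": c2, "d2": d2, "e2": e2}


# Hypothetical statistics generated from a2 = .3, c2 = .1, d2 = .2, e2 = .4
estimates = solve_acde(cov_mz=0.6, cov_dz=0.3, cov_adoptive_sibs=0.1, v_p=1.0)
print({name: round(value, 2) for name, value in estimates.items()})
```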

52
Q

How do you calculate the variance of a set of data?

A
  1. Calculate the mean
  2. Calculate each squared deviation (subtract the mean from each observation and square individually)
  3. Divide the sum of the squared deviations by (N-1)
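
The three steps can be sketched in Python as follows (the function name and data are made up for illustration):

```python
def sample_variance(values):
    """Variance of a sample: sum of squared deviations divided by N - 1."""
    n = len(values)
    mean = sum(values) / n                                    # step 1
    squared_deviations = [(v - mean) ** 2 for v in values]    # step 2
    return sum(squared_deviations) / (n - 1)                  # step 3


print(sample_variance([1, 2, 3, 4, 5]))  # 2.5
```
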
53
Q

How do you calculate the covariance?

A
  1. Calculate the mean for each variable
  2. Calculate each deviation (subtract the mean for variable 1 from each observation and do the same for variable 2)
  3. Multiply the deviations for variable 1 and variable 2
  4. Sum up all the multiplied deviations
  5. Divide the sum of the multiplied deviations by (N-1)
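
A short Python sketch of the five steps (the function name and data are made up for illustration):

```python
def sample_covariance(x, y):
    """Covariance of two paired samples: summed cross-products of deviations over N - 1."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same number of observations")
    n = len(x)
    mean_x = sum(x) / n                                                   # step 1
    mean_y = sum(y) / n
    products = [(xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)]    # steps 2-3
    return sum(products) / (n - 1)                                        # steps 4-5


print(sample_covariance([1, 2, 3, 4, 5], [2, 4, 6, 8, 10]))  # 5.0
```
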
54
Q

How are ACE assumptions tested using a saturated model?

(Equality of means across twins, across zygosity groups and across both; equality of variances across twins, across zygosity groups and across both.)

A

All tested by comparison of fit statistics

55
Q

What does the saturated model do?

A

Allows us to test assumptions that underlie the ACE twin model

And to equate means and variances across twin 1 and twin 2 and across the MZ and DZ zygosity groups

56
Q

What is matrix algebra?

A

Branch of mathematics devoted to working with matrices

57
Q

What is element-wise multiplication?

A

Requires two matrices of the same dimensions

Multiplies corresponding elements

Less common than the more complex matrix multiplications
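
A minimal sketch of element-wise multiplication using plain nested lists (the function name is hypothetical):

```python
def elementwise_multiply(a, b):
    """Multiply corresponding elements of two matrices with identical dimensions."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("matrices must have the same dimensions")
    return [[x * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


print(elementwise_multiply([[1, 2], [3, 4]], [[5, 6], [7, 8]]))  # [[5, 12], [21, 32]]
```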

58
Q

What is another name for element-wise multiplication?

A

Dot product (the term used in Mx; also known as the Hadamard product)

59
Q

What is the rule for matrix multiplication?

A

The number of columns in the 1st matrix must equal the number of rows in the 2nd matrix.

60
Q

How do you determine the size of the matrix required after matrix multiplication?

A

The number of rows of matrix one by the number of columns of matrix two.
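
The conformability rule and the size of the result can be illustrated with a small sketch on nested lists (the function name is hypothetical):

```python
def matrix_multiply(a, b):
    """Matrix product: columns of a must equal rows of b; result is rows(a) x cols(b)."""
    rows_a, cols_a = len(a), len(a[0])
    rows_b, cols_b = len(b), len(b[0])
    if cols_a != rows_b:
        raise ValueError("columns of the 1st matrix must equal rows of the 2nd")
    return [
        [sum(a[i][k] * b[k][j] for k in range(cols_a)) for j in range(cols_b)]
        for i in range(rows_a)
    ]


a = [[1, 2, 3], [4, 5, 6]]          # 2 x 3
b = [[7, 8], [9, 10], [11, 12]]     # 3 x 2
print(matrix_multiply(a, b))        # 2 x 2 result: [[58, 64], [139, 154]]
```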

61
Q

What is a cross-product?

A

Used in the context of vector multiplication:

We have 2 vectors of the same length (same number of elements)

The cross-product (x) is the sum of the products of the corresponding elements, e.g.

Vector 1 = {a b c}, vector 2 = {d e f}

V1 x V2 = a*d + b*e + c*f
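
The same sum of products, sketched in Python with made-up numbers (the function name is hypothetical):

```python
def sum_of_products(v1, v2):
    """Sum of the products of corresponding elements of two equal-length vectors."""
    if len(v1) != len(v2):
        raise ValueError("vectors must have the same length")
    return sum(x * y for x, y in zip(v1, v2))


print(sum_of_products([1, 2, 3], [4, 5, 6]))  # 1*4 + 2*5 + 3*6 = 32
```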

62
Q

Why is matrix multiplication not commutative?

A

Because, in general, A * B does not equal B * A
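
A small numeric check, using two made-up 2 x 2 matrices and a hypothetical helper `matmul`:

```python
def matmul(a, b):
    """Matrix product of two conformable matrices stored as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


a = [[1, 2], [3, 4]]
b = [[0, 1], [1, 0]]
print(matmul(a, b))  # [[2, 1], [4, 3]]
print(matmul(b, a))  # [[3, 4], [1, 2]]
```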

63
Q

Things to remember about matrix multiplications

A

The product of a lower triangular matrix and its transpose is symmetric

Not all matrices were made to multiply with one another
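
The symmetry of a lower triangular matrix multiplied by its transpose can be checked with a small sketch (helper names and values are made up for illustration):

```python
def transpose(m):
    """Swap the rows and columns of a matrix stored as nested lists."""
    return [list(column) for column in zip(*m)]


def matmul(a, b):
    """Matrix product of two conformable matrices stored as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


lower = [[1, 0], [2, 3]]                 # zeros above the diagonal
product = matmul(lower, transpose(lower))
print(product)                           # [[1, 2], [2, 13]]
print(product == transpose(product))     # True: the product is symmetric
```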

64
Q

What do multivariate models want to discover?

A

The reason behind covariance/correlation between traits

Do the same genes/environments influence different traits?

As well as decomposing the variance of each trait, we can decompose the co-variance between two traits

65
Q

Is a phenotypic correlation required for a genetic correlation to exist?

A

No

66
Q

Explain the difference between a univariate and a bivariate decomposition of variance?

Using the example of bmi and waist

A

Univariate = BMI for twin 1 and BMI for twin 2

Bivariate = BMI for twin 1 and waist for twin 2 (cross-twin cross-trait correlations)

67
Q

What do the cross-twin covariances, in combination with the variances, enable us to calculate?

A

A C and E for e.g. BMI and waist

68
Q

What do the within-twin, cross-trait covariances tell us?

A

Tells us the phenotypic covariance between traits (these are held the same across twins).

69
Q

What do the cross-twin, cross-trait covariances tell us?

A

These are held the same across twins and contain the information needed to calculate the A, C and E contributions to the covariance between traits.

70
Q

2 facts about bivariate matrices?

A

Matrices are symmetrical, and variances/covariances are held to be the same across twins

71
Q

When should you use a Cholesky decomposition over a correlated factors model?

A

If you have good reason to sequence variables in a specific manner (e.g. if the data are longitudinal)

If not, then the correlated factors model is the most interpretable model to use.

72
Q

What are the assumptions of the correlated factors solution?

A

Each variable is influenced by a set of genetic, shared and non-shared environmental factors

The factors associated with each variable are allowed to correlate with each other through rA rC and rE

Correlations among phenotypes are a function of rA, rC and rE and the standardised A, C and E paths connecting them

73
Q

What do rbind and cbind mean?

A

Row bind and column bind

They combine vectors, matrices or data frames by rows (rbind) or by columns (cbind).

74
Q

In R when free=TRUE what does this mean?

A

It means that the parameters of the model are being estimated

75
Q

When would you run an A E model?

A

When you want to compare the fit of the two models (ACE and AE) to test the significance of a parameter (shared environment). Remember that you can never drop the non-shared environment, as it contains error.
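
A rough sketch of such a comparison, assuming the difference in -2LL between the nested (AE) and fuller (ACE) model is treated as a chi-square statistic with 1 degree of freedom (one parameter dropped). The function name and the -2LL values are hypothetical, and the p-value formula used here is valid for 1 degree of freedom only.

```python
from math import erfc, sqrt


def likelihood_ratio_test_1df(minus2ll_full, minus2ll_nested):
    """Chi-square difference and p-value for dropping ONE parameter (df = 1)."""
    chi_square = minus2ll_nested - minus2ll_full
    # Upper-tail probability of a chi-square with 1 df: erfc(sqrt(x / 2))
    p_value = erfc(sqrt(max(chi_square, 0.0) / 2))
    return chi_square, p_value


# Hypothetical fit statistics: ACE -2LL = 4000.0, AE -2LL = 4006.5
chi_square, p_value = likelihood_ratio_test_1df(4000.0, 4006.5)
print(chi_square, p_value < 0.05)  # 6.5 True: dropping C worsens the fit significantly
```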

76
Q

What do small a c and e denote?

A

Path coefficients

77
Q

What do capital A C and E denote?

A

Variances and covariances

78
Q

How do you work out the phenotypic correlation between two traits?

A

It’s the sum of the components attributable to A, C and E:

(square root of a² for trait 1 × genetic correlation rA × square root of a² for trait 2) + the same for the shared and non-shared environments.

This should approximate the phenotypic (Pearson's) correlation.
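
The calculation can be sketched as follows. The function name and all numeric values are hypothetical; the variance components are assumed to be standardised (summing to 1 for each trait).

```python
from math import sqrt


def phenotypic_correlation(a2, c2, e2, r_a, r_c, r_e):
    """Return the A, C and E contributions and their sum.

    a2, c2 and e2 are (trait 1, trait 2) pairs of standardised variance components.
    """
    genetic = sqrt(a2[0]) * r_a * sqrt(a2[1])
    shared = sqrt(c2[0]) * r_c * sqrt(c2[1])
    nonshared = sqrt(e2[0]) * r_e * sqrt(e2[1])
    return genetic, shared, nonshared, genetic + shared + nonshared


genetic, shared, nonshared, total = phenotypic_correlation(
    a2=(0.64, 0.49), c2=(0.16, 0.25), e2=(0.20, 0.26),
    r_a=0.5, r_c=1.0, r_e=0.0,
)
print(round(genetic, 2), round(shared, 2), round(nonshared, 2), round(total, 2))
```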

79
Q

How do you calculate the proportion of the phenotypic correlation that is explained by A C and E?

A

You work out the overall phenotypic correlation and then divide each of the genetic, shared environmental and non-shared environmental contributions by the total phenotypic correlation.

E.g.
A = .56/.75 = .75
C = .10/.75 = .13
E = .09/.75 = .12

This means that 75% of the phenotypic correlation can be attributed to additive genetic effects.
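
The worked example above can be reproduced with a short sketch (the function name is hypothetical; the numbers are those on the card):

```python
def correlation_proportions(genetic, shared, nonshared):
    """Proportion of the phenotypic correlation attributable to A, C and E."""
    total = genetic + shared + nonshared
    return {"A": genetic / total, "C": shared / total, "E": nonshared / total}


proportions = correlation_proportions(0.56, 0.10, 0.09)
print({name: round(value, 2) for name, value in proportions.items()})
# {'A': 0.75, 'C': 0.13, 'E': 0.12}
```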

80
Q

Why can't you interpret the shared aetiology of two traits just by looking at the genetic and environmental correlations?

A

You need to look at both the value of the correlation and the pathways in order to understand the aetiology of a phenotypic correlation.

As for example weakly heritable traits can still have a large proportion of their correlations attributable to genetic effects

E.g. rA = .3 and rA = .7 can have the same proportion of the phenotypic correlation attributable to genetic effects.

81
Q

How is the goodness of fit of the model measured?

A

It is measured relative to the perfectly fitting (saturated) model by the likelihood ratio chi-square (χ²) statistic.

82
Q

What does a significant χ² result mean when testing the goodness of fit?

A

It means that the model provides a poor fit to the data and can be rejected.

83
Q

What are the degrees of freedom for χ²?

A

The number of observed statistics (normally the 3 variances and covariances in each of the MZ and DZ groups) minus the number of parameters being estimated in the model.

84
Q

How has the equal environments assumption been tested?

A

By observing twin correlations for twins whose zygosity has been mislabelled: if the assumption were violated, MZ twins mislabelled as DZ twins should be less similar than correctly labelled MZ twins. This has not been found.

MZ twins reared apart have provided correlations for personality variables that are almost the same as those for MZ twins reared together.

The amount of contact between twins has also been tested, and no difference between MZ and same-sex DZ twins has been found. (There is a slight difference between high-contact and low-contact MZ twins, but the effect is small.)

85
Q

What is assortative mating?

A

Non-random pairing of mates on the basis of factors other than biological relatedness.

(Tested by observing the phenotypic correlation between parents over time of the trait in question)