Kew Internship Flashcards
RBG Kew Study Limitations
Our study does have certain limitations that need to be acknowledged. Firstly, the utilisation of herbarium specimens, which consist of dried material, poses some challenges. It is important to note that these specimens vary in quality, and that some are damaged or degraded, potentially impacting the accuracy and reliability of our observations.
Another limitation is related to georeferencing, which relies on the accuracy of the data recording. The precision of our georeferencing is contingent upon the quality and completeness of the available data. Therefore, any inaccuracies or limitations in the georeferencing data may influence the reliability of our findings.
Additionally, our study is constrained by a relatively small sample size. As a result, it becomes challenging to generate statistically significant p-values. However, it is crucial to emphasize that our approach is exploratory in nature, aiming to uncover initial trends and patterns that can guide further research in this area.
Furthermore, it is important to understand that the relationship between stomatal size and drought tolerance is not universally linear. Different plant species and genotypes may exhibit distinct responses to variations in stomatal size under drought stress. Moreover, a plant’s overall drought tolerance is influenced by various traits, including stomatal density, leaf anatomy, and root characteristics. Therefore, it is essential to consider these additional factors alongside the four chosen leaf traits when evaluating a plant’s response to drought conditions.
Additionally, there is an ongoing debate in the field of functional ecology regarding the predictability of traits from environmental factors and the possibility of isolating single climatic variables and traits amid the complexity of drought conditions. This discussion highlights the challenges and complexities associated with understanding and predicting plant responses to drought, underscoring the need for comprehensive research and analysis in this field.
Fourth Corner Analysis
Fourth corner analysis allows the researcher to explore the relationship between species traits, environmental variables, and community composition. It can help identify the environmental variables that are most strongly associated with the distribution of species traits.
Phylogenetic Principal Component Analysis (PCA)
PPCA is an extension of the traditional PCA method that incorporates information about the phylogenetic relationships among species.
- It considers the evolutionary relatedness of species when analysing and interpreting patterns of variation in multivariate data.
NMDS
Non-metric Multi-Dimensional Scaling is a technique used for visualizing and analysing dissimilarity or distance matrices.
- multiple variables and want to explore the similarity or dissimilarity patterns among your samples.
- If samples with similar trait values tend to cluster together or show a gradient in the reduced-dimensional space, the pattern would suggest a potential association between the leaf traits and climatic variables.
NMDS is a nonlinear ordination method that can handle complex relationships between variables.
Unlike PCA, NMDS does not assume linear relationships or normality. This flexibility makes NMDS suitable for capturing non-linear patterns and exploring complex ecological data.
What is an ANOVA?
ANOVA tells you if there are any statistical differences between the means of three or more independent groups.
If there is a lot of variance (spread of data away from the mean) within the data groups, then there is more chance that the mean of a sample selected from the data will be different due to chance
All these elements are combined into a F value, which can then be analysed to give a probability (p-value) of whether or not differences between your groups are statistically significant.
What is an eigenvalue?
PCA
- variation in distance along each principal component
What is continuous variable?
A continuous variable is a variable that can take on any value within a range (infinite values)
What is a discrete variable?
A discrete variable is a variable that takes on distinct, countable values
Summarise your palm findings
Data is still being analysed
Why did you pick yams?
What is a correlation?
any statistical relationship, whether causal or not, between two random variables or bivariate data
Why does this study matter?
What are the implications for yam conservation?
What’s next for the research?
If you had £1m what would you do to continue the research?
repeat in the field
consider tuber yield
growth habit
transplant experiment
Why did you sample like that?
How many known species?
Explain yam diversification onto Madagascar
Define climatic niche?
What factors did you not include in your niche?
Soil type
Humidity
Altitude - directly
Natural Disturbances: Consider other natural disturbances, such as floods or storms
Groundwater Level
pH
How does Stomatal size directly affects the rate of gas exchange in plants?
Larger stomata enhance the efficiency of gas diffusion, supporting processes like photosynthesis, but they also influence water loss through transpiration.
How does genome size determine stomatal size?
Sets the minimum nucleus - cell
Dioscorea is polyploid, how could that affect stomatal size?
What is a eudicot?
The eudicots, Eudicotidae, or eudicotyledons are a clade of flowering plants mainly characterized by having two seed leaves upon germination.
Explain: (Sandel et al., 2015) detected trait value biases in frequently measured against rarely measured species
Why is still some way to go before image processing technology can be used reliably, especially for less well studied crops (Li et al., 2023).
Heberling Protocols for SLA
“Specific Leaf Area data can also be gathered, provided that adjusted protocols are used (Heberling, 2022).”
What were your hypotheses?
“The first hypothesis questions whether a correlation exists between these traits and climatic conditions at the genus, bioregion and species level.
How did you supervisors make the SDMs?
The supervisors of the project provided Species Distribution Maps of EMYs and a dated Phylogenetic Tree of the Malagasy Clade
How did you supervisors make the dated Phylogenetic Tree of the Malagasy Clade
This tree was built using 260 targeted **low-copy nuclear genes **following Soto Gomez et al. (2019). HybPiper (Johnson et al. 2016)
was used to assemble and extract the targeted genes, alignments were built and edited with
MAFFT (Katoh & Standley 2013) and TRIMAL (Capella-Gutiérrez et al. 2009), respectively.
Phylogenetic trees were built using a supermatrix approach in RAxML-NG (Kozlov et al. 2019)
using the GTR+G molecular evolution model and 1000 bootstrap replicates. Divergence times
were estimated using treePL (Smith & O’Meara 2012) by calibrating the stem node of D.
sansibarensis at 22.5548 Ma (HPD: 15.5797-29.1444 Ma) following Viruel et al. (2016).
Overall, the phylogenetic relationships are very well supported by the bootstrap values. The earliest diverged species appears to be D. sansibarensis. Then the MEYs clade can be divided in four main subclades: the subclade I including 4 species, the subclade II including 9 species,
the subclade III including 16 species and the subclade IV including 4 species. D. arcuatinervis
occupies a very distinct position in the phylogeny and cannot be attributed to any of those
subclades. Finally, an accumulation curve of the number of lineages through time was produced
How did you clean? What did you remove?
The EMY-GBIF dataset was cleaned using CoordinateCleaner package (Zizka et al., 2019) to remove absent or inaccurate coordinates.
Why WorldClim, why 30 arc seconds?
Bioclimatic variables from the WorldClim 2.1 Dataset with a spatial resolution of 30 arc seconds.
What does a PCA do and why can we use it to identify climate niches?
What are the caveats of using a PCA to estimate species niche?
Why did you pick 0.8 to 0.8 collinearity?
Explain how you selected herbarium material?
K-means clustering algorithm was used in conjunction with a dendrogram
Leaf samples were mounted on Scanning Electron Microscope (SEM) stubs and sputter-coated with platinum (21.45g/m3) for 120s at 30mA using the Quorum Q150T ES.
Why Platinum? Why that time and speed?
A suitable platinum coating needs to be only half the thickness required for gold, resulting in less specimen distortion and giving superior peak-to-background ratio in X-ray analysis.
Why do you test for normal distribution using the Shapiro-Wilk test (shapiro.test() function? How does it work?
The Shapiro–Wilk test is essentially a goodness-of-fit test. That is, it examines how close the sample data fit to a normal distribution. It does this by ordering and standardizing the sample (standardizing refers to converting the data to a distribution with mean μ = 0 and standard deviation σ = 1 ).
Explain why you: Using the fitContinuous() function from the ‘geiger’ package (Pennell, 2014), five evolutionary models were applied to the trait data: Brownian Motion, Trend, Lambda, Ornstein-Uhlenbeck and Early Burst. Models were ranked according to minimum Akaike Information Criterion corrected (AICc) value and the maximum log-likelihood value (lnL) with the Lambda value reflecting the strength of the phylogenetic signal for each trait.
Explain PGLS
The trait-specific lambda values were then used in a Phylogenetic Generalised Least Squares (PGLS) analysis to examine the relationships between the traits and climate variables while controlling for phylogenetic signal. PGLS models were fitted using the ‘gls’ function from the ‘nlme’ package (Pinheiro, Bates and R Core Team, 2023), with correlation structure specified as Pagel’s lambda.
What are the pros and cons?
Why did you average the models?
The models were averaged using the model.avg() function from MuMIN
- acknowledge that there might be multiple models that could be used to describe our data
- by weighting the average we can communicate how confident we are in each individual model’s view of the world.
- model averaging usually reducing prediction errors beyond even above even the best individual component model
Caveats of small sample size
The sample size for each species was too small to build complex predictive models at the species level.
Why did you use different packages in the second pca?
To conduct a PCA to identify patterns in the trait data and visualize the trait space, we used factoextra (Kassambara and Mundt, 2020) and factoMineR (Le, Josse and Husson, 2008). SLA, PL and SD were used in the analysis, and the resulting PCA plot displayed individuals as points in the trait space.
Why was the size of the trait space, representing the variation in trait values for a species, hypothesized to be associated with climatic resilience?
Explain your trait PCA
Why did you log10 some of the values sometimes?
Trait x Bioregion Table
Why were there no significant differences detected for SD and invPLSD in Bioregion
Phylogenetic Model Define:
log likelihood score
Phylogenetic Model Define:
AICc.
Phylogenetic Model Define:
Lambda
indicates the strength of the phylogenetic signal, with 0 signalling no phylogenetic signal and 1 indicating a Brownian Motion model.
What is Brownian Motion
In the dry season Why does pore length increase as precipitation in the driest month increase if there are no leaves?
Why was SA_02
collected from continental
Africa?
Why only 3 namorokensis?
What are the implications of different no. individuals on statistical tests?
Explain the following: Therefore, present-day geographic distribution of the clades is thought to reflect dispersal ability and the formation of key geographic regions, such as the Central Highlands, but this affected different clades homogenously. Thus, any similarities observed for traits and species occurring in the same bioregions of Madagascar would probably be explained by a similar response to climatic conditions achieved independently in different clades, rather than following a biogeographic pattern linked to the phylogenetic topology.
Tell me about D. pteropoda
Tell me about D. sansibarensis
Found across Africa and Madagascar
atypical. It is often associated with watercourses and is likely to have green shoots and leaves year round in riverine forest (Wilkin, 2023). This may explain why D. sansibarensis shows a different InvPLSD trend to the other species in this region. D. sansibarensis individuals in areas with higher precipitation were found to increase pore length and decrease stomatal density.
Tell me about D. proteiformis
Humid species, found exclusively in coastal regions in sub-clade I. Varies all 3 traits in response to climatic variables.
Precipitation: Increase SLA and SD, PL decreased.
Temp seasonality: increase PL, decrease SLA
Temperature: reduce PL, increase SLA
Tell me about D. bako
Dry and Subclade III.
Tell me about D. bemarivensis
Dry and Sub-arid
Tell me about D. seriflora
Humid Region
X Datapoints
Trends
Tell me about D. heteropoda
Which species were in the dry region?
D. bemarivensis
D. namorokensis
D. bako
D. maciba
D. pteropoda
D. sansibarensis
Which species were in the humid region?
Proteiformis
Madecassa
Seriflora
Which species were in the sub-humid region?
Heteropoda
Hexagona
Seriflora
Which species were in the sub-arid region?
D. bemarivensis
D. nako
D. hombuka
D. maciba
D. sansibarensis
Which species were found in multiple regions?
Seriflora - sub humid and humid
Maciba, bemarivensis and sansibarensis - dry and subarid
Tell me about D. madecassa
Tell me about D. hexagona
Tell me about D. namorokensis
Tell me about D. nako?
Tell me about D. namorokensis
Tell me about D. maciba
Are bioregions are good scale to investigate?
Yes and No.
Yes
No - E.g. dry region. Habitat variety and sub climates
Tell me about D. hombuka
Dioscorea hombuka is unique because it is found in both the Sub-Arid and Sub-Humid regions, suggesting D. hombuka may change its growth rate and habit in response to varying temperature seasonality in different local climatic contexts
Why would stomatal density only correlate in the sub-humid region?
The creation of the highlands established empty ecological niches in a milder and more stable climate compared to the other regions which may explain why stomatal density, a more complex developmental trait, is only significantly correlated with climate in this region.
How does dredge work?
Models are fitted through repeated evaluation of the modified call extracted from the global.model.
“Let the computer find out” is a poor strategy and usually reflects the fact that the researcher did not bother to think clearly about the problem of interest and its scientific setting (Burnham and Anderson,2002).
Can you explain what this line means?
Here, the model evaluation found that additive effects of annual precipitation + isothermality + precipitation in the driest month + annual temperature range best predicted stomatal density values.
Why is this important?
D. hexagona, the only selected species found exclusively in the sub-humid region only had one significant trait-climate relationship. It was found to increase pore length and decrease stomatal density in response to higher maximum temperatures. Unlike other EMYs, which are vines, D. hexagona is a unique in that it can show a strictly upright dwarf habit when growing in the savannahs of the Central highlands
Similar observations in dwarf barley and wheat. Brassinosteroids affecting physiological reactions to drought e.g. antioxidant enzymes ascorbate peroxidase (HvAPX) and superoxide dismutase (HvSOD) was BR-dependent.
Why is distance from centre of PCA used as a proxy for distinctiveness?
D. proteiformis found to vary all three traits in response to different climatic variables which may reflect its adaptation to its distinct coastal distribution. D. proteiformis was found to occupy one of most distinct climate niches (using distance from PCA centroid as a proxy).
Why? With a near constant abundance of water, D. proteiformis increases stomatal density and reduced pore length in areas with higher levels of precipitation, which may be an adaptation to prevent overhydration in the Humid region
What other adaptations exist to prevent overhydration?
How do you measure phenotypic plasticity?