Lecture 6&7 - Chemometrics Flashcards
what is chemometrics
a multivariate statistical analysis that is computationally intensive and is applied to chemical systems or processes to find patterns and trends in the data
what does multivariate mean
analyses multiple variables
what does computationally intensive mean
the need for a computer and algorithms
name 6 things chemometrics can help us do
- reduce complex datasets
- identify and quantify sample groups
- optimise our experimental parameters
- identify covariance and pick out important variables
- give reproducible measures of data
- visualise the data (a picture is better)
what is meant by covariance
which parts of the data are associated/not independent
what effect does chemometrics have on subjectivity
reduces it in the analysis of data but does not eliminate it completely as human still interpret the data
what can chemometric reveal that may have not been noticed
underlying/ not obvious trends between variables
what is the main aim of chemometrics
to maximise output and quality with minimal cost
chemometrics has been used in forensic science since 2009 - what benefits has this provided
improved efficiency in forensic workflow
better quality of the use of resources for forensic purposes
how is chemometric applicable to forensic science (6)
statistical framework
replaces use of unique and match without use of stats to support (subjectivity)
can counteract bias
quicker than manual data interpretation
don’t need an expert but someone does need to be able to interpret the output data
can predict trace behaviour
what can be identified using multivariate analysis that may not be seen in univariate analysis
outliers
what is univariate analysis
analysis that only considers one variable
give an example of where multivariate analysis may be beneficial in forensic analysis and the variables that could be considered
pollen dispersion
considering time of year and weather
fingermarks
considering sweatiness of someone and weather conditions
what are the 4 broad categories of chemometrics
experiment design (DOE)
exploratory data analysis (EDA - what is the data showing me)
classification
regression
what does the DOE (design of experiment) affect in forensic science
evidence collection, storage, analysis instrument selection and optimisation
what can the DOE part of chemometrics be used to streamline in forensics in the future
efficiency, quality and reproducibility by establishing optimised workflows
what does the regression part of chemometrics involve
a version calibration curves based on a linear y=mx+c relationship which maps the effect of multivariate independent variables
we can make educated predictions based of the curve
What happens in the EDA part of chemometrics
data is reduced by an algorithm that looks how variables correspond
identifies groupings of samples in complex datasets
visualises trends
making the data more manageable
EDA is an unsupervised technique - what does this mean
it explores the data without any prior assumptions or knowledge of the samples (reducing human bias and subjectivity)
what is a supervised technique
the building of classification rules for grouping samples together - done from EDA analysis
name the two most commonly used EDA techniques
CA = cluster analysis
PCA = principal component analysis
name 3 features of cluster analysis (CA)
unsupervised technique
samples are grouped into clusters based on a measure of similarity (a calculated distance)
the output is a dendrogram