Summarising Data Flashcards by Shauna Angell

Why is it important to summarise data in neuroscience?

Using statistics to summarise data allows the data to be presented and communicated, and quantifies the variation and uncertainty in them.

CLARITY AND UNDERSTANDING

Distil large amounts of complex data into an understandable form. Allows easier identification of patterns and insights.

HYPOTHESIS TESTING

Focusses attention on the essential data, so that more accurate conclusions can be drawn about whether hypotheses are supported or refuted.

EFFICIENT COMMUNICATION

Summarised data can be clearly and concisely presented for sharing in journals, conferences etc.

RESOURCE MANAGEMENT

Focussing on the most relevant data allows time, resources and funding to be directed where the research will be most impactful.

META-ANALYSES AND GENERALISATION

Summarised data are needed to combine results from multiple studies, which increases the statistical power and generalisability of findings.

DATA INTEGRITY AND REPRODUCIBILITY

Clearly documented summarised data allow other researchers to replicate experiments.

DEVELOPMENT OF THEORIES

Summarised data reveal consistent patterns, from which theoretical models can be developed and refined.

What are nominal data?

Categorical data without a specific order e.g., blood types.

What are ordinal data?

Categorical data with a meaningful order but no consistent interval between categories e.g., stages of cancer.

What are discrete data?

Countable values (typically integers) e.g., number of hospital visits.

What are continuous data?

Values can fall anywhere within a specified range e.g., blood pressure.

What are interval data?

Numerical data with meaningful intervals between values but without a true zero point e.g., temperature in degrees Celsius.

What are ratio data?

Numerical data with equal intervals and a true zero point which allows for the calculation of ratios e.g., height, weight.

MEASURES OF CENTRAL TENDENCY

Mean

Median

Mode.
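
As a rough illustration of the three measures above, the sketch below uses Python's standard `statistics` module. The reaction-time values are made up for the example and are not taken from the flashcards.

```python
import statistics

# Hypothetical reaction times in milliseconds (illustrative values only)
reaction_times = [310, 295, 310, 402, 288, 310, 350]

mean_rt = statistics.mean(reaction_times)      # arithmetic average
median_rt = statistics.median(reaction_times)  # middle value once sorted
mode_rt = statistics.mode(reaction_times)      # most frequent value

print(f"mean={mean_rt:.1f}, median={median_rt}, mode={mode_rt}")
```

Here the single slow trial (402 ms) pulls the mean above the median, which is one reason the median is often preferred for skewed data.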

MEASURES OF SPREAD

Range

Variance (average of squared differences from the mean)

Standard deviation (square root of variance)

IQR (range between 25th and 75th percentile)
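
A minimal sketch of the four measures of spread, assuming a small made-up sample. It uses the population variance (dividing by n), which matches the "average of squared differences" wording above; the sample variance divides by n - 1 instead. Quartile conventions differ between tools, so the IQR computed here (using the default method of `statistics.quantiles`) may not match other software exactly.

```python
import statistics

# Hypothetical scores (illustrative values only)
scores = [2, 4, 4, 4, 5, 5, 7, 9]

data_range = max(scores) - min(scores)    # largest minus smallest value
variance = statistics.pvariance(scores)   # mean of squared deviations from the mean
std_dev = statistics.pstdev(scores)       # square root of the variance

# Quartiles: cut points at the 25th, 50th and 75th percentiles
q1, q2, q3 = statistics.quantiles(scores, n=4)
iqr = q3 - q1                             # spread of the middle 50% of the data

print(data_range, variance, std_dev, iqr)
```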

MEASURES OF SHAPE

Skewness (positive - tail on right, negative - tail on left)

Kurtosis (tailedness of data distribution)
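
One common way to quantify shape is through standardised central moments. The sketch below is one such definition (the simple moment-based, uncorrected version); statistical packages often apply small-sample corrections, so their values may differ slightly. The helper names and data are hypothetical.

```python
def central_moment(values, k):
    """k-th central moment: mean of (x - mean) ** k."""
    mean = sum(values) / len(values)
    return sum((x - mean) ** k for x in values) / len(values)

def skewness(values):
    """Third standardised moment: > 0 means tail on the right, < 0 tail on the left."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """Fourth standardised moment minus 3, so a normal distribution scores about 0."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3

# Illustrative data only
right_tailed = [1, 2, 2, 3, 3, 3, 4, 15]
symmetric = [1, 2, 3, 4, 5]

print(skewness(right_tailed), skewness(symmetric), excess_kurtosis(symmetric))
```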

GRAPHICAL SUMMARIES

Histograms (frequency distributions)

Box plots (median, quartiles, potential outliers)

Scatter plots (relationship between 2 quantitative variables)

Bar charts (categorical data)

SUMMARY TABLES

Frequency tables (number of occurrences in each category)

Contingency tables (frequency distribution of variables - shows relationship between them)
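
Both kinds of table can be sketched with `collections.Counter`. The records below are invented for illustration: each is a hypothetical (group, outcome) pair.

```python
from collections import Counter

# Hypothetical (group, outcome) records, illustrative only
records = [
    ("control", "improved"), ("control", "no change"), ("control", "no change"),
    ("treated", "improved"), ("treated", "improved"), ("treated", "no change"),
]

# Frequency table: occurrences of each category of one variable
frequency = Counter(group for group, _ in records)

# Contingency table: counts for each combination of two variables
contingency = Counter(records)

outcomes = sorted({outcome for _, outcome in records})
print("group".ljust(10) + "".join(o.ljust(12) for o in outcomes))
for group in sorted(frequency):
    row = "".join(str(contingency[(group, o)]).ljust(12) for o in outcomes)
    print(group.ljust(10) + row)
```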

CORRELATION AND ASSOCIATION

Correlation Coefficient (measures strength and direction of relationship between 2 variables)

Covariance (indicates direction of linear relationship between 2 variables)
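
The two measures are closely related: the Pearson correlation coefficient is the covariance rescaled by both standard deviations, so it always lies between -1 and 1. A minimal sketch, assuming sample covariance (dividing by n - 1) and made-up dose/response values:

```python
import math

def covariance(xs, ys):
    """Sample covariance: sign shows the direction of the linear relationship."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)

def pearson_r(xs, ys):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))

# Illustrative values only
dose = [1, 2, 3, 4, 5]
response = [2.1, 3.9, 6.2, 8.1, 9.8]

print(covariance(dose, response), pearson_r(dose, response))
```

Covariance depends on the units of both variables, whereas the correlation coefficient is unit-free, which is why it also conveys strength.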

REGRESSION MODELS

Linear Regression (models relationship between dependent and independent variable(s) by fitting a linear equation to data - predicts value of dependent variable based on value(s) of independent variable(s))

Logistic Regression (models the probability of an event when the outcome variable is binary)

Poisson Regression (models count data and rates - for example the number of event occurrences within a fixed period)
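
As an illustration of the first model only, simple linear regression with one independent variable can be fitted by ordinary least squares in a few lines. This is a sketch with made-up data; logistic and Poisson regression need iterative fitting and are normally done with a statistics package.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Illustrative values only: y is exactly 1 + 2x
xs = [0, 1, 2, 3, 4]
ys = [1, 3, 5, 7, 9]

slope, intercept = fit_line(xs, ys)
predicted_at_10 = intercept + slope * 10  # predict the dependent variable at x = 10
print(slope, intercept, predicted_at_10)
```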

LONGITUDINAL DATA ANALYSIS

Mixed-Effects Models (accounts for fixed and random effects - useful for measurements taken on the same subjects over time)

Generalised Estimating Equations (estimates parameters of a generalised linear model with a possible unknown correlation between outcomes)

SURVIVAL ANALYSIS

Kaplan-Meier Estimator (estimates survival function from lifetime data)

Cox Proportional Hazards Model (assesses effect of variables on survival time and estimates hazard ratios)
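
The Kaplan-Meier estimator can be sketched by hand: at each time where an event occurs, the running survival probability is multiplied by the fraction of at-risk subjects who did not have the event. The function name and follow-up data below are hypothetical; `observed` is False for censored subjects (those who left the study without the event).

```python
def kaplan_meier(times, observed):
    """Return [(time, survival probability)] at each time where an event occurred."""
    survival = 1.0
    curve = []
    for t in sorted(set(times)):
        # Subjects still under observation just before time t
        at_risk = sum(1 for u in times if u >= t)
        # Events (not censorings) happening exactly at time t
        events = sum(1 for u, e in zip(times, observed) if u == t and e)
        if events:
            survival *= 1 - events / at_risk
            curve.append((t, survival))
    return curve

# Illustrative follow-up times (months) and event indicators
times = [2, 3, 3, 5, 8]
observed = [True, True, False, True, False]

for t, s in kaplan_meier(times, observed):
    print(f"t={t}: S(t)={s:.2f}")
```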

MULTIVARIATE ANALYSIS

Principal Component Analysis (reduces dimensionality of data whilst retaining most of the variance - identifies patterns and simplifies datasets)

Factor Analysis (identifies underlying relationships between variables by grouping them into factors)

BAYESIAN METHODS

Bayesian Inference (updates probability of a hypothesis as more evidence becomes available)

Markov Chain Monte Carlo (samples from a probability distribution to perform Bayesian inference)
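
A minimal sketch of Bayesian updating over two competing hypotheses, using Bayes' rule directly (no sampling, so this does not illustrate MCMC). The hypotheses, prior and likelihood values are invented for the example.

```python
def update(prior, likelihood):
    """Bayes' rule: posterior is proportional to prior times likelihood."""
    unnormalised = {h: prior[h] * likelihood[h] for h in prior}
    total = sum(unnormalised.values())
    return {h: p / total for h, p in unnormalised.items()}

# Hypothetical prior beliefs and probability of the observed result under each hypothesis
prior = {"effect": 0.5, "no effect": 0.5}
likelihood = {"effect": 0.8, "no effect": 0.2}

after_one = update(prior, likelihood)      # belief after one supporting result
after_two = update(after_one, likelihood)  # yesterday's posterior becomes today's prior

print(after_one["effect"], after_two["effect"])
```

Each new piece of evidence shifts the probability further, which is the sense in which the hypothesis is "updated" as evidence accumulates.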

ADVANCED VISUALISATION TECHNIQUES

Heatmaps (show intensity of data points)

Network Analysis (visualises relationships between entities)

What type of outlier falls outside inner fences?

A minor outlier (if it still lies within the outer fences).

What type of outlier falls outside outer fences?

A major outlier.

How do you calculate inner fence boundaries?

Multiply the IQR (Q3 - Q1) by 1.5. Add the result to Q3 for the upper inner fence and subtract it from Q1 for the lower inner fence.

How do you calculate outer fence boundaries?

Multiply the IQR (Q3 - Q1) by 3. Add the result to Q3 for the upper outer fence and subtract it from Q1 for the lower outer fence.
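
The fence rules can be sketched as a small classifier. The function names and the quartile values (Q1 = 10, Q3 = 20) are hypothetical and chosen only to keep the arithmetic easy to follow.

```python
def fences(q1, q3):
    """Inner fences sit 1.5 x IQR beyond the quartiles, outer fences 3 x IQR."""
    iqr = q3 - q1
    return {
        "inner": (q1 - 1.5 * iqr, q3 + 1.5 * iqr),
        "outer": (q1 - 3 * iqr, q3 + 3 * iqr),
    }

def classify(value, q1, q3):
    """Label a value as a major outlier, a minor outlier, or neither."""
    f = fences(q1, q3)
    if value < f["outer"][0] or value > f["outer"][1]:
        return "major outlier"
    if value < f["inner"][0] or value > f["inner"][1]:
        return "minor outlier"
    return "not an outlier"

# With Q1 = 10 and Q3 = 20: IQR = 10, inner fences (-5, 35), outer fences (-20, 50)
for value in (15, 40, 60):
    print(value, classify(value, q1=10, q3=20))
```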

Why would you transform data?

To move the data onto a different scale that makes interpretation and/or statistical analysis easier.

What reasons are there for transforming data?

1. To improve normality (to allow use of parametric tests)

2. To reduce skewness

3. To linearise the relationship between 2 variables

4. To make multiplicative relationships additive

What are some commonly used data transformations?

1. Natural logarithm transformations

2. Power transformations

When would you perform a log transformation?

If the data are positive values and positively skewed - log transformations stretch the scale at the lower end and compress the scale at the upper end.
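
The effect can be checked numerically. The sketch below applies a natural log to a made-up, positively skewed sample and compares a simple moment-based skewness before and after; the `skewness` helper is an illustrative definition, not a library function.

```python
import math

def skewness(values):
    """Moment-based skewness: third central moment over variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

# Illustrative positive, right-skewed values
raw = [1, 2, 3, 5, 8, 13, 40, 100]
logged = [math.log(x) for x in raw]  # natural log; requires every value > 0

print(f"skewness before: {skewness(raw):.2f}, after: {skewness(logged):.2f}")

# Equal-sized steps are stretched at the low end and compressed at the high end
low_step = math.log(2) - math.log(1)
high_step = math.log(100) - math.log(99)
print(low_step, high_step)
```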