Summarising Data Flashcards

- Understanding the importance of summarising data in neuroscience. - Clear demonstration of different types of data - How you can summarise data? - Get a clear idea of data variability & how you could Identify outliers in your data? - Demonstrate ways of managing my research data. - Apply statistical software (e.g., STATA) to carry out exploratory analysis and display a dataset (histogram, box plot, cumulative frequency) presentation.

1
Q

Why is it important to summarise data in neuroscience?

A

Using statistics to summarise data allows it to be presented and communicated, as well as quantifying the variation and uncertainty in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

CLARITY AND UNDERSTANDING

A

Distil large amounts of complex data into an understandable form. Allows easier identification of patterns and insights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

HYPOTHESIS TESTING

A

Allows focus on the essential data to allow more accurate conclusions to be formed about whether hypotheses are supported or refuted.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

EFFICIENT COMMUNICATION

A

Summarised data can be clearly and concisely presented for sharing in journals, conferences etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

RESOURCE MANAGEMENT

A

Focussing on the most relevant data allows time, resources and funding to be focussed to ensure that research is impactful.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

META-ANALYSES AND GENERALISATION

A

Summarised data is needed to combine results from multiple studies to increase statistical power and generalisability of findings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

DATA INTEGRITY AND REPRODUCIBILITY

A

Clearly documented summarised data allows researchers to replicate experiments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

DEVELOPMENT OF THEORIES

A

Summarised data allows identification of consistent patterns, allowing theoretical models to be developed and refined.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are nominal data?

A

Categorical data without a specific order e.g., blood types.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are ordinal data?

A

Categorical data with a meaningful order but no consistent interval between categories e.g., stages of cancer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are discrete data?

A

Countable values (typically integers) e.g., number of hospital visits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are continuous data?

A

Values can fall anywhere within a specified rang e.g., blood pressure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are interval data?

A

Numerical data with meaningful intervals between values but without a true zero point e.g., temperature.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are ratio data?

A

Numerical data with equal intervals and a true zero point which allows for the calculation of ratios e.g., height, weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

MEASURES OF CENTRAL TENDENCY

A

Mean

Median

Mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

MEASURES OF SPREAD

A

Range

Variance (average of squared differences from the mean)

Standard deviation (square root of variance)

IQR (range between 25th and 75th percentile)

17
Q

MEASURES OF SHAPE

A

Skewness (positive - tail on right, negative - tail on left)

Kurtosis (tailedness of data distribution)

18
Q

GRAPHICAL SUMMARIES

A

Histograms (frequency distributions)

Box plots (median, quartiles, potential outliers)

Scatter plots (relationship between 2 quantitative variables)

Bar charts (categorical data)

19
Q

SUMMARY TABLES

A

Frequency tables (number of occurrences in each category)

Contingency tables (frequency distribution of variables - shows relationship between them)

20
Q

CORRELATION AND ASSOCIATION

A

Correlation Coefficient (measures strength and direction of relationship between 2 variables)

Covariance (indicates direction of linear relationship between 2 variables)

21
Q

REGRESSION MODELS

A

Linear Regression (models relationship between dependent and independent variable(s) by fitting a linear equation to data - predicts value of dependent variable based on value(s) of independent variable(s))

Logistic Regression (models probability of event using binary outcome variables)

Poisson Regression (counts data and rates - for example the number of event occurrences within a fixed period)

22
Q

LONGITUDINAL DATA ANALYSIS

A

Mixed-Effects Models (accounts for fixed and random effects - useful for measurements taken on the same subjects over time)

Generalised Estimating Equations (estimates parameters of a generalised linear model with a possible unknown correlation between outcomes)

23
Q

SURVIVAL ANALYSIS

A

Kaplan-Meier Estimator (estimates survival function from lifetime data)

Cox Proportional Hazards Model (assesses effect of variables on survival time and estimates hazard ratios)

24
Q

MULTIVARIATE ANALYSIS

A

Principle Component Analysis (reduces dimensionality of data whilst retaining most of the variance - identifies patterns and simplifies datasets)

Factor Analysis (identifies underlying relationships between variables by grouping them into factors)

25
Q

BAYESIAN METHODS

A

Bayesian Inference (updates probability of a hypothesis as more evidence becomes available)

Markov Chain Monte Carlo (samples from a probability distribution to perform Bayesian inference)

26
Q

ADVANCED VISUALISATION TECHNIQUES

A

Heatmaps (show intensity of data points)

Network Analysis (visualises relationships between entities)

27
Q

What type of outlier falls outside inner fences?

A

A minor outlier.

28
Q

What type of outlier falls outside outer fences?

A

A major outlier.

29
Q

How do you calculate inner fence boundaries?

A

(Q3-Q1) x 1.5
Add to Q3
Subtract from Q1

30
Q

How do you calculate outer fence boundaries?

A

(Q3-Q1) x 3
Add to Q3
Subtract from Q1

31
Q

Why would you transform data?

A

To transform the data into a different scale to allow interpretation and/or statistical analysis.

32
Q

What reasons are there for transforming data?

A
  1. To improve normality (to allow use of parametric tests)
  2. To reduce skewness
  3. To linearise the relationship between 2 variables
  4. To make multiplicative relationships additive
33
Q

What are some commonly used data transformations?

A
  1. Natural logarithm transformations
  2. Power transformations
34
Q

When would you perform a log transformation?

A

If the data are positive values and positively skewed - log transformations stretch the scale at the lower end and compress the scale at the upper end.

35
Q
A