Semester 1 Recap Flashcards

1
Q

What is replicability and reproducibility?

A

Replicability - The ability of a scientific experiment or trial to be repeated to obtain a consistent result. Research is replicable when the researcher collects new data to arrive at the same scientific findings as a previous study.
Reproducibility - When the original researcher’s data and computer codes are used and regenerate the same results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a theory?

A

An explanation of a particular behaviour or phenomenon, typically based on scientific research

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a hypothesis?

A

A specific prediction about a behaviour or phenomenon that can be tested in a scientific research project

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Independent variable (IV)

A

The variable that is manipulated by the experimenter (under the experimenter’s
control). May be naturally occurring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Dependent variable (DV)

A

The variable that is measured. An outcome variable that changes as a result of the IV.
E.g. reaction times, IQ scores, personality scores, body fat percentage, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Control variables

A

The variables that are kept controlled and constant throughout the study so that they don’t interfere with the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Extraneous/Confounding variables

A

Other variables that might have an effect on the relationship between the IV and DV. We try to control these variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Nominal data

A

Categorise data by labelling them in groups, no order between the categories. No numerical properties. E.g., city of birth, gender, ethnicity, car brands, marital status.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ordinal data

A

Categorise and rank data in an order, cannot say anything about the intervals between the rankings. E.g., top 5 Olympic medalists, language ability, likert-type questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Interval data

A

Ordered scale, intervals between units of measurement are all equal. No absolute zero. E.g., test scores, personality inventories, temperature in Fahrenheit or Celsius

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Ratio data

A

Scale with equal intervals and an absolute zero. Negative values not possible. E.g., height, age, weight, temperature in Kelvin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Experimental design - Independent groups and repeated measures

A

Researcher manipulates independent variable(s) and measures the outcome (dependent variable). E.g., T-test, Analysis of variance (ANOVA).
Independent groups (between subjects) - pps randomly assigned to different conditions/groups
Repeated measures (within subjects) - each pp takes part in all conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Quasi-experimental design

A

Independent groups design in which pps are not randomly allocated to different conditions of the IV (non-random criteria). IVs that we cannot directly manipulate - gender, smoker, religion, genetics, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Correlational design
Limitations of a correlational design

A

Measures the association (relationship) between variables. No independent variables. E.g., correlation, regression.
Limitations - Cannot infer causation from correlations, extraneous various may cause changes in both measured variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Categorical design

A

Measures nominal variables (frequency). E.g., Chi-Square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Population and samples
Summary values of population and samples

A

Population - A group that have something in common that is of interest for the research question.
Sample - Smaller group of members of a population selected to represent that group
Summary values:
- Parameters describe populations (e.g., the mean of a population is a parameter)
- Statistics describe samples (e.g., the mean value of a sample is a statistic)

17
Q

Sampling error

A

When an analyst does not select a sample that represents the entire population of data, so the results found in the sample do not represent the results that would be obtained from the entire population.
Can be reduced by increasing same size and using random sampling

18
Q

Sampling distribution

A

The distribution of sample statistics (e.g., means, proportions) obtained from multiple random samples of the same size from a population.
Provides information about the variability and characteristics of the sample statistics.
The shape of the sampling distribution depends on the population distribution and the sample size

19
Q

Central limit theorem

A

States that, regardless of the shape of the population distribution, the sampling distribution of the sample mean approaches a normal distribution as the sample size increases.
A fundamental concept in statistics and allows for the use of normal distribution-based inference methods even when the population distribution is not normal.
Applicable when the sample size is sufficiently large (typically n ≥ 30) or when the population distribution is approximately normal.

20
Q

Standard errors

A

Measures of the variability or uncertainty associated with a sample statistic.
Quantify the average amount of sampling error expected in estimating a population
parameter.
Directly related to the standard deviation.
SE reflects the precision of the sample statistic estimate, while the standard deviation describes the variability of the individual data point

21
Q

Measures of central tendency

A

Value that represents a typical (or central) score in a dataset.
Mean, Median, Mode

22
Q

Mean

A

Arithmetical value. Sum of scores divided by number of scores.
Used on interval or ratio data
Advantages - uses all values, is sensitive
Disadvantages - influenced by outliers

23
Q

Median

A

Middle score when data points are arranged from smallest to largest.
Used on interval, ratio, and ordinal data
Advantages - not affe ted by outliers
Disadvantages - does not use all data, not very sensitive

24
Q

Mode

A

Most frequently occurring score
Used on nominal data
Advantages - only measure that can be used for categorical data
Disadvantages - possible to have more than one mode or no mode - not vey useful

25
Q

Measures of dispersion

A

Value that describes how spread out a set of data is.
Range, Interquartile range, Variance, Standard deviation

26
Q

Range

A

Highest score minus lowest score
Disadvantage - only based on two scores, ignored other information

27
Q

Interquartile range (IQR)

A

Put all the scores in order, from smallest to largest:
3, 4, 4, 5, 5, 6, 7, 7, 10, 12, 13, 15, 16, 16, 18
Divide set of scores into four equal parts (quarters):
3, 4, 4, 5 (lower quartile), 5, 6, 7, 7 (median), 10, 12, 13, 15 (upper quartile), 16, 16, 18
The middle two quarters (50%) represent the IQR:
IQR = 15 – 5 = 10
Disadvantage - not based on all observations, first 25% and last 25% are ignored

28
Q

Variance

A

How much the values in the dataset deviate from the mean
Find the differences between each number in the data set and the mean, square the differences to make them positive, divide the sum of the squares by the number of values in the data set.
Variance = Sum of the squared deviations ÷ (N - 1). E.g.:
0.14 + 2.64 + 5.64 + 0.14 + 2.64 + 5.64 + 0.39 + 2.64 = 19.87
19.87 ÷ (8-1) = 2.84
Disadvantage - doesn’t describe the amount of variability in the same units as the original data (due to squaring the values)

29
Q

Standard deviation (SD)

A

Square root of the variance. Most popular measure of dispersion.
Large SD - scores are spread out
Small SD - scores are close to the mean

30
Q

Graphical representations

A

Tools used to present data in a visual format, making it easier to understand and interpret complex information. E.g., box plots and histograms

31
Q

Box plot

A

Summarise the distribution of a dataset using key summary statistics, such as the minimum, maximum, quartiles, and outliers.
Helpful for comparing distributions and identifying outliers.

32
Q

Histogram

A

Depict the distribution of numerical data by dividing it into intervals or bins and showing the frequency or count of data points falling into each bin.
Make it easier to identify whether data is unimodal, bimodal, or skewed

33
Q

Outlier

A

A data point/ observation that differs significantly from other observations; an observation that lies an abnormal distance from other values in a random sample from a population

34
Q

Normal distribution

A

Also known as Bell Shaped Curve
- Mean = mode = median
- The mean divides the data in half
- Symmetric
- Unimodal curve (i.e., one peak)
- The curve approaches, but never touches,
the x-axis

35
Q

Skewed distribution

A

Neither symmetric nor normal because the data values trail off more sharply on one side than on the other