Research methods Flashcards

1
Q

What is a fact in a scientific context

A

generally accepted reality based on objective inferences, verifiable by evidence that is published and accepted via peer review and replicated over time. Can still be open to scientific enquiry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define the steps of the scientific method

A

1 - develop and define the problem (observation)
2 - formulate hypothesis
3 - gather data
4 - analyse and interpret results (test)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the two types of hypotheses

A

Null hypothesis - no relationship between variables
Alternative hypothesis - there is a relationship between anxiety

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 2 kinds of statistics

A

Descriptive statistics and inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a hypothesis

A

A statement about some real-world
phenomenon that can be tested through
observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is an independent variable

A

the factor you will be testing. Thing you will change (eg temp, conc etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the dependent variable

A

the effect you think will change and the thing you measure - is effected by independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What must the hypothesis include

A

Must state the question you are asking as well as include both dependent and independent variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How many relationships should be included in hypothesis

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do you need to check about hypothesis

A

make sure its testable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does APPEAR stand for

A

Acquire
Process
Plot
Examine
Analyse
Report

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe Acquire (APPEAR)

A

Acquire data through sampling.
Need to decide how you do it, think about assumptions, ethical considerations, numbers, resources available.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are some assumptions you have to make when sampling a population

A

that pop. is normally distributed, that you are randomly sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a T-test

A

used to test hypotheses about means when the population variance is unknown (usual)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What sample size do you want

A

as large as possible - will give more accurate data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a single sample T-test

A

when we only have 1 group that we want to test against a hypothetical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is an independent samples T-test

A

We have 2 means and 2 groups that have no relation between each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the 3 varieties of T-Tests

A

Single sample, independent samples and dependent samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a pooled average

A

A weighted average of the two sample variances with weighting done according to sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is a dependent T-test

A

2 means that are related to each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

when do we use t tests

A

when population variance is unknown and sample size is small (usually under 100)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is an ANOVA test

A

(AN)alysis Of (VA)riance
A statistical test for comparing means between more than 2 groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What does the F-test test

A

the hypothesis that two variances are equal. Will be close to 1 if variances are equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

what function do you use in R to import your dataset

A

read.csv()
e.g. Dataset1 <- read.csv(“dataset1.csv”, header = TRUE)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
what is the Shapiro Wilks test
a normality test for a null hypothesis to check that results are normally distributed if p>0.05 we fail to reject null hypothesis
26
What is the Bartlett test
Tests if there is a difference in variance between multiple sets of data p>0.05 there is no difference in variance
27
what does it mean if shapiro test and bartlett test give p>0.05
data is normal with homogenous variance and therefor you can proceed with t test to analyse data
28
what is a 1 sample t test
tests whether the mean of a single sample is significantly different from known or hypothesized mean
29
what is a 2 sample t test (aka independent sample test)
used when comparing the means of two independent groups or populations and assesses whether the difference in means between the two groups is statistically significant
30
what is a paired sample t test
used when comparing the means of two related groups or when each data point is paired. Determines whether the is a significant difference in the means
31
what is a Post hoc test
used when ANOVA indicates significant differences and can identify which group differs from others
32
What must you do before an ANOVA test
Must make sure data is normally distributed using Shapiro-Wilkes tests
33
Name an example of a Post hoc
Tukey's HSD (Honestly Significant Difference test)
34
How do you interpret the results of an ANOVA test
if the p value (Pr(>f)) is less that 0.05 you can reject null hypothesis. This indicates that at least one group is significantly different from the others. Yu can use Post-hoc now
35
What is the ANOVA function in R
ExampleA <- aov(dependent variable ~ independent variable, data = ANOVA1)
36
What is a 2 way ANOVA test
used to investigate the effects of two categorical independent variables on a continuously dependent variable
37
what is the Wilcoxon test
non-parametric rank test for statistical hypothesis testing used either to test the location of a population based on a sample of data, or to compare the locations of two populations using two matched samples
38
what needs to be checked before a two way ANOVA
Check groups are independent Normality (shapiro test) Homogeneity (Barlett test)
39
what are regression models
a relationship between one dependent variable and explanatory variables. They are used mainly for prediction and estimation
40
How do you set up relationships in regression models
Use equations - numerical dependent variables and 1 or more numerical or categorical independent (explanatory) variable
41
What are the steps of regression modeling
Hypothesize relationship between variables Specify probability distribution of random error term Evaluate the fitted model Use the model for prediction and estimation
42
What is model specification based on
Theory - theory of field, mathematical theory, previous research and common sense
43
what are the two main types of linear regression and when are they used
simple linear regression when you have only one independent variable multiple linear regression which uses two or more independent variables
44
How do you interpret the results of regression
by using summary function summary(model) which provides detailed output including coefficients, r squared p values and more
45
What are co-efficients (linear regression0)
The intercept and coefficients of independent variables
46
What is R-squared (linear regression)
this value measures the models goodness of fit and represents the proportion of the variance in the dependent variable
47
what is the p value (linear regression)
a low p value (<0.05) for the independent variable(s) suggests a significant relationship between variables
48
why are boxplots important
they are essential for visualising and summarising data. They provide a quick way to assess distribution of data and identify outliers
49
what is a boxplot
a graphical representation of the distribution of a dataset. It displays a 5 number summary of a set of data
50
What are the 5 main points of a boxplot
the minimum, first quartile, median, third quartile and maximum
51
Describe the 'anatomy' of a boxplot
The box itself is the interquartile range and spans from q1 - q3 with the median (q3) inside The lines on either side extend to the maximum and minimum values with 1.5x the IQR. Anything out with this are the outliers
52
Why use boxplots
makes it easy to identify skewness, central tendency and spread in data good for visualizing non normal distribution helps spot outliers easy to compare
53
How do you create a boxplot
Order the data Calculate the quartiles Determine the IQR (Q3-Q1) Find upper and lower limits Identify outliers Plot boxplot
54
Give some examples of non parametric tests
The Wilcoxon test and the Mann-Whitney U test
55
How do you interpret a boxplot
The median represents the centre of the data distribution whilst the box length represents the spread of the middle 50% of the data. The whiskers show the range of most of the data and outliers can be identified as individual points outside the whiskers
56
what extra information does a parametric test use
they operate under the assumption that the data is normally distributed
57
Give some examples of parametric tests
t tests, anova
58
what is nonparametric data
A class of statistical procedure that do not rely on assumptions about the shape or form of the probability distribution from which the data were drawn
59
What are the advantages of nonparametric tests
You can use these tests with any numeric variables with any distribution
60
What are the advantages of parametric tests
they use more information from available data which allows for more confidence of ruling out chance and finding real differences
61
why do we not always use parametric tests
data has to be normally distributed, interval or ratio level, and variance must be similar
62
what is the chi squared test
A method suited for situations involving larger sample sizes where it provides reliable insights into the independence or association between categorical variables
63
TRUE OR FALSE: if all conditions are met you should use parametric tests over non parametric tests
TRUE
64
what is the Wilcoxon test
non-parametric rank test for statistical hypothesis testing used either to test the location of a population based on a sample of data, or to compare the locations of two populations using two matched samples
65
What is the Fisher's exact tests
useful in smaller sample sizes or when dealing with 2x2 contingency tables where expected cell counts are low It computes the exact probability of obtaining observed distribution
66
What is the Mann Whitney U test
Similar to Wilcoxon test but for independent samples