Research Methods - Techniques Of Data Handling And Analysis Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Quantitative data

A

Quantitative Data = Quantitative data involves numbers and can be measured objectively. It is immediately quantifiable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Quantitative data includes

A

 The dependent variable in an experiment.
 Closed questions in questionnaires.
 Structured interviews
 A tally of how many times a behavioural category is seen in an
observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Qualitative data

A

Qualitative data involves words and the data is based on the subjective interpretation of language. It is only quantifiable if the data is put into categories and the frequency is counted.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Qualitative data includes

A

 Open questions in questionnaires.
 A transcript from an unstructured interview.
 Researchers describing what they see in an observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Problems with qualitative data

A

Qualitative data is challenging to analyse because it relies on interpretation by the researcher, which could be inaccurate, subjective or even biased. Furthermore, qualitative data may not be easy to categorise/collate into a sensible number of answer types. The researcher could be left with lots of individual responses that cannot be summarised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Primary data

A

Primary Data = Primary data is collected directly by the researcher for the purpose of the investigation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Secondary data

A

Secondary data is information that was collected for a purpose other than the current use. The researcher could use data collected by them but for a different study, or collected by a different researcher. A researcher might make use of government statistics, such as mental health statistics collected by the NHS.

However, there is substantial variation in the quality and accuracy of secondary data and it can be hard for researchers to know how reliable secondary data is.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Meta-analysis

A

A meta-analysis refers to the process of combining results from a number of studies on a particular topic (secondary data) to provide an overall view. Meta- analysis allows us to view data with much more confidence and results can be generalised across much larger populations. However, meta-analysis may be prone to publication bias; the researcher may choose to leave out studies with negative or non-significant results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Tables

A

When tables appear in the results section of a research report they are not raw scores but have been converted to descriptive statistics (measures of central tendency or measures of dispersion). There should be a paragraph beneath the table explaining the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Scattergraph

A

A scattergraph is a graphical display that shows the correlation or
relationship between two sets of data (or co-variables) by plotting dots
to represent each pair of scores. A scattergraph indicates the strength
and direction of the correlation between the co-variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Bar chart

A

A bar chart is used to show frequency data for discrete (separate) variables. The height of each bar represents the frequency of each item. In a bar chart a space is left between each bar to indicate the lack of continuity. The frequency of each category is plotted on the vertical y-axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Distributions

A

With most data sets the frequency of these measurements should reflect a bell shaped curve. This is called normal distribution which is symmetrical. Within a normal distribution most people are located in the middle area of the curve and very few people are at extreme ends. The mean, mode and median all occupy the same mid-point of the curve.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Skews

A

A positive skew is where most of the data is concentrated to the left of the graph. In this case the mode remains at the highest point of the peak, the median comes next but the mean has been dragged across to the right. The opposite occurs in a negative skew.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Measures of central tendency + mean

A

Measures of central tendency inform us about central values for a set of data. They are ‘averages’ – ways of calculating a typical value for set of data. The average can be calculated in different ways, each one appropriate for a different situation.

The mean calculated by adding all the scores and dividing by the number of scores. The advantage of this method is that it is representative of all the data collected as it is calculated using all the individual values. The mean is the most sensitive measure of central tendency as it uses all the values in set of data. However, the disadvantage is that it can be distorted by a single extreme value in the set and the mean score may not be one of the actual scores in the set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The median

A

The median is calculated by arranging the scores in order then choosing the numerical midpoint. The advantage is that it is unaffected by extreme scores, unlike the mean. The disadvantage is that any outlier values/extreme values would not form part of the average measurement. It is less sensitive than the mean. It does not represent all the findings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The mode

A

The mode is the most frequent value in a set. The advantage is that it is unaffected by extreme scores. The disadvantage is that it tells us nothing about other scores in the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Measures of dispersion

A

A set of data can also be described in terms of how dispersed or spread out the data items are.

The range is calculated by taking the lowest score from the highest. An advantage of this is that it is quick and easy to calculate. A disadvantage is that it can be easily distorted by extreme values.

The standard deviation is the average amount that each score differs from the mean. An advantage is that it takes account of all the scores. A disadvantage is that it is more difficult to calculate than the range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are inferential statistics designed to do

A

Inferential statistics have been designed to work out the probability (p) that a particular set of data has occurred by chance, and not because of the independent variable (IV). In other words these statistics tell us the chance that our sample of women are better than men at map reading because of luck rather than because women are genuinely better than men at map reading.

19
Q

Accepted level of probability that a data set has occurred

A

The accepted level of probability that a data set has occurred due to chance in Psychology is p<0.05 (less than 5%). This is the level at which the researcher decides to accept the alternative hypothesis.

20
Q

When to use a sign test

A

The sign test can only be used when there is one group of participants (e.g. a repeated measures design) and when the data is numerical (quantitative).

21
Q

How to carry out a sign test

A

Step 1: State the hypothesis – This is our hypothesis: ‘people are happier after going on holiday than they were beforehand’. This is a directional hypothesis and therefore requires a one-tailed test. If the hypothesis was a non-directional hypothesis then a two-tailed test would be used.

Step 2: Record the data and work out the sign - Record the difference between each pair of data (subtract the ‘happiness before’ score from the ‘happiness after’ score because we predict they will be happier afterwards. Next, record a (+) for happier after the holiday and a (–) for happier before the holiday.

Step 3: Find calculated value – S is the symbol for the statistic we are calculating. It is calculated by adding up the plus signs and the minus signs and selecting the SMALLER value. In this case there are 10 pluses and 3 minuses and one zero. Therefore the less frequent sign is minus so S=3. This is called the calculated value because we calculated it.

Step 4: Find the critical value – N is the total number of scores (ignoring any zero values). In our case N=13. The hypothesis is a directional hypothesis and therefore a one-tailed test is used. Now we use the table of critical values (below) and locate the 0.05 column for a one-tailed test and the row that begins with our N value (13). For a one- tailed test at 0.05 the critical value is S=3. The calculated value of S must be EQUAL TO or LESS THAN the critical value for significance to be shown. Our calculated value is equal to the critical value so is significant.

If the hypothesis is a directional hypothesis we also have to check that the results are in the expected direction. In this case we expect people to be happier afterwards and should therefore have more pluses than minuses. This was the case and therefore we can accept the alternative hypothesis and reject the null hypothesi

22
Q

Table to decide what statistical test to use

A
23
Q

Parametric vs non parametric tests

A
24
Q

Nominal data

A

Nominal data can be referred to as categorical data. For example, if a researcher was interested to know if more students doing A‐level psychology went to a school or a college, the data would be categorised as either ‘school’ or ‘college’: two distinct categories. If the data is nominal, then each participant will only appear in one category. This is called discrete data.

25
Q

Ordinal data

A

Ordinal data - data is ordinal if it is ordered in some way and the intervals between the data are not equal. Typically, this is used to simply rank data where the values assigned have no meaning beyond the purpose of stating where one score appeared in relation to others. For example, if people were asked to rate their preference of local restaurants, with 1 being their least favourite and 10 being their favourite, a researcher would be able to generate a list of restaurants from this data based upon the average ratings for each.

26
Q

Interval data

A

Interval data is like ordinal data in that it also refers to data that is ordered in some way. However, with interval data we are confident that the intervals between each value are equal in measurement. This type of data is much more objective and scientific in nature as a result. Examples of interval data include temperature and time. The difference between 3 and 4 degrees Celsius is the same as the difference between 35 and 36 degrees Celsius. Similarly, heart rate, blood pressure, ruler measures in m, cm or mm would be classed as interval level data.

27
Q

Evaluations of levels of measurement

A
28
Q

Type 1 errors

A

A Type 1 error would occur where we might reject the null hypothesis and accept the experimental/alternative hypothesis instead. However, the results for the study are really due to chance and are not statistically significant. So, we have made a mistake! We should have accepted the null hypothesis instead!Type 1 error is also known as a false positive

29
Q

Type II error

A

A Type II error would occur where a real difference in the data is overlooked as it is wrongly accepted as being not significant, accepting the null hypothesis in error (a false negative).

30
Q

Level of statistical significance defined

A

“The level at which the decision is made to reject the null hypothesis in favour of the experimental hypothesis. It states how sure we can be that the IV is having an effect on the DV and this is not due to chance.”

31
Q

Calculated value symbol for each test

A
32
Q

How to calculate Mann Whitney U test *

A
33
Q

How to calculate wilcoxon *

A
34
Q

How to calculate Chi squared test *

A
35
Q

How to calculate spearman’s rho *

A
36
Q

Related t, unrelated t, Pearson’s r how to know if results are significant

A
37
Q

Related t test

A

• This is used when we wish to test a difference
• The design is repeated measures or matched pairs
• The data should be interval
• Related t-test is a parametric test

38
Q

Unrelated t-test

A

• This test is used when we are looking for a test of difference
• This statistical test is used when we have an independent group design.
• This test has level of measurement which is interval.
• T unrelated t-test is a parametric test

39
Q

Pearson’s r

A

• This test is used for investigating correlations or relationships between variables
• The level of measurement for the data is interval
• Pearson’s r is a parametric test

40
Q

Features of science

A

Objectivity is a key feature of science:
Objectivity can be defined as: “Dealing with facts in a way that is unaffected by beliefs, opinions, feelings or expectations.”
A good researcher is always objective and keeps a, “critical distance” from the research they are conducting. Researchers should not let their personal opinions or biases interfere or affect the outcome of the research.

Empirical Methods are a key feature of science
An empirical method involves the use of objective, quantitative observation in a systematically controlled, replicable situation, in order to test or refine a theory. An example of an empirical method is an experiment

Replicability: is a key feature of science
This can be defined as, “The extent to which the findings of research can be repeated in different contexts and circumstances.”
This refers to when the research is carried out again in the future and whether the findings can be repeated and whether similar findings can be found; if this is the case then we can say that the research is scientific and reliable.

Replicability also serves the purpose of:
a) Guarding against scientific fraud
b) Researchers can check to see if results gained were “a one off fluke” possibly caused by extraneous/confounding variables
c) If research findings can be repeated, we would say that the findings are reliable (see lesson 3)
d) Replicability can also indicate that research findings are valid

Falsifiability: is a key feature of science
This can be defined by Popper (1934) as, “The notion that scientific theories can potentially be disproved
by evidence, it is the hallmark of science. It refers to proving a hypothesis wrong”
Popper (1969) stated that genuine scientific theories should be tested and can also be proven to be false or incorrect (falsifiability).

A Paradigm is a key feature of science:
A paradigm is a shared set of assumptions and agreed methods that are found within scientific disciplines

Kuhn stated that a paradigm shift is when, “The result of a scientific revolution occurs. A significant change in the dominant unifying theory of a scientific discipline occurs and causes a paradigm shift.”
Paradigm shift occurs in two stages:
1) One theory remains dominant within a scientific discipline. Some researchers might question the accepted paradigm and might have contradictory research that disagrees with the main paradigm. Counter evidence might start to accumulate against the main paradigm, critics might begin to gain popularity and eventually the counter evidence becomes hard to ignore. The present paradigm might then be overthrown due to the emergence of a new one. This is an example of a paradigm shift.
2) An established science makes rapid progress and a scientific revolution occurs due to the paradigm shift

Hypothesis Testing is a key feature of science
Hypothesis testing is an important feature of science, as this is how theories are developed and modified. A good theory should generate testable predictions (hypotheses), and if research fails to support the hypotheses, then this suggests that the theory needs to be modified in some way

41
Q

Stages in writing a psychological report

A

1) Title:
This should provide a clear focus of the study and should involve the key variables that you are investigating. It should not be too vague or too specific, e.g. “An investigation to study the relationship between health and stress levels

2) Abstract: (150-200 words long)
This is usually written once the whole report has been completed (because it involves a summary of main concepts). It provides a clear and concise summary of the entire investigation so that the readers can gain an overview of the piece of research and whether it is worth reading the whole report. Information is provided such as:

• Aims
• Experimental hypothesis (one or two tailed)
• Null hypothesis
• Research methods and procedures
• Experimental design
• Sample used (number, age, setting) and sampling method
• Brief account of findings, including statistical tests, results, levels of significance
• Conclusions of the study

3) Introduction leading to aims/hypothesis:
The introduction section is about justifying the need for conducting research. When developing a study, it is important to think about the research that already exists within the same field of psychology. Researchers try to identify if there a gap in existing research or if previous research created new questions that need to be answered. AIM is stated here

4) Method section:
This states how the investigation was carried out and it should be precise so that the study can be replicated. It might be a good idea to display the method using bullet points to aid clarity. There are a few subsections to be aware of:
a) Design:
i) The experimental design (independent measures, matched participants or repeated measures) and the reasoning for why this particular design has been used.
ii) Also the research method needs to be selected with justification
iii) The independent and dependent variables need to be stated (as well as confounding ones)
iv) Controls – for example if counterbalancing was used or random allocation etc
b) Sample:
i) Give details of your sample, e.g. number of males and females, age, background, where did you get them from?
ii) Also explain the sampling method used and why. How were the sample accessed? Where did the sample originate from e.g. place? Participants must remain anonymous and do not use any names!
c) Apparatus/materials:
Make a concise list of materials that are required to carry out the research
d) Procedure:
Bullet pointed steps (or written as a report) that need to be carried out in order to conduct the research, which must be written in sufficient depth and detail for easy replication. Clear information must be presented from the start to the end of the research. Briefing, standardized instructions and debriefing must be included. Please note that in the exam, you may be required to write a set of instructions for your pps
e) Ethics:
In the final subheading of the method section, it is important to consider any ethical issues that arose within the study, and how they were addressed. For example, if participants were being deceived about the true aims of the study, then it is important for the researcher to explain that there was an issue with informed consent, but that this was dealt with using a debrief following completion of the study. Please note that in the exam, you may be required to write a consent form or debriefing so please be prepared

5) Results section:
The results gained from the research.
• Descriptive statistics – Tables, charts, graphs, and raw data. Central tendency such as the mode, mean and median should be stated, as well as measures of dispersion (the range and standard deviation).
• Inferential statistics - what statistical test has been used and why (please remember to justify the use of statistical test). Significance levels and calculated values need to be reported
• If qualitative data has been collected, categories and themes should be described with examples
• State whether the experimental/null hypothesis is accepted/rejected

6) Discussion Section:
The researcher will interpret the results of the study using four key areas:
a) Summary of results:
The results are reported in a brief format and some explanation given about what they mean
b) Relationship to background research:
The results of the study are discussed in relation to the research reported in the introduction section and other related research
c) Limitations of methodology and modifications
Criticisms may be made about the methods used in the study and modifications/improvements suggested
d) Implications and suggestions:
The implication of the results for psychological theory and real life applications. Suggestions can be made for future research

7) Reference section:
The full details of any journals or books that have been mentioned in the report must be provided in the reference section.
Journal referencing:
Author’s name(s), date of publication, title of article, journal title, volume (issue number), page numbers (see example below)

42
Q

Non parametric tests - when are results significant

A
43
Q

RM 12 MARKER - DESIGN EXPERIMENTS TO INVESTIGATE….

Provide details of design, materials, data analysis that could be used

A

YOU GET AO3 MARKS FOR EXPLAINING WHY YOUR DOING THINGS, AO2 FOR EXPLAINING HOW. 6 + 6

DESIGN:

EXPERIMENTAL DESIGN - SAY WHAT TYPE WILL BE USED, WHY AND ADVANTAGES OF DOING SO/THINGS THAT COULD BE ADDED TO LIMIT DISADVATAGES OF SAID METHOD

VARIABLES - STATE IV AND DV. OPERATIONALISE BOTH

CONTROLS - HOW YOU LIMIT EV, OUTLINE AND RELATE TO WHAT THE EXPERIMENT IS ASKING. HOW PPS WILL CARRY OUT EXPERIMENT

MATERIALS:
IF QUESTIONNAIRE USED, MENTIONED SOME OF THE QS THAT WILL ALLOW YOU TO GAIN SOME DATA + MENTION OTHER APPARATUS REQUIRED

DATA ANALYSIS:

INFERENTIAL STATISTICS - OUTLINE WHAT TEST WILL BE NEEDED TO SHOW SIGNIFICANCE

DESCRIPTIVE STATISTICS- MEAN, MODE,MEDIAN,RANGE,SD COULD BE MENTIONED + STATE WHAT IT SHOWS