Research and Analysis Exam Flashcards
What are the ethical standards that research must adhere to? The research must respect..
Dignity
Rights
Safety
Wellbeing
of the people who take part
What does research communication mean in ethics?
Research communication must be plagiarism free and you must accurately represent your results
What does informed consent entail?
- What the study will be about
- The risks / benefits of taking part
- Your supervisor’s contact info and the institution’s ethics approval number
- Making sure that the data is kept confidential and that they are free to stop filling in the survey at any point for any reason
- Should give them at least 24 hrs to decide and opportunity to ask questions
What is data pseu-donymisation?
Data pseu-donymisation is an alterative method where you replace identifiable info of participants with fake (pseudonymous) identifiers.
Data can still be linked to participants but its harder to do so because you separate personal info from the study data
Giving each participant a random 3 digit number is an example of what ethical issue?
Anonymity.
You separate their personally identifying info from their research data and include the participant numbers in both files. Data can only be accessed by participants and researchers
What are the 4 factors of potential for harm?
Psychological
Social harm
Physical harm
Legal harm
When can secondary data analysis be used?
If the data is purely quantitative
Theory vs. Data
What is hypothesis induction? What is the opposite of this?
Hypothesis induction is when a theory or hypothesis is formed using data collect from research
Deduction is when a theory is being tested and used to make predictions and hypotheses about the data
What is truncation in regard to literature searching?
Truncation, stemming, is used to broaden your search to include various word endings and spellings by using an “*” or other symbols at the end of the root word
Example:
child* = child, childs, children, childrens, childhood
What is “wildcarding” in literature searching?
Wildcard, allows you to search for variations within the word (useful for picking up terms for UK/US variations) e.g. wom?n
What is adjacency / proximity in literature searching?
Allows you to decide how far apart you want two key words to be
For example: The search ‘mobile app$ adj3 effective$’ will return results in which “mobile app$” and “effective” are within three words of each other. To use the adjacent function use the feature adj within your search followed by the number of words you wish to limit it by.
What does it mean to have different levels of Independent Variables?
It means that there can be different categories of IV’s, called levels, conditions or treatments
Can there be multiple Independent variables?
Yes. You can have multiple IV’s but only belong to one level for each
Ex. Vaccine study participants have 2 levels (vaccinated or not)
Ex. Your height, weight and BP - you can have multiple of these
What is an example of categorical, ordered data?
Year of Undergraduate study (Yr 1-3)
What is an example of categorical, not ordered data?
Which degree program you’re in
What is the difference between nominal and ordinal data?
Nominal data is categorical and cannot be counted
E.g. Gender, occupation etc.
Ordinal data however, is categorical and can be counted, but not added or subtracted
E.g. satisfaction rating (extremely dissatisfied to extremely satisfied), education level (high school to PhD), spicy (mild to hell)
What is interval data?
Interval data can be ordered, but you cannot compute a ratio between two values because no meaningful zero exists
Examples: Exam mark, date, year
What is a ratio data type?
Ratio data is interval data that can have a ratio between the two because it has a meaningful zero
Example:
Distance, height, annual income, number of successes etc.
Number of years in education is an example of which type of data?
Ratio: Even if the same unit “year” is being used, now you can imagine
calculating the ratio between two values, e.g. “my years of education are twice
yours”, because there exist the meaningful zero.
Blood type is an example of which type of data?
Nominal: Blood types are categories but cannot be ordered.
Year (in Gregorian calendar, e.g. 2002) is an example of which type of
data?
Interval: Years can be ordered and have measurable & fixed intervals. Also
their intervals, i.e. difference between years, can be calculated. However, their
ratio cannot be taken because there is no meaningful zero in the years. In
other words, it makes no sense to double an year, e.g. twice of 1988
A response on a questionnaire about service quality between 1
(extremely unhappy) and 7 (extremely happy) is an example of which
type of data?
Ordinal: Response can be ordered but their intervals cannot be
measured/quantified
What is the median
Middle value. Cannot be obtained in nominal data
How do you calculate the mean of a set of data?
Sum of all the values divided by the number of values
The mean can be greatly affected by an outlier
Sections of data containing the same amount of data is called what?
Quantiles.
If there are 4, they are called quartiles
If there are 100, they are called percentiles
What is standard deviation?
The standard distance from the mean which is the square root of variance
What is a Z-score?
A Z-score is ratio with respect to Standard Deviation that enables fair comparison of deviations
It is the value minus the mean divided by the SD
Ex. Mary’s height is 5.6 ft and is 0.3 taller than the mean. Therefore, (5.6-0.3) / 0.3 = 1.0
A Z-score of 0 means the data point is exactly at the mean of the dataset.
A positive Z-score indicates that the data point is above the mean of the dataset.The larger the positive Z-score, the farther above the mean the data point is.
For example, a Z-score of 2 means the data point is 2 standard deviations above the mean.
The opposite is true for negative Z-scores, it would mean that the value is below the mean
In Jamovi, units are the SD plus or minus from the mean
What does positive and negative skewness of data mean?
Positive skewness means that the data is shifted towards the left
Negative skewness means that the data is shifted towards the right
If skewness is zero, the data is not shifted and is in the centre/symmetrical
What does the kurtosis of data mean?
Kurtosis measures the “sharpness” of data or how thin it is. it is always positive
At what value can you exclude an outlier from a data set?
If the Z-score is greater than 3, or less than -3
i.e. when the distance from the mean is more than 3 SD’s or less than -3 SD’s
OR
When using Inter-Quartile Range (IQR): the width between the 1st and 3rd quartile, if the outlier is greater than 1.5 IQR above the 3rd quartile or less than 1.5 IQR below the 1st quartile, it can be excluded from the dataset
In the binomial function:
Bi ( A / B, q ) = 0.25 what do each of the numbers mean?
That would mean that the probability of obtaining A on the condition of B. “q” would mean the probability of success rate.
For example: Bi ( 3 / 3, 0.5) = 0.125
means that the probability of obtaining 3 heads in three trials where the expected probability of getting heads is 50% is 0.125
What is a binomial distribution used for?
When there is a probability distribution of two choices. Using a BD you can assess how likely that an event (i.e. k heads in n tosses) happens by chance.
When one calculates the probability that the value falls in a certain range, what is this called?
Cumulative probability
Example: What is the probability of getting less than 40 heads on 100 times