9.0 Quantitative - data collection and analysis Flashcards

Question

Demographic data collection questions

Answer 1

AgeGender Ethnicity Marital status Family size Occupation Education Employment status Residence Previous contact with organisation Prior knowledge of topic First-time participant vs. repeats How you learned about the program

Answer 2

Always! With people as similar to respondents as possible: Do they understand the questions? The instructions? Do questions mean same thing to all? Do questions elicit the information you want? How long does it take? Revise as necessary

Answer 3

1. Editing data - includes procedures for detecting and correcting errors in the raw data. 2. Coding data 3. Analysis of data - computers play a major role here for analysing quantitative data. Quantitative data analysis software packages: SPSS, SAS, Stata, R, JMP, MATLAB

Answer 4

nominal, ordinal, interval and ratio

Answer 5

Variables are divided into two or more categories and assigned arbitrary numbers. These numbers have no rank order. Gender: male 1, female 2; Marital status: single 1, married 2, separated 3, divorced 4, widowed 5

Answer 6

Reflects a rank order among the categories, but we do not know how much greater than or less than. Pain: ranked on a scale of 1 to 10 with 10 being the worst pain and 1 being no pain. Anxiety: ranked on a 5-point scale from low to high.

Answer 7

Are made up of ‘real’ numbers that allow us to order the numbers and to know the distance between those numbers. The intervals between the categories are equal. Time: hours, minutes and seconds are precise intervals) Temperature: interval between the numerical values of 76 and 77 degrees is the same as the interval between the values of 44 and 45 degrees.

Answer 8

Ratios have the same properties as interval data, except that the measurement scale for the variable possesses a meaningful zero. This means that when zero is reached on the scale, the variable is absent. Weight, height: zero means no weight or no height.

Answer 9

1. Descriptive 2. Inferential

Answer 10

Describe or summarise information obtained from sample (e.g. frequency, percentage, mean, mode, median)

Answer 11

use sample results to draw conclusions regarding the relevant population. Statistics that allow a researcher to make inferences about whether relationships observed in a sample are likely to occur in the wider population from which that sample was drawn. Inferential statistics use logic and mathematical processes in order to test hypotheses relating to a specific population based on data gathered from a sample of the population of interest. Allow you to test a hypothesis or assess whether data is generalisable to the broarder population.

Answer 12

Descriptive statistics summarize the characteristics of a data set. Inferential statistics allow you to test a hypothesis or assess whether your data is generalizable to the broader population.

Answer 13

Most frequently occurring score in a frequency distribution. Only measure of central tendency where data are nominal, however it can be used with all levels of measurement.

Answer 14

Middle score or where 50% of scores are above it and 50% below it.

Answer 15

Most widely used measure of central tendency. Average of all scores. Used with interval and ration data.

Answer 16

Range Variance Standard Deviation z scores Quartiles and the interquartile range Percentil

Answer 17

Standard Deviation - meausre of average deviation or distance of each score from the group mean in a normal distribution. Must always be reported with the mean. 68% of the sample will fall within 1 SD from the mean. Small DF = less variability within the sample and the more similar the scores to the mean and to each other.

Answer 18

Variance = sum of squares / number of values Small value for variable = values are very close ot the mean and therefore similar to each other. Larger variance = values are very spread out around the mean and from each other.

Answer 19

Standard Deviation - measure of average deviation or distance of each score from the group mean in a normal distribution. Must always be reported with the mean. 68% of the sample will fall within 1 SD from the mean. Small DF = less variability within the sample and the more similar the scores to the mean and to each other.

Answer 20

Used to compare measurements/values in standard units. Takes account of mean and SD of the distribution Each score is converted to a z score and then the z scores are used to examine the relative distance of the scores from the mean - process called standardising the score. z score = 1.5 = observation is +1.5 SD above the mean z score = -2 = observation is -2 SD below the mean

Answer 21

Cuts the observations into 4 equal amounts / sections. Q1 - 25th percentile , Q3 = 75 percentile. Distance between Q1 and Q3 = interquartile range and indicates the range of the middle 50% of scores. More stable than range because it is less likely to be changed by a single extreme score.

Answer 22

Represents the percentage of cases a given score exceeds. A score in 90th percentile is only exceeded by 10% of the scores.

Answer 23

Refers to asymmetry of a distribution of interval or ration scores.

Answer 24

Related to peakness or flatness of distibution

Answer 25

Hypothesis testing Probability and level of significance 95% CI Odds ratio Errors in statistical inference: type 1 and type 2 errors Power anaylsis Effect size Tests of significance (parametric or non-parametric)

Answer 26

that the result is unlikely to have occurred due to chance fluctuations in sampling. Levels of significance - alpha levels a = 0.5 (researcher willing to accept a 5% risk that the results are in error) - minimal levels acceptable for all scientific disciplines. a = 0.01 (1% risk of error) a = 0.001

Answer 27

identifies a range of values that includes the true population value or a particular characteristic at a specified probability level. If CI passes through 0 = no realtionship e.g. 95% CI of a mean indicates if we took 100 similar samples and calculated their means, 95 of those samples would contain the 'true' population mean, and 5% wouldn't. Mean _+ SD 6.12 +- 2.54 95% probability that in the target population mean would be between 5.53 and 6.70.

Answer 28

Another way of presenting probabilities. Summary statistic that estimates the odds of an event occuring in one group compared to another. Measure of strength of association. OR of 1 = that either event is likely OR > 1 = event is more likely to happen OR <1 event is less likely to happen Can be obtained in chi square analysis, logisitc regression where outcome is dichotomous, used in meta-analyses. OR = 1.60 (can interpret at 60% increase in risk) Statistic used to assess the risk of a particular outcome and is widely used in the healthcare literature—particularly epidemiology because it can be calculated for case-control studies, in studies using logistic regression, and as a way of presenting the results of a meta-analysis.

Answer 29

Two types of errors in statistical inference. Type I - false positive, e.g. stats say treatment is effective when it isn't). Type II - false negative. Impossible to eliminate these errors however researchers try and minimise through conducting appropriate power analysis.

Answer 30

quantitative method that allows us to determine the sample size required to detect an effect of a given size with a given degree of confidence.

Answer 31

equates to the magnitude of the effect of an intervention or treatment.

Answer 32

Represent the number of data points in any given set of data are are free to vary.

Answer 33

type of inferential statistic that involves the estimation of at least one parameter. Such tests require either interval or ratio data and involve a number of assumptions about the variables under investigation including the fact that the variable is normally distributed.