Research Methods Flashcards
All information that was taught to me while attending Vanier College's "Animal Health Technology" Program, located in St-Laurent Montreal.
What is the goal of statistics
Figure out if what we observe is the result of the factor studied or a background of normal variation.Evaluate what the numbers actually meanRepresent them in a way that communicates their meaning to others
What is a variable
A characteristic that varies between individuals
What is a nominal qualitative variable
EX: Color (Catergories)
What is an ordinal qualitative variable
EX: Body condition (Catergories)
What are some biological variations
Genetics, Environment, Gender, Age
What are two types of technical errors
Human errorsInstrument errors
What is a population
All representatives in a group
What is a sample
A subgroup of the population
What are descriptive statistics used for
Used to summarize data in diagrams, tables, mean, variance.
What is inferential statistics,
To generalize from the sample something that can be applied to the whole populationEstimation of a population’s parameterHypothesis testing (to investigate a theory about the data)
Which types of diagrams are used to represent qualitative data
Bar chartsPie charts
Which types of diagrams are used to represent quantitative data
Dot diagramHistogramStem and leaf diagramBox and whisker plotScatter diagram
What are the averages taken of central tendency
MeanMedianMode
What is used to measure dispersions (spread)
RangeVariationStandard deviation
What is the price we pay by using sampling instead of questioning the whole population?
The price we pay for sampling is that we cannot make statements of absolute certainty about the population.The doubt is expressed as a probability. The larger the sample, the more representative it is.
What are the six types of studies
¤ Observational vs experimental ¤ Cross-sectionnal vs longitudinal ¤ Cohort vs case-control
What are six things to consider when trying to increase precision of the estimates
¤ Replication¤ blocks¤ independant vs pairing¤ Confounders & Interaction ¤ Outliers¤ Missing data
Why is it impossible to prove something with statistics
Because of the sampling error. We are not directly measuring a population
What is a null hypothesis
A hypothesis proving that there is no difference
What is an alternate hypothesis
The opposite of the null hypothesis: there is a difference
What is the P value
Probability. Value that makes you decide if you reject Ho or not.
What does it mean if your P value is very small
it is unlikely that we could have obtained the observed results ifthe null hypothesis were true, so we reject Ho. There is a small probability that Ho is true.
What does it mean if your P value is very large
there is a high chance that we could have obtained the observed results if the null hypothesis were true, and we do not reject Ho. There is a large probability that Ho is true.
What does an A value of 0.01 mean
even more certain to have truly taken the right decision to reject Ho
What does an A value of 0.05 mean
generally accepted
What is a type 1 error
False negative
What is a type 2 error
False positive
What is simple randomization
we would use a computer to generate the sequence, or a table of random numbers, or flip a coin
What is the drawback to simple randomization
The main drawback is that, with a small sample size, there could be a severe imbalance in the numbers assigned to each treatment.
What is sampling error
The sampling error is an error in the estimation of the POPULATION parameter, because we are using a SAMPLE of the population and not measuring the whole population.
What are the two types of sampling errors
sampling error in relation to the mean and sampling error in relation to a proportion.
Give an example of qualitative nominal
The nominal scale is composed of categories of things that can be assigned a name and are not in any particular order (coat color, white blood cells, type of diet (wet-dry)).
Give an example of qualitative ordinal
The ordinal scale is composed of categories that CAN be given an order, but there is not a consistently defined interval (body condition, toxic changes in neutrophils (1+, 2+, 3+))
Give an example of quantitative continuous
The continuous scale is composed of values on a continuum (age, hematocrit).
Give an example of quantitative discontinuous
The discontinuous scale is composed of integer3 values values (# of puppies in a litter, results of rolling a dice))
The rate of a particular enzyme reaction
Numerical, continous
number of offspring per litter
Numerical, discontinuous (discrete)
coat color in horses
Categorical, nominal
disagree, neutral , agree, somewhat agree
Categorical, ordinal
What is a measure of location used for
to measure the central tendency of the data set.
What are the measures of location?
Mean, median, mode
What is a measure of dispersion?
to measure how widely scattered the observations are in either direction from that average.
What are the measures of dispersion?
Range, interquartile range, variance, standard deviation
What is a variance a measure of
Dispersion
What is range a measure of?
Dispersion
What is mode a measure of
Location
What is interquartile range a measure of
dispersion
What type of data is best represented by a pie chart?
Pie chart: circle divided into segments with each segment portraying a different category of the qualitative variable. For categorical data.
What type of data is best represented by a bar chart?
Bar chart: diagram in which every category of the variable is represented. Thelenght of each bar (width is constant) depicts the # or % of individuals belonging to that category. For: categorical data.
What type of data is best represented by a dot diagram?
Dot diagram: each observation is a dot within horizontal and vertical axis calibrated in the units of measurement. For quantitative data (small size)
What type of data is best represented by a histogram?
Histogram: Two-dimensional diagram with (usually) the unit of measurement onthe horizontal axis and the height is proportional to the frequency (verticla axis is frequency). * The rectangles are contiguous (no space between each bar) because the numerical variable is continuous (compared to a bar chart). For quantitative variable (frequencies).
What type of data is best represented by a stem and leaf diagram?
Stem and leaf diagram: modified histogram. Row of number that represents the observations instead of a rectangle (vertical). (not used very often - you do not need to study)
What type of data is best represented by a box and whisker plot
Box-and-whisker plot (box plot): The scale of measurement of the variable isusually drawn vertically. The diagram comprises a box with horizontal limits defining the upper and lower quartiles and representing the interquartile range, enclosing the central 50% of the observations, with the median marked by a horizontal line within the box. The whiskers are vertical lines extending from the box as low as the 2.5th percentile and as high as the 97.5th percentile (sometimes the percentiles are replaced by the minimum and maximum values of the set of observations). Very common. For quantitative variables.
What type of data is best represented by a scatter diagram
Scatter diagram: effective way of presenting data when we are interested in examining the relationship between two variables which may be numerical or ordinal. The diagram is a two-dimensional plot in which each axis represents the scale of measurement of one of the two variables. Using this rectangular co-ordinate system, we relate the value for an individual on the horizontal scale (the abscissa) to the corresponding value for that individual on the vertical scale (the ordinate) by marking the relevant point with an appropriate symbol. For quantitative variables.
What is an outlier?
Outlier: An outlier is an observation whose value is highly inconsistent with themain body of the data. An outlier with an excessively large value will tend to increase the mean unduly, whilst a particularly small value will decrease it.
(T/F and Why) A qualitative variable comprises two categories which may be ordinal or numerical.
qualitative (categorical) —> nominal or ordinal scale; Numerical (quantitative) is the opposite category of qualitative.
(T/F and Why) An ordinal variable comprises categories that cannot be ordered.
ordinal (scale) is, by definition, composed of categories that CAN be ordered.
(T/F and Why) The age groups ‘young’, ‘middle aged’ and ‘old’ relate to a nominal categorical variable.
It is in the categorical (qualitative) variable group, but since we can give an order to the categories, it is part of the ordinal scale.
(T/F) Blood group is classified as a nominal categorical variable.
True
The number of eggs per clutch is a _____ variable
The number of eggs per clutch is an Numerical (quantitative) - discontinuous(discrete) variable