ED+A Flashcards
Hypothetico-deductive reasoning
Hypothesis created not by induction, experiments used to falsify hypothesis. Can be ‘good’ hypothesis however arguably there is no way of proving it to be true.
Hypothesis
Preposition tentatively put forward to explain an observation.
Alternate Hypothesis
(H1) Hypothesis makes a specific prediction about results which can later be tested.
Theory
Set of general ideas or rules to explain a group of observations. More general than hypothesis, less speculative.
Paradigm
Describes a whole way of thinking or a particular way of viewing the world.
Paradigm shift
Dramatic change in the way we think about a subject when evidence has accumulated in favour of rejecting a previous set of hypotheses or theories, or a creative genius moment.
Null Hypothesis
(H0) Form of hypothesis we test using statistics following an observation. Predicts NOTHING will happen/No effect/No difference or relationship. Hope to reject is data supports alternate hypothesis. Only one null.
Statistics
Branch of mathematics scientists use for an objective assessment of patterns in data from experiments or observations.
Nominal data
CATEGORICAL In the form of categories with names (e.g male or female). Non-quantitative.
Discrete data
QUANTITATIVE Count how many individuals in each group of Nominal data. Quantitative and always in the form of whole numbers.
Ordinal data
CATEGORICAL Ranked in order of size or on a rating scale (e.g 1st, 2nd). Not quantitative as we do not know the difference between 1st and 2nd, only that 1st is larger (e.g strongly agree, disagree)
Continuous data
QUANTITATIVE (e.g temperature, time) Subjective decision between continuous and discrete.
Descriptive statistics
Measures calculated from a data-set which summarise some characteristic of the data (central tendancy or variability)
Sample size
(n) number of individuals sampled.
Frequency
Number of times something occurs, or a count of the number of items in a category.
Mean
A measure of central tendency. Average of a sample of numbers
Median
A measure of central tendency. Middle number in a sample of numbers when placed in order. If sample is even then the average of the two middle numbers is taken.
Mode
A measure of central tendency. The most common number.
Measures of central tendency
Mean, Median, Mode - all tell about the position of the middle of the sample.
Frequency histogram
Graph showing the frequency of quantitative observations in each category.
Discrete - categories represent each possible total count made. Continuous - categories are arbitrary (1.-0, 11-20) you decide.
Distribution
Shape of data set as seen on frequency histogram. Described by mathematical equations.
Deviate
Distance between a data point/observations and the mean. Also known as residual.
Sum of Squares
(SS) total of all the squared deviates for a particular data-set. Gets rid of minus signs, quantifies the magnitude of the total variability but ignores direction of variability.
Variance
(S^2) Average size of the deviates. Measure of variability. Sample variance is an estimate of population variance.
Standard deviation
(s) Average size of deviates by square rooting variance we get a measure of variation unaffected by sample size. Standard deviation of 2.5 means the average data point is 2.5 times larger or smaller than the mean.
Population
All individuals in a group
Sample
Sub-set of population normally chosen to represent the population.
Normal distribution
‘Bell curve’ or Gaussian distribution. Continuous - useful, symmetrical, 68.5% of all data points in normal population will be within one standard deviation of mean.
Standard Error of the mean
Measure of confidence in sample mean as an estimate of real population mean. Small SEM = good estimate. If error bars do not overlap sample means are different. SEM decreases with sample size as more data means more confidence. = Standard deviation of a population of sample means.95% confidence interval.
Skew
Skewed to right = long tail to the distribution on the right and no symmetry (mode closer to left than right) = Not normal.
Parametric statistics/statistical tests
make several key assumptions about distibution. e.g it is normal.
Non parametric statistics/test
fewer assumptions about data.
Poisson distribution
Common for discrete data if maximum possible count is larger than mean. Many different shapes, if mean is near to zero = heavily skewed normal distribution, mean is big = normal distribution
Binomial distribution
good for discrete data where maximum possible count is close to the mean.
Bar chart
type of graph used for visualising differences between samples.