Sub-topic 1: Variables, distributions and summary statistics Flashcards
Week 2
Biologists make observations of (collect data on) selected
variables on a sample from the population to estimate
the value of one or more parameters of that
population.
Yes.
Variable: any observable feature of the natural world (e.g. number of limpets in a quadrat, sex of a frog, moisture content of a leaf). These are all variables as they have the potential to vary
Yes.
Population: the target group of interest in
the study. Can be finite (e.g. number of fish
in a pond) or infinite (e.g. number of fish in
the ocean)
Yes.
Sample: we cannot practically count every unit in a population, therefore we sample a subset of a population and attempt to draw inferences about the entire population from this sample
Yes.
Parameters: a parameter is some
characteristic of the distribution of the
variables in a population (e.g. the
average or variance of weights of fish in
a pond)
Yes.
VARIABLE A variable is any observable feature of the natural world
Yes.
DATUM A datum, or observation, is any one record of the state of a variable.
Yes.
DATASET Any collection of observations made on a variable is a data set
Yes.
POPULATION The set of all possible observations on a variable is the population
Yes.
FINITE
POPULATION
Populations can be either finite or infinite. Finite populations have a finite, countable
number of elements and can, in theory at least, be completely sampled.
Yes.
INFINITE
POPULATION
Infinite populations have an infinite number of elements and can never be completely
sampled.
Yes.
SAMPLE Large and infinite populations cannot be observed in their entirety, so we take only
(nearly always randomly) a sample (sub)set of observations from a population.
Yes.
PARAMETER A parameter is some characteristic of the distribution of the values of a variable in a
population.
Yes.
STATISTIC The term “statistic” is used in two ways: to refer to the entire body of procedures for
dealing with data; or to refer to estimates of population parameters based on samples.
Yes.
NOMINAL/
CLASSIFICATION
Features which can be classified into named groups, lacking
order
Yes.
ORDINAL/
RANKING
Features which can be ranked in order
Yes.
NUMERICAL/
QUANTITATIVE
Features which can be enumerated or quantified (counted or
measured)
e.g. weight, number, temperature, counts of animals
Can be subdivided into
• e.g. Interval v Ratio: arbitrary zero and unit
(temperature [Celsius]) v true zero (weight)
• Discrete v Continuous: values which are whole
numbers (counts) v values which can be fractions
(weight)
Yes.
Measures of location Mode most common value Median middle value Mean average value
Yes.
Symbols Sample mean ⨱ Population mean μ
Yes.
Measures of shape Variance spread of distribution Skewness skew of peak of distribution to one side of the mean Kurtosis “peakedness” or “flatness” of distribution
Yes.
Symbols Sample variance s2 Population variance σ2 Sample standard deviation s Population standard deviation σ
Yes.
VARIANCE (s2): measures the dispersion of data around their
mean value
Yes.
NORMAL DISTRIBUTION: a symmetric distribution, often called a
bell-curve which describes many parameters of the natural world,
e.g. height, weight, test scores in a very large class
Yes.