5.Review of Descriptive statistics and hypothesis Flashcards
what are descriptive statistics?
statistics simply to describe data collected, whether it be sample or population data. It is a Screen of the data and observation of trends
what are inferential statistics?
use sample statistics to infer something about a population
to test whether a difference/relationship seen in sample data is sufficiently large to accept it may be real in the population
allows us to test hypotheses and make decisions based on sample data
what do equations aim to do?
achieve specific things for specific purposes
what are equations made up of?
subcomponents all of which do something useful for achieving that purpose
what do equations produce?
numbers that are meaningful with respect to that purpose
what is the first thing we want to do when we imagine a set of data?
have a look at its distribution and we might want to think about how to characterise that distribution numerically
what are the characteristics of a data set?
central tendency, variability and shape
what is central tendency
mean
median
mode
what is variability?
sum of squares variance standard deviation range standard error
what is the normal distribution?
a function that represents the distribution of many random variables as a symmetrical bell-shaped graph.
what is modality (with regard to the shape of a distribution)?
the number of central clusters that a distribution possess
what are the two types of modality?
unimodal and bimodal
unimodal
scores vary around one central point
bimodal
scores vary around two “central” points
what does kurtosis mean?
“Peakedness” - how tightly clustered are scores arond the mean?
skew
the symmetry of the rails of the distribution
what are the characteristics of “normality” curve?
distribution is unimodal
has moderate peakedness
and has symmetric tails.
what does Sigma designate?
“The sum of” - so simply add them up
what is the symbol for sigma?
∑
what does ∑x mean?
the sum of all values of x
what does the mean tell us?
something useful about the center of the data-set
what does the mean not tell us?
doesnt tell us anything about the variability around the mean
what is the equation of the mean?
mean=M= (∑x)/n
what is a simple way we can calculate how each participant’s score varies with respect to the mean?
subtract the mean from each participant’s score.
X-M
how does subtracting the mean from each participant’s score characterise the data set as a whole?
when using sigma
thus ∑(X-M)
This will always sum to zero.
This is because we have subtracted the mean from each score that contributes to the mean. All we have left is the variability around the mean (which is 0)
what is ^2 (to the power of 2) also known as?
squared
what does X^2 designate?
X squared or X * X (X multiplied by itself)
why is using “square” handy?
because the square of negative numbers is positive
what is the abbreviation of sum of squared deviation?
SS
what is another way to say “Sum of squared deviation”
sum of squares
what is the equation for sum of squares or sum of squared deviations?
SS= ∑(X-M)^2
what does the sum of squared tell us?
it tells us something about the total variability in the data set, but does not really characterise the degree to which each participant varies around the mean
what is the abbreviation for variance?
SD^2
or
σ^2
how do we calculate the variance?
by dividing the sum of squares by the number of operations minus 1
That is:
σ^2= SS/(n-1)
what is the complete equation for variance?
σ^2=
(n-1)
what happens when you take the square root of the variance?
we can calculate the standard deviation
what is the abbreviation for standard deviation?
σ or SD
what is the complete equation for the standard deviation?
σ = √( SS / (n-1) )
what is the standard deviation?
the average amount of variability around the mean.. This is useful as any information about the degree of variability around the mean is important
what is the degrees of freedom?
the number of values in the final calculations of a statistic that are free to vary
what is the abbreviation of degrees of freedom?
df
what is the initial degrees of freedom equal to?
the number of observations
what is the abbreviation for the number of observations?
N
what is the equation for degrees of freedom when testing variability?
N-1
why do we minus 1 from N when calculating variabiliy (standard deviation) using degrees of freedom?
because when calculating the SD you first have to calculate the mean. In doing so, you use up one of your degrees of freedom. Therefore the df that remains for calculating the SD is N-1
what does using a degree of freedom where N-1 allow?
more accurate estimate of population parameters, which is what we want to do since we want to make inferences