Biostatistics Flashcards
What is statistics (using information and data)
Statistics is a body of techniques and tools used in the collection, organization, analysis, interpretation and presentation of information that can be stated numerically.
It is the collection, presentation, analysis and interpretation of numerical data.
Explain the two types of statistics
Descriptive:
Describes the data that were actually collected, without generalizing beyond them. Summarizes measurements.
Involves: frequencies, proportions, measures of central tendency, measures of dispersion/variation.
e.g. the weights of final year medical students
Inferential:
Uses data from a sample to draw conclusions about the population from which the sample came.
e.g. using the weights of final year medical students to estimate the weights of medical students as a whole
Describe types of variables
Quantitative/Numerical:
Numeric values, either whole numbers (discrete) or values that can take fractions (continuous). Anything that can be counted or measured.
Qualitative/Categorical:
Describes data that fits into categories.
3 types: Binary, nominal and ordinal
meaning of observations
Any subject that serves as the data source, e.g. people, schools
meaning of variables
The characteristic that is measured on each observation, e.g. blood pressure
meaning of values
The actual result obtained from measuring a variable, e.g. 130/75 mmHg
How many people are at least 7 years old? Is this a qualitative or quantitative variable?
Quantitative, because the answer is a count of people. Specifically, discrete quantitative.
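A minimal sketch of why this variable is discrete: the answer comes from counting, so it can only be a whole number. The ages below are hypothetical illustration values, not data from the cards.

```python
# Hypothetical ages in years (illustrative values only)
ages = [3, 7, 10, 5, 12, 7]

# Count the people who are at least 7 years old
at_least_seven = sum(1 for age in ages if age >= 7)

print(at_least_seven)  # 4, a whole-number count (discrete)
```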
What are measures of central tendency/location
They are tools used to summarize an entire quantitative dataset as a single typical or central value (basically like the average).
What are the 3 measures of central tendency
Mean
Median
Mode
mean
Simply the arithmetic average
m = ∑x/n
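As a quick illustration of m = ∑x/n, the sketch below adds up the values and divides by how many there are. The weights are hypothetical, chosen only to make the arithmetic easy to follow.

```python
# Hypothetical weights in kg (illustrative values only)
weights = [60, 65, 70, 75, 80]

# m = sum of x / n
mean = sum(weights) / len(weights)

print(mean)  # 70.0
```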
Mean for grouped data
Mean = ∑fx/n
Where f = frequency of each group or class
x = mid-point (representative value) of the group
n = number of observations
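A small worked sketch of Mean = ∑fx/n, assuming x is taken as the mid-point of each class and n is the sum of the frequencies. The classes and frequencies are hypothetical.

```python
# Hypothetical grouped data as (x, f) pairs:
# x = mid-point of the class (assumed representative value), f = frequency
groups = [(55, 2), (65, 5), (75, 3)]

n = sum(f for _, f in groups)            # total observations: 10
sum_fx = sum(x * f for x, f in groups)   # 110 + 325 + 225 = 660

grouped_mean = sum_fx / n

print(grouped_mean)  # 66.0
```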
median
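The middle value of a series of data arranged in order of magnitude
Position of the median in the ordered series = (n + 1)/2; when n is even, take the average of the two middle values
Best for skewed data, because it is not pulled by extreme values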
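The sketch below shows one way to find the median by sorting and picking the middle position, and compares it with the mean on a skewed series. The function name `median` and all data values are illustrative assumptions.

```python
def median(values):
    """Return the middle value of the sorted data (average of the two middle values if n is even)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the (n + 1)/2-th value, counting from 1
        return ordered[mid]
    # Even n: average of the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2

# Hypothetical skewed series: one extreme value (90)
skewed = [1, 3, 5, 7, 90]

skewed_median = median(skewed)             # 5
skewed_mean = sum(skewed) / len(skewed)    # 106 / 5 = 21.2

even_median = median([7, 3, 9, 1])         # sorted: 1, 3, 7, 9 -> (3 + 7) / 2 = 5.0
```

Here the single extreme value pulls the mean up to 21.2, while the median stays at 5.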
Mode
Most frequently occurring observation in a series
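A minimal sketch of finding the mode by counting how often each value occurs, using `collections.Counter` from the Python standard library. The blood group values are hypothetical.

```python
from collections import Counter

# Hypothetical observations (illustrative values only)
blood_groups = ["O", "A", "O", "B", "O", "A"]

counts = Counter(blood_groups)

# most_common(1) returns a list holding the single most frequent (value, count) pair
mode_value, mode_count = counts.most_common(1)[0]

print(mode_value, mode_count)  # O 3
```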
4 common measures of variation/dispersion
Range
Interquartile range
Variance
Standard deviation
Range
This is the difference between the largest and smallest values.
For grouped data, it is the difference between the mid-points of the extreme categories
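Both definitions of the range can be sketched as follows. The raw values and class mid-points are hypothetical, and the grouped version follows the mid-point convention stated on this card.

```python
# Ungrouped data: largest value minus smallest value
values = [60, 65, 70, 75, 80]            # hypothetical weights in kg
data_range = max(values) - min(values)

# Grouped data: difference between the mid-points of the extreme classes
midpoints = [55, 65, 75]                 # hypothetical class mid-points
grouped_range = max(midpoints) - min(midpoints)

print(data_range, grouped_range)  # 20 20
```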