Block 1-4 Flashcards
Statistics can refer to what 2 things?
Data
Methods
What are 2 types of data?
Measurements
Counts
What are 2 types of statistical methods?
Descriptive
Inferential
Descriptive statistics are used to do what with data?
Organise
Summarise
Present individual data values
Inferential statistics are used to do what with data?
Use methods of probability theory to make inferences about a population from sample data.
In practice we cannot obtain data from all individuals in a population. With a good study design the sample subjects will be representative of a wider population. We can then apply the conclusions from a study sample to the population.
Methods of estimation and hypothesis testing are fundamental in making inferences.
What are “variables”? What are the 2 types of variables?
Specific characteristics of groups or individuals that are being compared. 2 types are outcome and explanatory variables.
What is an “outcome variable”? What are two other names for it?
A characteristic that we believe to be affected by the values of other variables. It is also called a response or dependent variable.
What is an “explanatory variable”? What are 2 other names for it?
A factor that may influence the outcome. Such a variable partly explains the variability of the outcome.
They are also called independent or predictor variables.
What are 2 types of data?
Qualitative
Quantitative
What is “unordered categorical”?
A qualitative variable with more than 2 categories that have no natural order. Ex: blood group, ethnic group.
What are the 3 ways that qualitative variables can be expressed? What’s an example for each?
- binary: yes/no, positive/negative
- unordered categorical: blood group, marital status
- ordered categorical (ordinal data): number of cigarettes per day, recorded in ordered categories
Is “numerical data” qualitative or quantitative? What are the 2 types of numerical data?
Quantitative. The 2 types are discrete and continuous.
What is “discrete data”? Is it qualitative or quantitative?
The result of a count, so always a non-negative integer (0, 1, 2, ...). Quantitative.
What is “continuous data”? Is it qualitative or quantitative?
The result of a measurement, where the value of the variable is not restricted to an integer. Quantitative.
What is “frequency”? Is it used for qualitative or quantitative data?
The number of times each possible value of a variable occurs. Can be used for both.
What is a “frequency distribution”?
It is a table that displays the frequency of the different values of a variable.
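As a minimal sketch, a frequency distribution can be tabulated in Python with `collections.Counter`. The blood-group values below are made-up illustrative data, not taken from any study.

```python
from collections import Counter

# Hypothetical sample of blood groups (illustrative values only).
blood_groups = ["O", "A", "A", "B", "O", "AB", "O", "A", "O", "B"]

# Counter maps each distinct value to the number of times it occurs.
frequency = Counter(blood_groups)

# Print the frequency distribution as a simple two-column table.
for group in sorted(frequency):
    print(f"{group:<3}{frequency[group]}")
```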
What is a “relative frequency”? How do you calculate it?
The frequency of a category expressed as a percentage of the total frequency. It shows what share of the whole data set falls in that category, which allows comparison between categories. Calculated by:
Relative frequency (%) = (frequency in category / total frequency) × 100
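The formula can be sketched in Python as follows. The function name `relative_frequencies` and the counts are assumptions chosen for illustration only.

```python
def relative_frequencies(frequencies):
    """Return each category's frequency as a percentage of the total frequency."""
    total = sum(frequencies.values())
    return {category: count / total * 100 for category, count in frequencies.items()}


# Hypothetical category counts (illustrative values only).
counts = {"O": 4, "A": 3, "B": 2, "AB": 1}
percentages = relative_frequencies(counts)

for category, pct in percentages.items():
    print(f"{category}: {pct:.1f}%")
```

The relative frequencies of all categories should add up to 100%, which is a quick check on the arithmetic.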
What is “cumulative relative frequency”?
The running total of the relative frequencies, reading from top to bottom of the table. For example, a table displayed this way shows what percentage of all the men had 4 or fewer sex partners, 7 or fewer sex partners, and so on.
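A minimal sketch of the running total in Python, using `itertools.accumulate`. The frequencies are hypothetical. The sketch accumulates the counts first and then converts to percentages, which gives the same result as adding up the relative frequencies while avoiding rounding drift.

```python
from itertools import accumulate

# Hypothetical frequencies for a discrete variable taking the values 0 to 4.
frequencies = {0: 5, 1: 10, 2: 20, 3: 10, 4: 5}
total = sum(frequencies.values())

# Running total of the counts, in the order the values are listed.
running_counts = list(accumulate(frequencies.values()))

# Convert each running total to a percentage of the overall total.
cumulative_pct = {
    value: running / total * 100
    for value, running in zip(frequencies, running_counts)
}

for value, pct in cumulative_pct.items():
    print(f"{value} or fewer: {pct:.0f}%")
```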
What are some guidelines in grouping quantitative data?
Guidelines for Grouping Data
- Obtain the minimum and maximum values and decide on the number of intervals.
- The number of intervals should be between 5 and 15. Too many intervals will not summarise the data; too few will obscure information.
- Determine the accuracy of the limits of each interval from the accuracy of the raw data.
- Aim for intervals of equal width; although this is not essential it is more convenient.
- Avoid making the first or last intervals open ended.
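The guidelines above can be sketched in Python as follows. The function name `group_into_intervals` and the ages are assumptions for illustration. The sketch uses equal-width intervals between the minimum and maximum, so neither end interval is open ended.

```python
def group_into_intervals(values, n_intervals):
    """Count how many values fall in each of n equal-width intervals."""
    lowest, highest = min(values), max(values)
    if highest == lowest:
        raise ValueError("all values are equal, so no interval width can be chosen")
    width = (highest - lowest) / n_intervals
    counts = [0] * n_intervals
    for v in values:
        # The maximum value is placed in the last interval rather than a new one.
        index = min(int((v - lowest) / width), n_intervals - 1)
        counts[index] += 1
    # Interval limits: n_intervals + 1 edges from the minimum to the maximum.
    edges = [lowest + i * width for i in range(n_intervals + 1)]
    return edges, counts


# Hypothetical ages in years (illustrative values only).
ages = [21, 25, 34, 38, 42, 47, 49, 53, 58, 61, 66, 71]
edges, counts = group_into_intervals(ages, 5)

for lower, upper, count in zip(edges, edges[1:], counts):
    print(f"{lower:.0f} to {upper:.0f}: {count}")
```

In practice the limits would usually be rounded to convenient values that match the accuracy of the raw data.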
What types of data do bar charts and pie charts display?
categorical or discrete data
What types of data do histograms and frequency polygons display?
continuous data
What are some key points about a bar chart?
- used to display qualitative (or discrete numerical) data
- one bar represents one category, and the height of the bar equals its frequency (or relative frequency)
- each bar is the same width and equally spaced
- bars should have a space between them to stress that they represent categorical data
- the position of each category is arbitrary if the variable is unordered (for example, the categories can be placed in alphabetical order)
- it is important that the vertical axis of a bar chart starts at zero, to avoid distortion of true differences between frequencies
What is a “clustered bar chart”?
A bar chart for two-way data, where each category of one variable is split into a cluster of bars by a second variable. For example, a data set of the frequency of bacteria in GI infection is further divided into male/female.
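As a rough sketch, the two-way frequencies behind a clustered bar chart can be counted in Python like this. The organism names and records are hypothetical illustrative data; each count would correspond to one bar in the chart.

```python
from collections import Counter

# Hypothetical (organism, sex) records; values are illustrative only.
records = [
    ("Salmonella", "male"), ("Salmonella", "female"), ("Campylobacter", "male"),
    ("Campylobacter", "male"), ("Campylobacter", "female"), ("Salmonella", "male"),
]

# Count each (organism, sex) combination: one count per bar in the clustered chart.
two_way = Counter(records)

for organism in sorted({o for o, _ in records}):
    male = two_way[(organism, "male")]
    female = two_way[(organism, "female")]
    print(f"{organism:<15} male={male} female={female}")
```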
How many variables at a time can a pie chart display?
1