13: Data analysis Flashcards
What is quantitative data?
concerns quantitative variables such as measurements or counts and expressed numerically
What is qualitative data?
qualitative attributes which CANNOT be expressed numerically
What is discrete data?
Takes exact values
What are continuous variables?
Can take any value within a range and we tend to analyse the values based on which range they fall within
Information should be ACCURATE and complete. What does this acronym stand for?
Accurate
Complete
Cost-benefitial
User-targeted
Relevant
Authoritative
Timely
Easy-to-use
What is meant by descriptive statistics?
The use of statistics that summerises the data in a data set
What is inferential statistics?
Statistical methods that deduce the characteristics of a bigger population from a small but representative sample
What is exploratory data analysis?
Identifying relationships in a set of data
What is confirmatory data analysis?
Confirming a pre-determined hypothesis
What is a representative sample?
A sample that reflects the characteristics of the population from which it is drawn
Explain:
Simple random sampling
Systematic sampling
Stratified sampling
Simple random:
- RNG from whole population
- every point assigned a number
Systematic:
- all items assigned a number
- RNG for first value
- Every nth value afterward
Stratified:
- population divided into strata based on characteristic
- relative size of strata determines sample size
- RNG from each item in a strata
What are surveys?
widely used by organisations to obtain useful information for decision making and research
What is meant by ‘survey fatigue’?
When surveys are too long and don’t get too the point such that the user doesn’t answer the questions properly or quits midway
What are some top tips for writing good survey questions? (7)
- simple short clear questions
- questions that require specific answers
- avoid broad questions
- using scales rather than yes/no
- avoiding leading questions
- avoid ‘double-barrelled- questions
- qualitative data can be as important as quantitative – use focus groups
What are the Excel commands listed in the textbook?
=SUM
=AVERAGE
= COUNTIF(B2:B7, “>25000”)
What is a risk of spreadsheet use?
Errors in spreadsheets: could be errors in formulae, human error, or logic
ICAEW Principles of good spreadsheet practice: (16) + (4 risks)
- Determine what role spreadsheets play
- Adopt a standard
- Ensure everyone has appropriate knowledge and competence
- Collaborative work
- Satisfy yourself that spreadsheets are the right tool for job
- Identify your audience
- Include an ‘About’ or ‘Welcome’ sheet
- Design for longevity
- Focus on required outputs
- Seperate and clearly identify inputs
- Be consistent in structure
- Be consistent in use of formulae
- Keep formulae short and practical
- Never embed numbers in a formulae that might change
- Perform any calculation once
- Avoid using advanced features when same result can be obtained with simpler features
What is comparability?
The extent to which differences between statistics can be attributed to differences between true value of statistics
Explain data bias
Data is biased when it is not representative of the population that is being analysed
Bias can be inherent or introduced by those analysing it
What is selection bias?
When data is not selected randomly and leads to a sample that is not at all representative of a population
What is self-selection bias?
When individuals select themselves to be part of a sample
Example: online questionaire
What is observer bias?
Occurs when observing and recording results and relates to interpretation
The researcher allows their assumptions, conscious or unconscious, to influence observation
What is omitted variable bias?
where key variables are not included within the data to be analysed
What is cognitive bias?
where the perception of whether something is good or not is influenced by being shown a previous or expected value for that variable