Data & Tech Flashcards
What is discrete data?
Exact values that can be counted
What are continuous variables?
Measured values that can fall anywhere within a range - eg an age between 18 and 24
Information should be ACCURATE. What does this stand for?
Accurate
Complete
Cost-beneficial
User-targeted
Relevant
Authoritative
Timely
Easy to use
Match the key terms to their definitions
a) Descriptive statistics
b) Inferential statistics
c) Exploratory data analysis
d) Confirmatory data analysis
1) Using statistical methods to confirm a pre-determined hypothesis
2) Statistical methods that deduce the characteristics of a bigger population from a small but representative sample
3) Identifying relationships in data sets
4) Statistics that summarise the data in a data set
a 4
b 2
c 3
d 1
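The contrast between descriptive and inferential statistics can be sketched in a few lines. This is only an illustration: the sample values are made up, and the 95% interval uses a normal approximation, which is a simplifying assumption rather than the only way to do it.

```python
from statistics import NormalDist, mean, stdev

# Hypothetical sample of 10 observations (illustrative values only).
sample = [12, 15, 11, 14, 13, 16, 12, 15, 14, 13]

# Descriptive statistics: summarise the data we actually have.
sample_mean = mean(sample)
sample_sd = stdev(sample)

# Inferential statistics: estimate the population mean from the sample.
# Approximate 95% confidence interval using the normal distribution.
z = NormalDist().inv_cdf(0.975)
margin = z * sample_sd / len(sample) ** 0.5
ci = (sample_mean - margin, sample_mean + margin)

print(f"Sample mean: {sample_mean}, sample SD: {sample_sd:.2f}")
print(f"Approx. 95% CI for population mean: {ci[0]:.2f} to {ci[1]:.2f}")
```

The first two figures describe the sample itself; the interval is a statement about the wider population, and it is only as good as the sample is representative.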
Match the key terms to their definitions
a) Simple Random Sampling
b) Systematic Sampling
c) Stratified Sampling
1) All items are assigned a number. The first number is chosen at random, then every nth number after it is selected
2) All items are assigned a number. A random number generator selects the sample
3) The population is divided into groups. Samples are drawn from each group in proportion to its share of the population
a 2
b 1
c 3
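The three sampling methods above can be sketched roughly as follows. The function names, the population of 100 numbered items and the two age groups are all hypothetical, chosen only to make the example runnable.

```python
import random

def simple_random_sample(items, k, rng):
    # Every item has an equal chance of being selected.
    return rng.sample(items, k)

def systematic_sample(items, k, rng):
    # Random start within the first interval, then every nth item.
    # Assumes k is no larger than the number of items.
    n = len(items) // k
    start = rng.randrange(n)
    return [items[start + i * n] for i in range(k)]

def stratified_sample(strata, k, rng):
    # strata maps a group name to the list of items in that group.
    # Each group contributes in proportion to its share of the population.
    # Rounding means the total can differ slightly from k for awkward splits.
    total = sum(len(group) for group in strata.values())
    sample = []
    for group in strata.values():
        share = round(k * len(group) / total)
        sample.extend(rng.sample(group, share))
    return sample

rng = random.Random(0)  # fixed seed so the run is repeatable
population = list(range(1, 101))  # items numbered 1 to 100
strata = {"under_30": population[:60], "30_and_over": population[60:]}

srs = simple_random_sample(population, 10, rng)
sys_s = systematic_sample(population, 10, rng)
strat = stratified_sample(strata, 10, rng)

print("Simple random:", srs)
print("Systematic:   ", sys_s)
print("Stratified:   ", strat)
```

With a 60/40 split and a sample of 10, the stratified sample takes 6 items from the first group and 4 from the second.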
What are type 1 and type 2 errors in hypothesis testing?
Type 1: False positive, where the null hypothesis is true but is rejected because the sample results happen to be significantly different
Type 2: False negative, where the null hypothesis is false but is not rejected because the sample happens to agree with it
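A small simulation can make the two error types concrete. This is a sketch under stated assumptions: a two-sided z-test with a known standard deviation of 1, a 5% significance level, samples of 30, and a true mean of 0.3 for the "null is false" case. All of these are illustrative choices, not part of the flashcard.

```python
import random
from statistics import NormalDist, mean

def z_test_rejects(sample, mu0, sigma, alpha=0.05):
    # Two-sided z-test with a known population standard deviation.
    z = (mean(sample) - mu0) / (sigma / len(sample) ** 0.5)
    critical = NormalDist().inv_cdf(1 - alpha / 2)
    return abs(z) > critical

def rejection_rate(true_mean, trials, rng, n=30, mu0=0.0, sigma=1.0):
    # Fraction of simulated samples for which the null (mean = mu0) is rejected.
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(true_mean, sigma) for _ in range(n)]
        if z_test_rejects(sample, mu0, sigma):
            rejections += 1
    return rejections / trials

rng = random.Random(1)  # fixed seed so the run is repeatable

# Null is true (true mean equals mu0): every rejection is a Type 1 error.
type1_rate = rejection_rate(true_mean=0.0, trials=2000, rng=rng)

# Null is false (true mean is 0.3): every failure to reject is a Type 2 error.
type2_rate = 1 - rejection_rate(true_mean=0.3, trials=2000, rng=rng)

print(f"Type 1 error rate: {type1_rate:.3f}")
print(f"Type 2 error rate: {type2_rate:.3f}")
```

The Type 1 rate should land close to the chosen significance level, while the Type 2 rate depends on the sample size and on how far the true mean is from the hypothesised one.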
What are the four Vs of big data?
Volume
Velocity
Variety
Veracity - trustworthiness
What is the difference between structured and unstructured big data?
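Structured - organised in a predefined format (eg database tables), typically collected with a purpose in mind
Unstructured - no predefined format (eg free text, images, video), often collected without a specific objective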