13: Data analysis Flashcards

1
Q

What is quantitative data?

A

concerns quantitative variables such as measurements or counts and expressed numerically

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is qualitative data?

A

qualitative attributes which CANNOT be expressed numerically

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is discrete data?

A

Takes exact values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are continuous variables?

A

Can take any value within a range and we tend to analyse the values based on which range they fall within

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Information should be ACCURATE and complete. What does this acronym stand for?

A

Accurate
Complete
Cost-benefitial
User-targeted
Relevant
Authoritative
Timely
Easy-to-use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is meant by descriptive statistics?

A

The use of statistics that summerises the data in a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is inferential statistics?

A

Statistical methods that deduce the characteristics of a bigger population from a small but representative sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is exploratory data analysis?

A

Identifying relationships in a set of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is confirmatory data analysis?

A

Confirming a pre-determined hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a representative sample?

A

A sample that reflects the characteristics of the population from which it is drawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Explain:
Simple random sampling
Systematic sampling
Stratified sampling

A

Simple random:
- RNG from whole population
- every point assigned a number

Systematic:
- all items assigned a number
- RNG for first value
- Every nth value afterward

Stratified:
- population divided into strata based on characteristic
- relative size of strata determines sample size
- RNG from each item in a strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are surveys?

A

widely used by organisations to obtain useful information for decision making and research

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is meant by ‘survey fatigue’?

A

When surveys are too long and don’t get too the point such that the user doesn’t answer the questions properly or quits midway

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are some top tips for writing good survey questions? (7)

A
  • simple short clear questions
  • questions that require specific answers
  • avoid broad questions
  • using scales rather than yes/no
  • avoiding leading questions
  • avoid ‘double-barrelled- questions
  • qualitative data can be as important as quantitative – use focus groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the Excel commands listed in the textbook?

A

=SUM

=AVERAGE

= COUNTIF(B2:B7, “>25000”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a risk of spreadsheet use?

A

Errors in spreadsheets: could be errors in formulae, human error, or logic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

ICAEW Principles of good spreadsheet practice: (16) + (4 risks)

A
  1. Determine what role spreadsheets play
  2. Adopt a standard
  3. Ensure everyone has appropriate knowledge and competence
  4. Collaborative work
  5. Satisfy yourself that spreadsheets are the right tool for job
  6. Identify your audience
  7. Include an ‘About’ or ‘Welcome’ sheet
  8. Design for longevity
  9. Focus on required outputs
  10. Seperate and clearly identify inputs
  11. Be consistent in structure
  12. Be consistent in use of formulae
  13. Keep formulae short and practical
  14. Never embed numbers in a formulae that might change
  15. Perform any calculation once
  16. Avoid using advanced features when same result can be obtained with simpler features
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is comparability?

A

The extent to which differences between statistics can be attributed to differences between true value of statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Explain data bias

A

Data is biased when it is not representative of the population that is being analysed

Bias can be inherent or introduced by those analysing it

20
Q

What is selection bias?

A

When data is not selected randomly and leads to a sample that is not at all representative of a population

21
Q

What is self-selection bias?

A

When individuals select themselves to be part of a sample

Example: online questionaire

22
Q

What is observer bias?

A

Occurs when observing and recording results and relates to interpretation

The researcher allows their assumptions, conscious or unconscious, to influence observation

23
Q

What is omitted variable bias?

A

where key variables are not included within the data to be analysed

24
Q

What is cognitive bias?

A

where the perception of whether something is good or not is influenced by being shown a previous or expected value for that variable

25
Q

What is confirmation bias?

A

occurs when people see data that confirms their beliefs and they ignore data that disagrees with their beliefs

26
Q

What is survivorship bias?

A

sample only contains items that survived some previous event

Example: exam results

27
Q

What is statistical significance?

A

the results generated by testing or experimentation are unlikely to occur by chance or randomly, but occur due to a specific cause

28
Q

What is Type I and Type II error?

A

Type I:
- ‘false positive’
- where the null hypothesis Ho is true

Type II:
- ‘false negative’
- where the null hypothesis is false, but is accepted because the sample result is not statistically different to Ho

29
Q

What are the four principles of effective data visualisation?

A
  1. appropriate type of chart has to be picked
  2. an appropriate scale should be chosen
  3. charts should have a clear title
  4. appropriate use of colour and shading
30
Q

What are the three data visualisation examples given in the textbook?

A
  1. Bar charts
  2. Pie charts
  3. Line charts
31
Q

What is meant by big data?

A

Datasets whose size is beyond the ability of typical database software to caputre, store, manage, and analyse

32
Q

What are the four characteristics of Big Data? (The Four Vs)

A
  • Volume
  • Velocity
  • Variety
  • Veracity (trustworthiness or accuracy)
33
Q

Types of structured data: (4)

A
  • created data
  • provoked data
  • transacted data
  • compiled data
34
Q

Types of unstructured data (2)

A
  • captured data
  • user-generated data
35
Q

ICAEW sets out the driving forces behind the increasing importance of big data (3)

A
  • new sources of data is unstructured human-sourced and machine-generated data
  • exponential growth in computing power and storage means entire data can be captured and processed regardless of size
  • new infrastructure for knowledge creation such as crowdsourcing and open-source software
36
Q

What is data analytics?

A

The process of collecting, organising, and analysing large sets of data to discover patterns and other information which an organisation can use for its future business decisions

37
Q

What are the four types of data analytic and what associated question to they require to be answered?

A

Descriptive analytics
- What has happened?

Diagnostic analytics
- Why has something happened?

Predictive analytics
- What is likely to happen in the future?

Prescriptive analytics
- What is the best course of action?

38
Q

A report by the management consultants at McKinsey highlight the following six ways big data and data analytics can be used by a business to create value:

A
  • Enhance transparency
  • Performance improvement
  • Market segmentation and customisation
  • Decision making
  • Innovation
  • Risk management
39
Q

What are the risks of big data and data analytics? (6)

A
  1. Storage
  2. Workforce skills
  3. Data dependency
  4. Information overload
  5. Data privacy
  6. Data security
40
Q

What legislation governs data protection in the UK?

A

Data Protection Act 2018

41
Q

What is IP and what is it governed by?

A

Intellectual property

Intellectual property law

42
Q

What are the five methods of IP protection?

A
  • Copyright
  • Design right
  • Trademark
  • Registered design
  • Patent
43
Q

What are the protections offered for copyright law? (Lengths of time for each medium)

A

written, dramatic, musical, artistic:
- 70 years from artist death

sound and music recording:
- 70 years from publishing

films
- 70 years after death of the director

broadcasts
- 50 years from when first broadcasted

published written, dramatic, or musical work
- 25 years from when first published

44
Q

What are the protections offered for design rights? (Lengths of time for each medium)

A

15 years after created
10 years after it is sold, whichever is earlier

45
Q

What are the protections offered for trademark? (Lengths of time for each medium)

46
Q

What are the protections offered for registered design? (Lengths of time for each medium)

47
Q

What are the protections offered for patent? (Lengths of time for each medium)