Lecture 2 REVISED Flashcards

1
Q

what are common applications of statistics?

A
  • predictive modelling
  • pattern recognition
  • anomaly detection
  • classification
  • sentiment analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what are common business use cases of statistics?

A
  • customer analytics
  • targetted advertising
  • website personalisation
  • risk management
  • investment optimisation
  • fraud detection
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

examples of challenges in statistics?

A
  • varied/massive amounts of data
  • varied types of data ((un)/(semi)/structured data)
  • eliminating bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

“numbers don’t lie” ?

A

even when numbers are correct, people and organisations with their own agendas may use them to mislead

can skew the story and hide relevant facts

‘numbers don’t lie’ is false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

unethical uses of statistics

A
  • biased sampling
  • eradicating data that doesn’t support your views
  • eradicating data without justifiable reason
  • using jargon
  • deliberately using wrong method of analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

statistical enquiry circle? (PPDAC)

A
  • problem
  • plan
  • data
  • analysis
  • conclusion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

primary/secondary data?

A

primary = data collected directly from the source

secondary = data previously collected by someone else

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

differences between data and information?

A

data = raw facts/figures, input, meaningless unless contextualised

information = polished data with context, meaningful, easier to understand, output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

qualitative/quantitative data?

A

quantitative data = represents measures/counts - always numeric (interval/ratio scale)

qualitative data = names or labels used to identify an attribute (nominal/ordinal scale)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what does level of measurement determine?

A

the amount of information contained in the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are the 4 levels of measurement?

A
  • nominal
  • ordinal
  • interval
  • ratio
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

nominal data

A
  • consists of labels/names used for identification
  • can be numeric or non-numeric
  • categories are in no logical order and have no particular relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

ordinal data

A
  • exhibits properties of nominal data and may be rank ordered
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

interval data

A

represented by numbers but doesn’t have a true 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

ratio data

A

represented by numbers and has a true 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how do measurement levels work?

A

there are qualitative and quantitative measurement levels

qualitative = nominal & ordinal

quantitative = interval & ratio

17
Q

the higher the level of measurement…

A

the more precise the data is

precision doesn’t ensure accuracy

18
Q

big data

A

refers to the large & diverse sets of information

19
Q

3 V’s of big data?

A

volume, variety, velocity

  • the volume of information
  • the velocity/speed at which data is created/collected
  • the variety of data available
20
Q

structured & unstructured data?

A

structured = easily formatted & stored

unstructured = free form, less quantifiable