Data Flashcards

1
Q

What are data outliers?

A

Observations that are abnormal and can significantly distort the results

Can be removed from the data set

BUT must be a clear and valid reason, or may be a risk of data manipulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is big data?

A

Data sets so large and varied they are beyond the capability of traditional data-processing

Obtained in addition to the traditional management information data

Provide a deeper understanding of customer’s needs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The 4Vs of big data?

A

Volume: scale and amount of information

Velocity: timeliness of the data

Variety: formats, including structured and unstructured data

Veracity: reliability of information, keeping it clean and free from bias, interpretation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The purposes of big data in budgeting?

A

Identify trends and other correlations

Improve forecasting

Improve overall profitability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is data analytics and data mining?

A

Analytics: process if collecting, organising and analysing data to generate trends and aid decision making

Data mining: sorting through data to identify patterns and relationships, using algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Structured v unstructured data?

A

Structured: data contained within a field in a data record or file

Unstructured: data not easily contained within structured data fields

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Ads of big data?

A

Substantial amount of info can be processed

Different sources

Accurate model of future demand

Understand customer’s preferences

Short term and long term decisions

Provides real time information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Dis of big data?

A

Company needs to be seen as trustworthy

Lack of forecasting tools

Infringement of privacy

Security required to hold information

Incorrect data

Lack of skilled data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

AI versus Machine Leaning?

A

AI - use of computers to do tasks which are thought to require human intelligence

ML - field in AI whereby computers and learn and do things rather than follow pre-programmed rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The 7 types of data bias?

A

Selection - sample size doesn’t represent population

Self-selection - individuals select themselves

Observer - researcher allows assumptions to influence the observation

Omitted variable - key data not included

Cognitive bias - presentation of data is misleading

Confirmation - people see data that confirms their beliefs

Survivorship - sample only contains items that survived a previous event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly