Data Mining Flashcards

1
Q

what is data mining?

A

the automatic analysis of large data sets in a data warehouse. pattern recognition are used to idenify patterns and to predict trends. data is combined from multiple sources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what are the main features of data mining?

A

its involves analysing large data sets to identify patterns to predict future trends

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is big data?

A

its a term associated with data sets than are so complex that tradirional database and other processing applications are unable to capture, manage and process within acceptable time frame

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are the big data challenges?

A

volume- amount of data to be processed

variety- the number of types of data to be anaylsed

velocity- the speed of data processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does digital technology in data mining allow?

A

it allows us to collect data for further analysis using mthids such as online forms, mobile phones data transmission, email data and stock market data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what are commonly used data souces?

A

social media

machine data- data regenerated from devices such as RFID generated chip readers, GPS results

transactional data- data regenerated from companies such as ebay, amazon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

internal data sources

A

customer details, product details, sales data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

external data sources

A

data collected from business partners, data suppliers, internet

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is key requirements of big data storage?

A

it can handle very large amounts of data and keep scaling to keep up with large amounts of data and keep up with the growth of data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is Network Attached System?

A

this is a file access shared storage which can easily be scaled out to meet the increased capacity or computing requirements required for big data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are some of the methods of processing in big data storage?

A

cluster analysis- where groups of data records are identified

classification- where the data mining process is used to determine an appropriate structure to new data.

anomaly detection- where unusual records are identified.

regression- where relationships between data variables are investigated to help how a change in an independent variable can impact upon a dependent data variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what can big data do for organisations?

A

help gain insight, help into potential revenue increases he or p them determine how to improve operations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are the key objectives of using big data in financial sector?

A

ensuring they are complying with regulations- using traditional data processing platforms to support objective- increase expense

improving risk analysis- can help identify fraudulent activity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how does retail use big data?

A

predicting trends and forecasting demand

price optimisation

identifying potential customer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly