The next part about data (I think) Flashcards

1
Q

Big data

A

Data sets with volumes so huge that they are beyond the ability of a typical DBMS (database management system) to capture, store, and analyze.
- Massive sets of unstructured and semi-structured data from web traffic, social media, sensors, and so on.
- Can reveal more patterns and relationships than smaller data sets, but requires new tools and technologies to manage and analyze.

2
Q

The four V's of data
- Volume
- Variety
- Velocity
- Veracity

A

- Scale of data
- Different forms of data
- Speed of data (analysis of streaming data)
- Uncertainty of data

3
Q

Hadoop

A

Open-source software framework that runs on a collection of inexpensive computers. It breaks a big data problem down into sub-problems, distributes them across up to thousands of inexpensive computers, and then combines the results into a smaller data set that is easier to analyze.
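
The split, distribute, and combine idea can be sketched in plain Python. This is only an illustration that simulates the "machines" inside one process; it does not use Hadoop's actual API, and the function names and sample lines are made up for the example.

```python
from collections import Counter
from functools import reduce

def split_into_chunks(records, n_chunks):
    """Divide the records into chunks, one per simulated machine."""
    return [records[i::n_chunks] for i in range(n_chunks)]

def map_chunk(chunk):
    """Map step: each machine counts the words in its own chunk only."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts

def combine(partials):
    """Reduce step: merge the partial counts into one small result set."""
    return reduce(lambda a, b: a + b, partials, Counter())

# Hypothetical input standing in for a very large data set.
log_lines = ["big data", "data mining", "big big data"]

partials = [map_chunk(chunk) for chunk in split_into_chunks(log_lines, 2)]
totals = combine(partials)
print(totals)
```

On a real cluster the map step would run in parallel on separate computers; here the list comprehension plays that role.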

4
Q

Technology impact on business firms

A
  1. Every company can use internet technology, which makes it easy for rivals to compete and for new competitors to enter the market.
  2. Because information is available to everyone, the internet raises the bargaining power of customers, who can quickly find the lowest-cost provider on the web.
  3. The internet has nearly destroyed some industries and has threatened more.
  4. The internet has also created new markets and provided new opportunities for building brands with very large and loyal customer bases.
5
Q

Tools facilitating big data analysis

A
  1. Data warehouse
    A database that stores current and historical data of potential interest to decision makers throughout the organisation.
    It stores three types of information: historical company data, current company data, and relevant external data.
    Data marts
    A subset of a data warehouse in which a summarized or highly focused portion of the organisation's data is placed in a separate database for a specific population of users.
  2. Hadoop (very large volumes of data)
    Enables distributed parallel processing of big data across inexpensive computers.
  3. In-memory computing
    Relies on the computer's main memory (RAM) for data storage, which gives faster and more predictable performance.
  4. Analytical platforms
    Full-featured technology solutions that join different tools and analytical systems together, designed for high-speed analysis of large data sets.
6
Q

OLAP (online analytical processing)

A

Tool that enables users to view the same data in different ways using multiple dimensions (for example product, pricing, cost, region, or time period).
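
A small sketch of the "same data, different dimensions" idea, assuming a handful of made-up sales records. The `roll_up` helper and the field names are hypothetical; real OLAP tools work on precomputed cubes rather than Python lists.

```python
from collections import defaultdict

# Hypothetical sales facts; each row has three dimensions and one measure.
sales = [
    {"product": "bolt", "region": "East", "quarter": "Q1", "units": 100},
    {"product": "bolt", "region": "West", "quarter": "Q1", "units": 40},
    {"product": "nut", "region": "East", "quarter": "Q1", "units": 70},
    {"product": "nut", "region": "East", "quarter": "Q2", "units": 30},
]

def roll_up(rows, *dimensions):
    """Total the units for every combination of the chosen dimensions."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[d] for d in dimensions)
        totals[key] += row["units"]
    return dict(totals)

# The same data viewed along different dimensions.
by_product = roll_up(sales, "product")
by_region_quarter = roll_up(sales, "region", "quarter")
print(by_product)
print(by_region_quarter)
```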

7
Q

Data mining

A

Discovery-driven analysis. Finds hidden patterns in data, such as customer buying behavior, to predict future behavior of customers. The types of information obtainable from data mining include associations, sequences, classifications, clusters, and forecasts.

8
Q

Associations

A

Occurrences linked to a single event, such as items bought together in one purchase.
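
As a rough illustration, an association can be found by counting which items occur together in the same event. The baskets below are invented, and `confidence` is a hypothetical helper showing one common way to express how strong a link is.

```python
from collections import Counter
from itertools import combinations

# Hypothetical purchases; each set is one event (one shopping basket).
baskets = [
    {"chips", "cola"},
    {"chips", "cola", "salsa"},
    {"chips", "salsa"},
    {"bread"},
]

# Count how often each pair of items appears in the same basket.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

def confidence(antecedent, consequent):
    """Share of baskets containing `antecedent` that also contain `consequent`."""
    with_antecedent = [b for b in baskets if antecedent in b]
    both = [b for b in with_antecedent if consequent in b]
    return len(both) / len(with_antecedent)

print(pair_counts.most_common(2))
print(confidence("cola", "chips"))
```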

9
Q

Sequences

A

Events that are linked over time.

10
Q

Classifications

A

Recognizes patterns that describe the group to which an item belongs by examining existing items that have already been classified and inferring a set of rules.
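
A minimal sketch of inferring a rule from already-classified items, assuming a single numeric feature. The customer data and the `learn_threshold` helper are hypothetical; real classifiers learn far richer rule sets.

```python
def learn_threshold(examples):
    """Infer the rule 'label is True when value >= threshold'.

    examples: list of (value, label) pairs that are already classified.
    Returns the candidate threshold that misclassifies the fewest examples.
    """
    candidates = sorted({value for value, _ in examples})

    def errors(threshold):
        return sum((value >= threshold) != label for value, label in examples)

    return min(candidates, key=errors)

# Hypothetical customers: (months inactive, did the customer leave?)
labeled = [(1, False), (2, False), (3, False), (6, True), (8, True), (9, True)]
threshold = learn_threshold(labeled)

def classify(months_inactive):
    """Apply the inferred rule to a new, unclassified item."""
    return months_inactive >= threshold

print(threshold, classify(7), classify(2))
```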

11
Q

Clustering

A

Works in a manner similar to classification when no groups have yet been defined.
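
To show grouping without predefined labels, here is a very small two-group version of the k-means idea on one-dimensional data. The spending figures are invented and `two_means` is a hypothetical helper, not a library function.

```python
def two_means(values, iterations=10):
    """Split numbers into two groups by repeatedly assigning each value to
    the nearest centre and then moving each centre to its group's mean."""
    centers = [min(values), max(values)]
    groups = [[], []]
    for _ in range(iterations):
        groups = [[], []]
        for v in values:
            nearest = 0 if abs(v - centers[0]) <= abs(v - centers[1]) else 1
            groups[nearest].append(v)
        # Keep the old centre if a group happens to be empty.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

# Hypothetical yearly spend per customer; no groups are defined in advance.
spend = [10, 12, 11, 95, 100, 105]
centers, groups = two_means(spend)
print(centers, groups)
```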

12
Q

Forecasting

A

Uses predictions in a different way than the other types: it uses a series of existing values to forecast what other values will be.
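
One of the simplest ways to forecast from existing values is a moving average. The sketch below assumes made-up monthly sales figures; `moving_average_forecast` is a hypothetical helper and only one of many forecasting methods.

```python
def moving_average_forecast(series, window=3):
    """Forecast the next value as the mean of the last `window` values."""
    recent = series[-window:]
    return sum(recent) / len(recent)

# Hypothetical monthly sales.
monthly_sales = [100, 110, 120, 130]
forecast = moving_average_forecast(monthly_sales)
print(forecast)
```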

13
Q

Text mining

A

Extracts key elements from unstructured big data sets (text, as opposed to structured data), discovers patterns and trends, and summarizes the information.

14
Q

Sentiment analysis

A

Mines text comments in e-mails and similar sources to detect favourable and unfavourable opinions.
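
A minimal word-list approach gives a feel for how opinions can be detected in text. The word lists and the `sentiment` function are assumptions made for this example; production tools use much larger lexicons or trained models.

```python
import re

# Tiny hypothetical opinion lexicons.
POSITIVE = {"great", "love", "fast", "helpful"}
NEGATIVE = {"broken", "slow", "hate", "refund"}

def sentiment(comment):
    """Label a comment by counting positive and negative words."""
    words = re.findall(r"[a-z']+", comment.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "favourable"
    if score < 0:
        return "unfavourable"
    return "neutral"

print(sentiment("Love the fast delivery!"))
print(sentiment("Arrived broken, I want a refund"))
```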

15
Q

Web mining

A

Discovery and analysis of useful patterns and information from the web (unstructured data).

16
Q

Web content mining

A

Process of extracting knowledge from the content of web pages.

17
Q

Web structure mining

A

Examines data related to the structure of a particular website.

18
Q

Web usage mining

A

Examines user interaction data recorded by a web server whenever requests for a website's resources are received.
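
As an illustration, the sketch below counts page hits and distinct visitors from a few web server log lines. It assumes the lines follow the Common Log Format; the sample entries (using documentation-range IP addresses) and the variable names are invented.

```python
import re
from collections import Counter

# Matches: client IP, two ignored fields, [timestamp], "METHOD path protocol", status, size.
LOG_PATTERN = re.compile(
    r'^(\S+) \S+ \S+ \[[^\]]+\] "(\S+) (\S+) [^"]*" (\d{3}) \S+'
)

# Hypothetical log entries recorded by a web server.
log_lines = [
    '203.0.113.5 - - [10/Oct/2023:13:55:36 +0000] "GET /products HTTP/1.1" 200 2326',
    '203.0.113.5 - - [10/Oct/2023:13:56:01 +0000] "GET /cart HTTP/1.1" 200 512',
    '198.51.100.7 - - [10/Oct/2023:14:02:10 +0000] "GET /products HTTP/1.1" 200 2326',
]

hits = Counter()     # requests per resource
visitors = set()     # distinct client addresses
for line in log_lines:
    match = LOG_PATTERN.match(line)
    if match:
        ip, method, path, status = match.groups()
        hits[path] += 1
        visitors.add(ip)

print(hits.most_common(), len(visitors))
```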

19
Q

Privacy

A

The claim of individuals to be left alone, free from surveillance or interference from other individuals or organizations.
Customers must provide their informed consent before any company can legally use data about them and they have the right to access that information, correct it, and request that no further data be collected.