Lecture 9 - Big Data and data science Flashcards

1
Q

Give examples of SQL databases

A

MySQL, SQL Server, PostgreSQL, Vertica

2
Q

Give an example of a NoSQL database

A

Graph database (for example, the RDF store behind DBpedia)

3
Q

What does API stand for?

A

Application Programming Interface

4
Q

Under which circumstances should SQL / NoSQL be used?

A

SQL: data is structured and its schema rarely changes
NoSQL:
- storing a large volume of data with little structure
- data (or its structure) changes rapidly
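
As a rough illustration of the contrast above, the sketch below stores the same kind of record twice: once in a relational table with a fixed schema (using Python's built-in sqlite3 module) and once as schema-free JSON documents. The table name, field names, and sample records are made up for this example, and the list of JSON strings is only a stand-in for a real document store.

```python
import json
import sqlite3

# SQL side: the schema is declared up front and every row must fit it.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT NOT NULL, year INTEGER NOT NULL)"
)
conn.executemany(
    "INSERT INTO students (id, name, year) VALUES (?, ?, ?)",
    [(1, "Ada", 2), (2, "Linus", 3)],
)
third_years = [row[0] for row in conn.execute("SELECT name FROM students WHERE year = 3")]
conn.close()

# NoSQL side (document style): each record is a free-form document, so
# fields can differ from one record to the next without a schema change.
# A plain list of JSON strings stands in for a document database here.
documents = [
    json.dumps({"id": 1, "name": "Ada", "year": 2}),
    json.dumps({"id": 2, "name": "Linus", "tags": ["kernel"], "links": {"web": "example.org"}}),
]
with_tags = [doc["name"] for doc in map(json.loads, documents) if "tags" in doc]
```

Adding a new field on the SQL side would need a schema change such as ALTER TABLE, while the document side simply accepts records that carry extra or missing fields.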

5
Q

What is Hadoop?

A

Open-source Java framework that implements Map-Reduce (together with HDFS, a distributed file system for storing the data)

6
Q

What is Map-Reduce?

A

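Framework to distribute/parallelize processing tasks across multiple computers: a map step processes each chunk of the input independently, and a reduce step combines the intermediate results per key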
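
The idea can be sketched with the classic word-count example. This is a minimal single-process illustration, not Hadoop's API: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are hypothetical, and in a real cluster each chunk would be mapped on a different machine.

```python
from collections import defaultdict

def map_phase(chunk):
    """Map: emit a (word, 1) pair for every word in one chunk of text."""
    return [(word.lower(), 1) for word in chunk.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)

def word_count(chunks):
    # Each chunk is mapped independently, which is what makes the map
    # step easy to spread across machines; here they run sequentially.
    mapped = [pair for chunk in chunks for pair in map_phase(chunk)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())

counts = word_count(["big data big", "data science"])
print(counts)
```

Because the map calls do not depend on each other and each reduce call only needs the values for its own key, both steps can run in parallel on separate computers.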

7
Q

What is Apache Spark?

A

Open-source cluster computing engine, often described as the “successor” of Hadoop's Map-Reduce

8
Q

Name the advantages of Spark over Hadoop

A
  • provides in-memory processing, with near-real-time results
  • often much faster than Hadoop, which writes intermediate results to disk
    -> suitable for streaming real-time data
9
Q

Why SaaS?

A
  • pay as you go
  • scale up/down
  • low maintenance
  • performance, better infrastructure

(disadvantage: data privacy)
