L1 - Introduction Flashcards
1
Q
What is Big Data?
A
Big Data is high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making (Source: Gartner)
2
Q
Why can we now use Big Data?
A
- Now have ways of storing and retrieving large amounts of Data –> greater computing powers
- Memory requirements can now me met
- Parallel and Cloud computing
- we have good algorithms now
3
Q
What are the three different types of Machine Learning Algorithms?
A
—Supervised Learning
•The dataset is divided into training and testing –> ( it has been labelled by a group of people of what is meant to be in the image)
—Unsupervised Learning
•The data are not prepared or labelled, the algorithm learns by similarity
—Semi-Supervised Learning