Module 17 Flashcards
What is Big Data?
term used to describe extremely large volumes of information within a business
What are the four Vs of Big Data?
Volume
Velocity
Variety
Veracity - must be reliable
What is tagging data?
attaching labels to data to assist with finding it later on
e.g. timestamp, location-based tag, keyword
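A minimal sketch of tagging, assuming records are held as plain Python dictionaries. The record names, the field names (`timestamp`, `location`, `keywords`) and the two helper functions are made up for illustration and are not from the module.

```python
from datetime import datetime, timezone

# Hypothetical records: each one carries a timestamp, a location tag and keywords.
records = [
    {
        "name": "site_photo.jpg",
        "tags": {
            "timestamp": datetime(2023, 5, 1, 9, 30, tzinfo=timezone.utc),
            "location": "Leeds",
            "keywords": {"survey", "roof"},
        },
    },
    {
        "name": "invoice_041.pdf",
        "tags": {
            "timestamp": datetime(2023, 5, 2, 14, 0, tzinfo=timezone.utc),
            "location": "York",
            "keywords": {"invoice", "roof"},
        },
    },
]


def find_by_keyword(items, keyword):
    """Return the names of all items tagged with the given keyword."""
    return [item["name"] for item in items if keyword in item["tags"]["keywords"]]


def find_by_location(items, location):
    """Return the names of all items tagged with the given location."""
    return [item["name"] for item in items if item["tags"]["location"] == location]


roof_items = find_by_keyword(records, "roof")
york_items = find_by_location(records, "York")
```

The tags are what make the later search possible: the lookup never has to open the files themselves.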
What must a data retention policy detail?
how long the data should be stored for, how it is stored, and how it is kept secure
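One way to picture a retention policy is as a small record holding those three details. This is only a sketch: the class name `RetentionPolicy`, its fields and the 365-day figure are assumptions for illustration, not values from the module.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class RetentionPolicy:
    retention_days: int  # how long the data should be stored for
    storage: str         # how it is stored
    encrypted: bool      # security requirement

    def is_expired(self, created: date, today: date) -> bool:
        """True once the data has been kept longer than the policy allows."""
        return today > created + timedelta(days=self.retention_days)


# Hypothetical policy for invoices.
invoice_policy = RetentionPolicy(
    retention_days=365, storage="archive database", encrypted=True
)

old_record_expired = invoice_policy.is_expired(date(2022, 1, 1), date(2023, 6, 1))
new_record_expired = invoice_policy.is_expired(date(2023, 1, 1), date(2023, 6, 1))
```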
How can data be stored?
list, text file, array, stack, queue, index
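A short sketch of how these storage structures behave, using only the Python standard library. The variable names and sample values are made up for illustration.

```python
import tempfile
from collections import deque
from pathlib import Path

# List / array: ordered items accessed by position.
readings = [12, 15, 11]
first_reading = readings[0]

# Stack: last in, first out.
stack = []
stack.append("a")
stack.append("b")
top = stack.pop()

# Queue: first in, first out.
queue = deque()
queue.append("job1")
queue.append("job2")
next_job = queue.popleft()

# Index: a lookup from a key to the position where the data is held.
names = ["Ann", "Bob", "Cy"]
index = {name: position for position, name in enumerate(names)}
bob_position = index["Bob"]

# Text file: values written out as lines of text and read back in.
with tempfile.TemporaryDirectory() as folder:
    path = Path(folder) / "readings.txt"
    path.write_text("\n".join(str(r) for r in readings), encoding="utf-8")
    restored = [int(line) for line in path.read_text(encoding="utf-8").splitlines()]
```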
What are the three levels of modelling?
conceptual - mapping between information. also known as data normalisation
logical - actual tables and columns to be used in the system
physical - storage of the data. most detailed level: spaces between data, software etc.
the end user doesn’t see the logical and physical levels!
What does a database ensure?
data integrity, concurrency control, fault tolerance
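A minimal sketch of the integrity part, assuming an in-memory SQLite database through Python's `sqlite3` module. The `account` table, the rule that a balance cannot go negative and the `withdraw` helper are all invented for illustration. Concurrency control across several users is not shown here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The CHECK constraint is the integrity rule: a balance may never go negative.
conn.execute(
    "CREATE TABLE account ("
    "id INTEGER PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.execute("INSERT INTO account (id, balance) VALUES (1, 100)")
conn.commit()


def withdraw(connection, account_id, amount):
    """Return True if the withdrawal was stored, False if the database refused it."""
    try:
        # Using the connection as a context manager commits on success
        # and rolls back if an error is raised inside the block.
        with connection:
            connection.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?",
                (amount, account_id),
            )
        return True
    except sqlite3.IntegrityError:
        return False


first_ok = withdraw(conn, 1, 30)    # allowed: leaves 70
second_ok = withdraw(conn, 1, 500)  # refused: would make the balance negative
balance = conn.execute("SELECT balance FROM account WHERE id = 1").fetchone()[0]
```

The refused update leaves the stored balance untouched, which is the point of letting the database enforce the rule rather than the application.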
What is a storage area network?
a group of data storage devices networked together so that location doesn’t matter.
What is Network attached storage?
a storage device attached to a network that appears as remote storage
What is data mining?
identifying patterns or trends from a large data set.
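A toy illustration of finding a pattern in a data set: counting which pairs of items are bought together most often. The shopping baskets and the `frequent_pairs` helper are made up for this sketch, and real data mining works on far larger data sets with more sophisticated methods.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, one set of items per transaction.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "jam"},
    {"bread", "milk", "jam"},
]


def frequent_pairs(transactions, min_count):
    """Return the item pairs that appear together in at least min_count baskets."""
    counts = Counter()
    for basket in transactions:
        # Sorting gives each pair a consistent order, e.g. ("bread", "milk").
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_count}


patterns = frequent_pairs(baskets, min_count=2)
```

Here the only pair that clears the threshold is bread and milk, which appear together in three of the four baskets.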
What is the knowledge discovery in databases process?
- business understanding
- data understanding
- data preparation
- data mining/modelling
- evaluation
- deployment
What are the features of a supercomputer?
- powerful
- expensive
- fast to process
- massive data manipulation
What are the features of mainframes?
- powerful
- expensive
- allows many concurrent users
- operates at high speed
- users: manufacturers, insurance, airlines
What are the features of servers?
- simultaneous multiple users
- running networks and internet applications
- large memory/storage
- fast and efficient
- can fail
What are the features of Microcomputers?
- personal computers
- common
- easy to use
- can be networked together
- often break