1. Database Management - Segment 1 [ Week 1 & 2- Data and Data Sources] Flashcards
On which Data analysis is relies on ?
Data analysis relies on Data & Data sources.
In which form data is collected by Various Data Sources ?
Data is in raw format
What are the Different types of data .
- Scientific data
- Multimedia data
- Transactional data or structure data
- Relational data
- Web data
- Flat files data
What is scientific data ?
Data that comes from various sensors and scientific equipments.
What is multimedia data ?
The data that comes from cameras, satellite images, videos, and CCTV footage are referred to as multimedia data. They typically contain audio and video content over a period of time.
What is transactional data or structured data ?
Predefined or prestructured data taken at different time stands
What is relational data ?
Data comes in row and column format
What is web data ?
Collected by Web scrapping ,
What is Flat files data?
csv or excel files , stored in a local system
What is big data ?
Non-relational and non-structure data are generally referred to as big data
What are the Types of Storage Based on the Connectivity ?
- Device-Attached Storage (DAS)
- Network-Attached Storage (NAS)
- Storage Area Network (SAN)
What is DAS ?
Device attached storage
the file system and disk storage are directly connected.are available in the same physical location.
What is NAS ?
In NAS, the file system and disk storage are available remotely
What is SAN ?
In SAN, only the disk storage is remote. The file system accesses the storage over the network.The file system is in the system itself
What is the basic difference between SAN & NAS ?
The basic difference between SAN & NAS is ,
in NAS both file system and storage are on remote site,
But in SAN storage is on remote side and the file system is on he system itself
NAS is a single storage device while a SAN is a tightly coupled network of multiple devices .
NAS devices deliver shared storage as network mounted volumes and use protocols like NFS and SMB/CIFS,
while SAN-connected disks appear to the user as local drives.
What are the Types of Storage Based on the Location of Nodes ?
It is a two types -
* 1. Warehouse storage/ On-premise storage:
* 2. Cloud storage
Define Storage Based on the Location of Nodes ?
- Warehouse storage/ On-premise storage -Nodes are present in the same physical location. This will ensure that accessing data is quick and network-delays not impact applications.
- Cloud storage - Data is stored on cloud nodes.Cloud storage is always less expensive compared to physical storage.The real-time data can be ingested and stored directly into cloud storage, scaling both in and out in response to data volume.
What is Hadoop Model ?
Hadoop is an open-source framework for processing large datasets. Hadoop uses a unique file system called Hadoop Distributed File System (HDFS). Internally, this file system can be connected to any type of storage model- DAS,NAS SAN
HDFS provides an abstraction. As a result, the storage appears as a locally attached disk.
What is HDFS ?
HDFS - Hadoop Distributed File System
It is a file system used in Hadoop model
What is the good solution to handle big data.
Hadoop
It provides scaling of storage as the data continues to grow
What are the basic requirements of big data?
- Type of storage used
- Handle large amount of data
- It should continue to scale as the data continuous to grow