Unit 2 Flashcards
Big Data
Vast amounts of structured, semi-structured, and unstructured data are characterized by its volume, velocity, variety, and value, which, when analyzed, can provide competitive advantages and drive digital transformations.
Big Data Cluster
A distributed computing environment comprising thousands or tens of thousands of interconnected computers that collectively store and process large datasets.
Broad Network Access
Access to the Cloud is available using a wide variety of client devices, such as PCs, laptops, tablets, and smartphones.
Cloud Computing
The delivery of on-demand computing resources, including networks, servers, storage, applications, services, and data centers, over the Internet on a pay-for-use basis.
Infrastructure as a Service (IaaS)
A cloud service model that provides access to computing infrastructure, including servers, storage, and networking, without the need for users to manage or operate them.
Software as a Service (SaaS)
A form of cloud computing where a firm subscribes to a third-party software and receives a service that is delivered online.
Platform as a Service (PaaS)
A cloud service in which consumers can install and run their own specialized applications on the cloud computing network.
Data Replication
A strategy in which data is duplicated across multiple nodes in a cluster to ensure data durability and availability, reducing the risk of data loss due to hardware failures.
Deep Learning
Involves artificial neural networks inspired by the human brain, capable of learning and making complex decisions from data on their own.
Digital Transformation
A strategic and cultural organizational change driven by data science, especially Big Data, to integrate digital technology across all areas of the organization, resulting in fundamental operational and value delivery changes.
Distributed Data
The practice of dividing data into smaller chunks and distributing them across multiple computers within a cluster enables parallel processing for data analysis.
Measured Service
A characteristic where users are billed for cloud resources based on their actual usage, with resource utilization transparently monitored, measured, and reported.
On-demand Self Service
The capability for users to access and provision cloud resources such as processing power, storage, and networking using simple interfaces without human interaction with service providers
Rapid Elasticity
The ability to quickly scale cloud resources up or down based on demand, allowing users to access more resources when needed and release them when not in use.
Resource Pooling
A cloud characteristic where computing resources are shared and dynamically assigned to multiple consumers, promoting economies of scale and cost-efficiency.