Unit 2 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Big Data

A

Vast amounts of structured, semi-structured, and unstructured data are characterized by its volume, velocity, variety, and value, which, when analyzed, can provide competitive advantages and drive digital transformations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Big Data Cluster

A

A distributed computing environment comprising thousands or tens of thousands of interconnected computers that collectively store and process large datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Broad Network Access

A

Access to the Cloud is available using a wide variety of client devices, such as PCs, laptops, tablets, and smartphones.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Cloud Computing

A

The delivery of on-demand computing resources, including networks, servers, storage, applications, services, and data centers, over the Internet on a pay-for-use basis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Infrastructure as a Service (IaaS)

A

A cloud service model that provides access to computing infrastructure, including servers, storage, and networking, without the need for users to manage or operate them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Software as a Service (SaaS)

A

A form of cloud computing where a firm subscribes to a third-party software and receives a service that is delivered online.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Platform as a Service (PaaS)

A

A cloud service in which consumers can install and run their own specialized applications on the cloud computing network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data Replication

A

A strategy in which data is duplicated across multiple nodes in a cluster to ensure data durability and availability, reducing the risk of data loss due to hardware failures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Deep Learning

A

Involves artificial neural networks inspired by the human brain, capable of learning and making complex decisions from data on their own.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Digital Transformation

A

A strategic and cultural organizational change driven by data science, especially Big Data, to integrate digital technology across all areas of the organization, resulting in fundamental operational and value delivery changes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Distributed Data

A

The practice of dividing data into smaller chunks and distributing them across multiple computers within a cluster enables parallel processing for data analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Measured Service

A

A characteristic where users are billed for cloud resources based on their actual usage, with resource utilization transparently monitored, measured, and reported.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

On-demand Self Service

A

The capability for users to access and provision cloud resources such as processing power, storage, and networking using simple interfaces without human interaction with service providers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Rapid Elasticity

A

The ability to quickly scale cloud resources up or down based on demand, allowing users to access more resources when needed and release them when not in use.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Resource Pooling

A

A cloud characteristic where computing resources are shared and dynamically assigned to multiple consumers, promoting economies of scale and cost-efficiency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Variety

A

The diversity of data types, including structured and unstructured data from various sources such as text, images, video, and more, posing data management challenges.

17
Q

Velocity

A

The speed at which data accumulates and is generated, often in real-time or near-real-time, drives the need for rapid data processing and analytics.

18
Q

Veracity

A

The quality and accuracy of data, ensuring that it conforms to facts and is consistent, complete, and free from ambiguity, impacts data reliability and trustworthiness.

19
Q

Volume

A

The scale of data generated and stored is driven by increased data sources, higher-resolution sensors, and scalable infrastructure.

20
Q

Artificial Neural Network

A

Collections of small computing units (neurons) that process data and learn to make decisions over time.

21
Q

Cluster Analysis

A

The process of grouping similar data points together based on certain features or attributes.

22
Q

Data Mining

A

The process of automatically searching and analyzing data to discover patterns and insights that were previously unknown.

23
Q

Decision Trees

A

A type of machine learning algorithm used for decision-making by creating a tree-like structure of decisions.

24
Q

Generative AI

A

A subset of AI that focuses on creating new data, such as images, music, text, or code, rather than just analyzing existing data.

25
Q

Natural Language Processing (NLP)

A

A field of AI that enables machines to understand, generate, and interact with human language, revolutionizing content creation and chatbots.

26
Q

Predictive Analytics

A

Using machine learning techniques to predict future outcomes or events.

27
Q

Synthetic Data

A

Artificially generated data with properties similar to real data, used by data scientists to augment their datasets and improve model training.