U5 Flashcards
As its name suggests, “big data” refers to huge and fast-growing data. Big data was initially:
attributed to search engines and social networks, and is now making its way into enterprises.
There exist several challenges when working with big data, including?
how to store it and how to process it.
Among these challenges is enabling the databases to meet the needs of?
high concurrent reading and writing with low latency.
there is an immense need to lower the costs of storing big data?
Because, with the dramatic increase in data, database costs,
e.g., hardware,
software,
and operating costs,
have accordingly increased.
The traditional relational databases, i.e., those queried using the structured query language (SQL), are a?
collection of data items with pre-defined relationships between them.
These items are organized as a set of tables with:
columns and rows.
Unfortunately, these relational databases have some inherent limitations which emerge with:
the rapid growth of data.
In these cases, relational databases are:
highly prone to deadlocks and other concurrency issues.
These situations lead to rapid declines in?
the efficiency of reading and writing.
Furthermore, the multi-table correlation mechanism that exists in —————— represents a major limitation of database scalability. To overcome these problems, —————— databases were proposed instead of the traditional database. NoSQL is an —————— term for —————— databases which do not use the SQL structure.
relational database
NoSQL
umbrella
non-relational
NoSQL databases are useful for ?
applications that deal with very large semi-structured and unstructured data.
Unlike relational databases, NoSQL databases are designed to ?
scale horizontally and can be hosted on a cluster of processors.
In most of these databases, each row is a ?
key-value pair.
NoSQL databases include truly elastic databases, e.g., MongoDB and Cassandra, which allow?
the addition/removal of nodes to/from a cluster without any observable down-time for the clients.
To this end, routing algorithms are used to decide when to move the inter-related data chunks, for instance, ?
when data must be moved to newly added node B. During the copying process, the data is served from the original node A. When the new node B has an up-to-date version of the data, the routing processes start to send requests to the node B.
In general, there are some important aspects related to distributed databases that need to be thoroughly addressed, including :
scalability,
availability,
and consistency.
First, scaling is typically achieved through?
“sharding” to meet the data volume.
Sharding is ?
a type of database partitioning that separates very large databases into smaller, faster, more easily managed parts, referred to as data shards.
NoSQL databases support an auto-sharding mode in which?
the shards are automatically balanced across the nodes on a cluster.
Additional nodes can be easily added as ?
necessary to the cluster to align with data volume.
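The sharding cards above can be made concrete with a small sketch. The following Python illustration of hash-based sharding is a minimal, hypothetical example (the node names and the MD5-based scheme are assumptions for illustration, not any specific NoSQL product's algorithm): the shard for a record is derived from a stable hash of its key, so records spread roughly evenly across the cluster.

```python
import hashlib

# Hypothetical 4-node cluster; real systems also rebalance existing
# shards when nodes are added or removed (auto-sharding).
NODES = ["node-0", "node-1", "node-2", "node-3"]

def shard_for(key: str, nodes=NODES) -> str:
    """Map a record key to a shard/node using a stable hash of the key."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return nodes[int(digest, 16) % len(nodes)]

if __name__ == "__main__":
    for sensor_id in ("sensor-17", "sensor-42", "sensor-99"):
        print(sensor_id, "->", shard_for(sensor_id))
```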
Second, availability can be achieved via replication, i.e., ?
master-slave replication or peer-to-peer replication.
With master-slave replication, two types of nodes are typically implemented, including:
a master node, to which all the write operations go, and slave nodes.
Data can be read from any node, either a ——————. If a master node goes down, a slave node gets promoted to a ——————, and continues to replicate to the ——————.
master or a slave.
master node
third node
When a failed master node is resurrected, it joins the cluster as a slave. Alternatively,?
peer-to-peer replication is slightly more complex, as all the nodes receive read/write requests.
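To make the replication cards concrete, here is a toy Python sketch (the names and failover behavior are simplified assumptions, not any particular database's implementation) of how a client-side router might direct writes to the master, serve reads from any node, and promote a slave when the master fails:

```python
import random

class ReplicaSet:
    """Toy master-slave request router; illustrative only."""

    def __init__(self, master, slaves):
        self.master = master
        self.slaves = list(slaves)

    def route_write(self):
        return self.master  # all write operations go to the master

    def route_read(self):
        # Reads may be served by any node, master or slave.
        return random.choice([self.master] + self.slaves)

    def fail_master(self):
        # Promote one slave to master; the failed master would later
        # rejoin the cluster as a slave.
        self.master = self.slaves.pop(0)

rs = ReplicaSet("node-A", ["node-B", "node-C"])
print("write ->", rs.route_write())
rs.fail_master()
print("write after failover ->", rs.route_write())
```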
In terms of consistency, two major types of inconsistencies exist:
read and write.
Read inconsistencies arise in —————— replication when a user tries to read from a —————— before changes propagate from the ——————, while in —————— replication the user runs into both read and write inconsistencies, as writes (updates) are allowed on ——————.
master/slave
slave
master node
peer-to-peer
multiple nodes
It is obvious that availability and consistency are ?
two contradicting metrics.
Achieving the right balance between these metrics highly depends on ?
the nature of the IoT application.
For example,
a user can prohibit read and write inconsistencies by treating slaves as hot standbys without reading from them.
MongoDB is a prominent example of a document-oriented, scalable NoSQL database system which has?
a powerful query language.
MongoDB supports complex data types, e.g.,?
BSON data structures.
It allows most functions like?
- single-table queries as in relational databases,
- and it also supports indexing.
- Furthermore, MongoDB has the advantage of supporting high-speed access to mass data.
When the stored data exceeds 50 GB, the access speed of MongoDB is?
ten times higher than MySQL (Yan, 2015).
Thanks to these characteristics, many system designers are?
considering MongoDB instead of relational databases.
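As a hedged illustration of MongoDB's document model and indexing, the following sketch uses the `pymongo` driver; it assumes a MongoDB server running on localhost, and the database and collection names are made up for this example:

```python
from pymongo import MongoClient

# Connect to a (assumed) local MongoDB server.
client = MongoClient("mongodb://localhost:27017/")
readings = client["iot_demo"]["sensor_readings"]

# Documents are schema-free BSON, so semi-structured data fits naturally.
readings.insert_one({"sensor": "temp-01", "year": 2020, "value": 31.4})

# Secondary index on (sensor, year) to speed up queries.
readings.create_index([("sensor", 1), ("year", 1)])

for doc in readings.find({"sensor": "temp-01"}, {"_id": 0}):
    print(doc)
```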
Another example of a NoSQL database is?
Apache Cassandra. It offers good scalability and high availability without compromising performance.
Cassandra demonstrated?
fault-tolerance on commodity hardware (i.e., cloud infrastructures)
and linear scalability,
thus making it the ideal platform for mission-critical data.
Cassandra features allow?
replication across multiple datacenters,
offering lower latency for data availability during regional outages.
With Cassandra, columns can be easily indexed with ?
a powerful built-in caching mechanism.
Netflix, Twitter, Urban Airship, Reddit, Cisco, OpenX, and Digg are examples of the companies that use?
Cassandra to deal with huge, active, online interactive datasets.
The largest known Cassandra cluster has over?
300 TB (terabytes) of information on over 400 machines.
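For comparison, a minimal Cassandra sketch using the DataStax `cassandra-driver` package is shown below; it assumes a reachable Cassandra node on localhost, and the keyspace and table are invented for illustration:

```python
from cassandra.cluster import Cluster

cluster = Cluster(["127.0.0.1"])     # assumed local Cassandra node
session = cluster.connect()

# Single-datacenter keyspace for demo purposes only.
session.execute("""
    CREATE KEYSPACE IF NOT EXISTS iot_demo
    WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}
""")
session.execute("""
    CREATE TABLE IF NOT EXISTS iot_demo.readings (
        sensor text, ts timestamp, value double,
        PRIMARY KEY (sensor, ts))
""")

# Insert one reading and read it back.
session.execute(
    "INSERT INTO iot_demo.readings (sensor, ts, value) "
    "VALUES (%s, toTimestamp(now()), %s)",
    ("temp-01", 31.4),
)
for row in session.execute(
        "SELECT * FROM iot_demo.readings WHERE sensor = %s", ["temp-01"]):
    print(row.sensor, row.ts, row.value)
```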
Processing a massive amount of data, i.e., big data, demands?
a shift from the client-server model of data processing, where a client node pulls the data from a server node.
Instead, data can be processed on the ——————. In addition, this processing can be carried out independently in parallel, as the underlying data is already —————— across different nodes.
cluster nodes
partitioned
This approach of data processing is referred to as ?
the MapReduce framework and it also interestingly uses key-value pairs.
MapReduce makes use of hundreds or even thousands of “pluggable” nodes in a cluster to?
process data in parallel, which significantly shortens the time between the operational events and presenting the analytics results.
The —————— framework offers an effective method for the efficient analysis of the collected —————— data, especially when the computations involve linearly computable —————— functions over the elements of the data streams, e.g.,
——————.
MapReduce
sensor
statistical
MIN, MAX, SUM, and MEAN
Google’s original MapReduce framework was designed for?
analyzing large amounts of web logs, and more specifically deriving such linearly computable statistics from the logs.
In fact, the sensor-generated data has many conceptual similarities to web logs. Specifically,?
they are similarly repetitive, and the typical statistical computations which are often performed on sensor data for many applications are linear in nature.
this framework represents an ideal candidate for sensor data analytics?
Because sensor-generated data resembles web logs, and the statistical computations often performed on it are linear in nature.
The figure below demonstrates the MapReduce architecture for processing sensor data in parallel on different processing nodes.
To understand the MapReduce framework, consider a case where?
the maximum temperature each year is to be determined from sensor data recorded over a long period of time.
To this end, the “Map” and “Reduce” functions of MapReduce are defined with respect to data structured in (key, value) pairs.
For example,
the data can be in the form of (year, value) where the year is the key.
The Map function takes a list of pairs (year, value) from one domain and then returns a list of pairs (year, local max value).
The local max value denotes the local maximum in the subset of the data processed by that node.
This computation is typically performed in parallel by dividing the key-value pairs across different distributed computers.
Subsequently, the MapReduce framework combines ?
all pairs with the same key from all lists, thus creating one group for each one of the different generated keys.
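The Map and grouping (shuffle) steps can be sketched in plain Python for the max-temperature example; this is a single-process illustration of logic that a real framework would run across many nodes:

```python
from collections import defaultdict

def map_step(records):
    """Emit (year, local_max) pairs for one node's subset of (year, value) data."""
    local_max = {}
    for year, value in records:
        local_max[year] = max(value, local_max.get(year, float("-inf")))
    return list(local_max.items())

# Two "nodes", each holding a partition of the sensor data.
node_a = [(2019, 30.1), (2020, 28.4), (2019, 33.7)]
node_b = [(2020, 35.2), (2019, 29.9)]

# Shuffle: group all (year, local_max) pairs by key across the nodes.
groups = defaultdict(list)
for year, local_max in map_step(node_a) + map_step(node_b):
    groups[year].append(local_max)

print(dict(groups))   # {2019: [33.7, 29.9], 2020: [28.4, 35.2]}
```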
This grouping step requires —————— between the different ——————. However, the cost of this communication is much lower than moving the —————— around because the —————— has already generated a —————— summary of the processed data.
communication
computers
original data
Map step
compact
It is worth mentioning that the exact implementation of the Map step widely depends upon?
the implementation of the adopted MapReduce,
and also on the exact nature of the distributed data.
For instance, the sensor data may be distributed over a local cluster of computers (with the use of an implementation such as Hadoop). An alternative solution is to geographically distribute the sensor data because?
the data is originally created at different locations, and it is too expensive to move the data around.
The latter scenario is much more suited for —————— applications. Nevertheless, the steps for collecting the intermediate results from the different Map steps may depend upon the specific —————— in which the MapReduce framework is ——————.
IoT
implementation and scenario
utilized
After performing the grouping step, the Reduce function is applied in?
parallel to each group.
Such a step generates a collection of?
values in the same domain.
Next, we apply Reduce —————— in order to create list——————.
(k2, list(V2))
(v3)
Each execution of the Reduce function returns only one value, although it is also?
possible for the function to return more than one value.
For instance,
the input to the “Reduce” function will be a list in the form (Year, [local max1, local max2, …, local maxr]),
where the local maximum values are determined by the execution of the different Map functions.
The Reduce function determines the maximum value over the corresponding list in each call of the Reduce function.
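Continuing the single-process sketch above, the Reduce step takes each (year, [local maxima]) group produced by the shuffle and computes the global maximum per year; this standalone snippet hard-codes the shuffle output for clarity:

```python
def reduce_step(year, local_maxima):
    """Return the global maximum for one year's list of local maxima."""
    return year, max(local_maxima)

groups = {2019: [33.7, 29.9], 2020: [28.4, 35.2]}  # output of the shuffle
results = [reduce_step(year, maxima) for year, maxima in groups.items()]
print(results)   # [(2019, 33.7), (2020, 35.2)]
```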
A Hadoop client typically submits jobs to the MapReduce framework through what is called?
the “jobtracker” running on the master server.
Subsequently, the —————— automatically assigns the jobs to —————— running on many ——————.
jobtracker
“tasktrackers”
slave nodes
The tasktrackers regularly send heartbeats to the jobtracker to update the status, e.g.,?
alive, idle or busy.
If a job fails or times out, or a node dies, the jobtracker can automatically reschedule?
the jobs to run on available nodes.
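A toy sketch of this heartbeat-based failure detection is shown below; it is loosely modeled on the jobtracker/tasktracker interaction, and the timeout value is an arbitrary assumption rather than Hadoop's actual configuration:

```python
import time

HEARTBEAT_TIMEOUT = 10.0  # seconds; assumed value, not Hadoop's default

class JobTracker:
    """Toy tracker that marks tasktrackers dead after a heartbeat timeout."""

    def __init__(self):
        self.last_heartbeat = {}   # tasktracker name -> last heartbeat time

    def heartbeat(self, tracker, status="alive"):
        # Tasktrackers regularly report status, e.g., alive, idle, or busy.
        self.last_heartbeat[tracker] = time.time()

    def dead_trackers(self):
        now = time.time()
        return [t for t, ts in self.last_heartbeat.items()
                if now - ts > HEARTBEAT_TIMEOUT]

jt = JobTracker()
jt.heartbeat("tasktracker-1")
# Jobs on any tracker in jt.dead_trackers() would be rescheduled onto
# the remaining healthy nodes.
print(jt.dead_trackers())
```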
In general, HDFS comprises two components, namely ?
name-nodes
and data-nodes.
A name-node is?
responsible for keeping the metadata about the data on each data-node.
When a client application reads or writes data into HDFS, it must communicate with?
the name-node to get the locations of the data blocks to be read from or written to.
The metadata is read into main memory when ?
Hadoop starts, and is dynamically maintained.
A data-node updates the name-node with the metadata of its local data blocks through?
heartbeats.
Hadoop also has?
a secondary name-node mainly used to store the latest checkpoints of HDFS states.
Although the Hadoop MapReduce framework has the goal of
high scalability
and better fault-tolerance,
it is not ?
optimized for input/output efficiency. Specifically, both the Map and Reduce functions are “block operations” in which data transition cannot proceed to the next stage until the tasks of the current stage have finished.
Accordingly, the output of mappers needs to be ?
first written into HDFS before being shuffled to the reducers.
Such blocking, the one-to-one shuffling strategy, and the runtime scheduling ?
degrade the performance of each node.
The MapReduce framework lacks:
a database management system
and does not optimize data transfer across various nodes.
it is more suitable for batch jobs than real-time processing?
Because Hadoop has an inherent latency problem.
Large-scale IoT applications,
e.g., traffic monitoring,
weather forecasting,
homeland security,
entertainment,
and disaster response, often have ?
the challenge of capturing too much data with too little inter-operability.
Such challenges are accompanied with ?
too little knowledge about the ability to utilize different resources which are available in real time.
To overcome these challenges, the Sensor Web Enablement initiative ?
defines service interfaces which enable developers to make all types of sensors, transducers, and sensor data repositories discoverable, accessible, and usable via the Web.
Such standardized interfaces are extremely beneficial since?
they hide the heterogeneity of the underlying IoT devices from the applications that use them.
In this context, the term “Sensor Web” defines ?
an infrastructure enabling access to IoT devices and archived sensor data.
Such data can readily be discovered and accessed using ——————. The goal of the Sensor Web is to enable real-time “——————” in order to ensure timely —————— to a wide variety of events.
standard protocols and APIs
situation awareness
responses
The major benefits of the IoT sensor data can only be realized if we have?
the infrastructure and mechanisms to synthesize, interpret, and apply this data intelligently via automated means.
The Sensor Web enables automated applications to
understand,
interpret,
and reason with basic but critical semantic notions such as ?
“nearby,” “far,” “soon,” “immediately,” “dangerously high,” “safe,” “blocked,” or “smooth.” Ontologies are at the heart of the semantic sensor web technology.
An ontology is a mechanism for?
knowledge sharing and reuse.
Ontologies are generally knowledge representation systems. To represent?
resources, the Resource Description Framework (RDF) data model is widely used.
Literally, a resource is ?
any device or concept, e.g., person, place, restaurant. Each resource is uniquely identified by a URI.
Aside from describing resources, RDFs are capable of ?
specifying how resources are inter-related through performing inference.
The building blocks of RDF are triples, where a triple is?
a 3-tuple of the form <subject, predicate, object> where subject, predicate, and object are interpreted as in a natural language sentence.
It is most helpful to perceive RDF as a
——————, where subject resources are represented in ——————, literals in ——————, and predicates (relationships) are represented as directed —————— or between ——————.
graph
ovals
rectangles
edges between ovals
ovals and rectangles
For instance,
the triple representation of the sentence, “Washington, D.C. is the capital of the United States,” is illustrated in the following figure.
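The same triple can also be built programmatically; the following sketch uses the `rdflib` Python library, with URIs loosely modeled on DBpedia naming (the exact property name `capitalOf` is an illustrative assumption):

```python
from rdflib import Graph, Namespace

g = Graph()
dbr = Namespace("http://dbpedia.org/resource/")   # resources (subjects/objects)
dbo = Namespace("http://dbpedia.org/ontology/")   # properties (predicates)

# <subject, predicate, object>:
# "Washington, D.C. is the capital of the United States."
g.add((dbr["Washington,_D.C."], dbo.capitalOf, dbr.United_States))

for subject, predicate, obj in g:
    print(subject, predicate, obj)
```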
The Web Ontology Language (OWL) is ?
another ontology formalism that was developed to overcome the drawbacks of RDF.
Specifically, RDFs do not provide ways to represent constraints,
e.g., domain or range constraints.
Furthermore, —————— cannot be represented in the RDF data model.
transitive or inverse properties
Extending RDF(S) makes it straightforward to provide?
a formal specification in OWL.
Both RDF and OWL ontology formats have extensive developer community support in terms of?
the availability of tools for ontology creation and authoring.
An example is ?
Protege, which supports RDF and OWL formats, and data storage and management stores, such as
OpenSesame, for efficient storage and querying of data in RDF or OWL formats.
Furthermore, there is significant availability of actual ontologies in a variety of domains in the —————— formats.
RDF and OWL
The Semantic Sensor Network (SSN) is an example of ?
an ontology which relies on the OWL data model to describe sensors and observations.
It describes the IoT sensors in terms of ?
their capabilities,
measurement processes,
observations,
and deployments.
The SSN ontology is conceptually organized into ——————. In fact, the ontology can be seen from —————— main perspectives, namely?
ten modules
four
sensor perspective,
observation perspective,
system perspective,
and feature and property perspective.
The full ontology consists of?
41 concepts and 39 object properties.
The ontology can describe?
sensors,
the accuracy and capabilities of such sensors,
observations,
and methods used for sensing.
Concepts for operating and survival ranges are also included, as?
these are often part of a given specification for a sensor, along with its performance within those ranges.
Finally, a structure for field deployments is included to?
describe deployment lifetimes and sensing purposes of the deployed macro instrument.
To achieve automatic processing and interpretation of the IoT data, we need ?
common agreements on providing and describing the IoT data.
To evaluate the quality aspects of data, the source provider,
device,
and environment-specific information also need to be?
associated with the data.
Considering the diversity of data types, device types, and potential providers in the IoT domain, common description frameworks are essential to ?
describe
and represent the data to make it seamlessly accessible
and processable across heterogeneous platforms.
The semantic descriptions and annotations must be provided at different layers of the IoT framework, including:
the “Things” level, the device and network level (e.g., the SSN ontology), and the interaction and business process model, to?
enable autonomous processing and interpretation of the IoT data.
In fact, the effective discovery, access, and utilization of the IoT resources require?
machine-interpretable descriptions of different components and resources in the IoT framework,
e.g., sensors, actuators, and network resources.
The current Semantic Web technologies and ontologies can efficiently describe various aspects of?
the IoT data and resources.
Description models and representation frameworks that can describe the IoT data and services need to consider the constraints and dynamicity of the IoT domain,
since IoT environments are often dynamic and pervasive.
In this context, the concept of “linked data” emerges to connect?
individual data items to support semantic query and inferences on the data coming from the physical and virtual objects.
In other words,
linked data simply refers to data published on the Web in such a way that it is machine-readable, its meaning is explicitly defined, and it is readily linked to other external data sets.
The linked data, represented using formal knowledge representation —————— such as ——————, provides potential for information reuse and —————— among —————— sources. The information published as linked data is typically structured and interlinked.
formalisms
RDF and OWL
interoperability
heterogeneous
In general, publishing linked data widely encourages the reuse?
of existing information rather than creating new information.
This implies that human users can exploit the existing knowledge base by?
simply providing links to the data in it.
For instance,
the DBpedia project
- extracts structured information from Wikipedia.
- DBpedia enables sophisticated queries over the information that exists in Wikipedia.
- Moreover, it provides new ways of browsing and navigation through the semantic links.
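As an illustration, a SPARQL query against DBpedia's public endpoint can be issued with the `SPARQLWrapper` package; this sketch assumes network access, and the property used may differ from DBpedia's current schema:

```python
from SPARQLWrapper import SPARQLWrapper, JSON

sparql = SPARQLWrapper("https://dbpedia.org/sparql")
# Ask DBpedia for the capital of the United States (property assumed).
sparql.setQuery("""
    SELECT ?capital WHERE {
        <http://dbpedia.org/resource/United_States>
            <http://dbpedia.org/ontology/capital> ?capital .
    }
""")
sparql.setReturnFormat(JSON)

results = sparql.query().convert()
for binding in results["results"]["bindings"]:
    print(binding["capital"]["value"])
```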
Nevertheless, semantic descriptions without being linked to other existing data on the Web would be mostly?
processed locally and according to the domain descriptions (i.e., ontologies) and their properties.
The linked data approach offers four main principles to publish linked data:
(1) using URIs as names for data;
(2) providing HTTP access to those URIs;
(3) providing useful information for URIs using standards such as RDF and SPARQL; and finally
(4) including links to other URIs.
In fact, the emergence of sensor data as linked data enables IoT applications and sensor network providers to?
connect sensor descriptions to potentially endless data existing on the Web.
Specifically, the action of relating sensor data attributes, such as location, type, and measurement features, to the other resources on the Web of data enables the users to integrate physical world data and the logical world data.
The results of such an integration are:
drawing beneficial conclusions,
creating business intelligence,
enabling smart environments,
and supporting automated decision-making systems.
In order to get the most out of the integration of IoT and cloud computing, the use of ?
microservices is recommended.
Microservices represent an architectural approach for developing applications as a set of small services, where?
each service is running as a separate process, communicating through simple mechanisms.
Most of the advantages of the microservices architecture stem from ?
decomposing a service or an application into smaller components, i.e., microservices.
Each of these components should implement a specific functionality. As a result:
we can independently develop,
deploy,
upgrade,
and scale every microservice.
each microservice can be separately scaled?
Since the different microservices may have different workloads.
Accordingly, we can?
- use an optimal amount of resources, making the microservices architecture a natural fit for achieving both scalability and elasticity.
- separately control every microservice; each is easily manageable thanks to being small.
Developing microservices separately enables?
the employment of different technologies, e.g., different programming languages, for each microservice.
Furthermore, the task of releasing an update for a part of our application or service does not require?
the redeployment of the whole application,
but only
the corresponding microservice.
Microservices often communicate through web services, such as:
REST,
or through remote procedure calls (RPC).
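A minimal sketch of one such REST-communicating microservice is shown below, using Flask; the service name, route, and in-memory store are illustrative assumptions, and any HTTP framework would serve equally well:

```python
from flask import Flask, jsonify, request

app = Flask("temperature-service")
readings = []  # in-memory store; a real service would use its own database

@app.route("/readings", methods=["POST"])
def add_reading():
    # Another microservice would POST sensor readings here as JSON.
    readings.append(request.get_json())
    return jsonify(status="stored"), 201

@app.route("/readings", methods=["GET"])
def list_readings():
    return jsonify(readings)

if __name__ == "__main__":
    # Each microservice runs as its own process; a peer service would
    # call this one over HTTP, e.g. GET http://localhost:5000/readings
    app.run(port=5000)
```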
there is a need to reduce the communication between the different microservices to a minimum?
As communication between processes may become costly.
The advantages of microservices architecture are best identified when?
we compare it to the traditional monolithic architecture.
A monolithic application has all of its components packed together.
For instance,
monolithic web applications have the client-side,
the server-side,
and the database in a single logical executable.
Similarly, monolithic IoT applications have the whole logic for communication with IoT devices,
processing of devices’ data, communication with databases,
and visualization, in a single logical executable.
To achieve scalability and elasticity in monolithic applications, more instances of the whole application must be?
deployed or terminated. However, different application functionalities rarely have an equal share of the workload.
Alternatively, every microservice is packed as?
an independent component in the microservices approach.
We can scale every —————— independently and change the number of instances for each microservice separately. In this context, the application —————— can be controlled according to the workload of each of the microservices. To summarize, microservices —————— application development.
microservice
scalability
enable a scalable
elastic,
and resource-efficient
Another major difference between microservices and monolithic architecture is that?
the latter usually runs as a single process.
If we want to release an update of the application, the whole application must be?
redeployed for the changes to take effect;
it does not matter which component we have changed. With microservices, the update of one microservice has to cause?
no changes or only minor changes to the other microservices.
In this realm, we can highlight a potential challenge of the microservices approach. As mentioned previously, the communication between components is?
relatively expensive and has to be deliberately minimized.
If a change in a single microservice imposes many changes in other microservices, ?
the advantages of the microservices architecture might be lost.
For instance,
we should “componentize” the application into microservices in a way that would allow for the communication between microservices to be minimal?
Because a change in how an application communicates with IoT devices and receives data from them must have no impact or only minimal impact on how we process the data.
In some applications, the monolithic architecture could become excessively large. In these cases, several drawbacks emerge, such as:
the difficulty of software management, being more vulnerable, and being harder to update.
Bugs in monolithic applications could be expensive,?
as they cause the whole application to crash, whereas in the microservices architecture only the corresponding microservice collapses.
In this case, the microservices-based application can continue?
running and only the specific functionality implemented by the malfunctioned microservice is unavailable.
Such behavior of —————— is highly important in the IoT domain. For example,?
microservices-based architectures
if a microservice which communicates with a certain group of sensors crashes,
such a crash will not affect or stop the processing of the data provided by microservices
which communicate with other sensors.
The other components of the application will still be up and properly running.
In general, IoT applications have high requirements regarding?
scalability.
These scalability requirements fundamentally push toward?
designing distributed architectures rather than monolithic ones.
In general, the microservices architecture is ?
adaptable to the requirements of IoT applications.
When developing applications, it is generally good practice to break down the application into several ——————. Such components that programmers frequently use are referred to as ——————.
components
libraries
The concept of services in microservices architecture is similar to the libraries concept with one major difference:
libraries are essentially linked to a main program and when the program is running, there is only one process.
On the other hand, the microservices architecture tends to componentize a project into services, where each service is running in its own separate process.
each microservice could be deployed and scaled independently?
As the microservices architecture tends to componentize a project into services, where each service is running in its own separate process.
each microservice could be deployed and scaled independently. By ?
componentization into microservices, the problem of vast heterogeneity of devices could be simply addressed.
To this end, distinctive microservices can be implemented as?
proxies for the IoT devices that communicate using different protocols, e.g., Wi-Fi, LoRa, BLE
Furthermore, adding new devices, which may communicate using unsupported protocol, is usually resolved by?
adding a microservice acting as a proxy between protocols.
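The proxy idea can be sketched as follows; the protocol parsers are stand-ins for real BLE/LoRa handling, and the payload formats are invented for illustration. Supporting a new protocol amounts to adding a parser (or a new proxy microservice), with no change to the downstream services:

```python
def parse_ble(payload: bytes) -> dict:
    """Stand-in for a BLE parser: interpret the payload as a big-endian int."""
    return {"protocol": "BLE", "value": int.from_bytes(payload, "big")}

def parse_lora(payload: str) -> dict:
    """Stand-in for a LoRa parser: interpret the payload as a float string."""
    return {"protocol": "LoRa", "value": float(payload)}

def normalize(protocol: str, payload) -> dict:
    """Proxy entry point: route a raw payload to the right protocol parser
    and hand back one common reading format for the rest of the system."""
    parsers = {"ble": parse_ble, "lora": parse_lora}
    return parsers[protocol](payload)

print(normalize("ble", b"\x00\x1f"))   # {'protocol': 'BLE', 'value': 31}
print(normalize("lora", "31.4"))       # {'protocol': 'LoRa', 'value': 31.4}
```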
In general, there exist two common approaches for decomposition of applications into microservices:
verb-based
and noun-based strategies.
The former strategy deals with the —————— of an application around single use cases. Such a decomposition strategy is —————— for —————— applications.
decomposition
ill-suited
IoT
If we are dealing with multiple groups of devices, we might group the logic for communication with a certain type of devices, e.g.,?
temperature sensors, the data processing logic, and the visualization logic for this certain group of sensors in one microservice.
In fact, the approach is not a natural fit for IoT applications, ?
as the scaling of different modules is dependent upon different factors.
For instance:
The communication with devices is most dependent upon the number of devices and the amount of data they generate, while the visualization application must consider the number of users which access it simultaneously.
In the noun-based decomposition, a microservice is responsible for?
every operation related to a certain functionality.
A single microservice communicates with?
the devices and exchanges data with them
a second microservice processes the data, e.g., ?
CEP engine;
a third microservice might store the data in ?
a database for later processing;
and finally, a fourth microservice might be responsible for?
data visualization.
Such a decomposition leads to the design of a dynamic application, where ?
each functionality can be separately scaled.
A combination of the verb-based and noun-based approach is also ?
possible.
fault tolerance can be easily considered?
Since microservices are independent components.