Lesson 6.3 Advanced Analytics Flashcards

1
Q

What is the “trough of Disillusionment?”

A

phase of the hype cycle where our perceptions did not meet reality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

This is a centralized repository that allows you to store all your structured and unstructured data in its natural state and in its entirety.

A

data lake

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

This is an application of artificial intelligence (AI) that provides
systems with the ability to automatically learn and improve from experi-
ence without being explicitly programmed.

A

Machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

This is anything that happens at a clearly defined time and that can be specifically recorded

A

event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

these usually include data about the type of activity, when the activity occurred as well as it’s location and cause

A

event objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

this is a constant and continuous flow of event objects that navigate into and around companies from thousands of connected devices, medical internet of things and any other sensors

A

stream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

this is the final act of analyzing all of this data

A

processsing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

This is a step-by step set of instructions for carrying out a process for problem-solving

A

algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

this is data in a data set that does not match an expected or projected pattern

A

anomaly detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

this is The theory and development of computer systems able to
perform tasks that normally require human intelligence,
such as visual perception, speech recognition, decision-
making, and translation between languages

A

artificial intelligence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

This is Identifying data in a data set that is similar and grouping it
together to understand the similarities as well as the
differences within a data set

A

clustering analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

this is an analysis of data to determine a positive or negative relationship

A

correlation analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

this is A subset of machine learning, utilizing a hierarchical level of
artificial neural networks to carry out the process of
machine learning

A

deep learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

This is a process in which data is extracted from a source, then transformed and loaded into a data warehouse

A

extract, transform, and load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

this is an open source framework for the storage and processing of Big Data across a distributed file system

A

Hadoop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

This is a column-oriented data store allowing for fast access to data stored in HDFS

A

HBase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

This is a file system for the storage of data across many computers

A

Hadoop Distributed File SystemHDFS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

This is the use of super computers to rapidly solve complex problems

A

High performance computing (HPC)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

This is a Hadoop data system that facilitates the interrogation of data stored in HDFS using structured query language (SQL)

A

Hive

20
Q

This is a database management system that stores data in memory not on a disk, resulting in fast processing

A

in-memory

21
Q

this is the name for medical devices connected to the internet via sensors

A

Medical Internet of Things

22
Q

This is a process in which software learns during data processing and becomes more accurate over time

A

machine learning

23
Q

This is the processes of breaking up problems into pieces that are then distributed across multiple computers on the same network or cluster

A

MapReduce

24
Q

This means data about data, information about stored data elements

A

Metadata

25
Q

this is an open source, reliable, high performance, scalable, document database

A

MongoDB

26
Q

This is extracting information from text

A

natural language processing

27
Q

this is an open source graph database

A

Neo4j

28
Q

this is a data flow management application

A

NiFi

29
Q

These are databases that do not use the relational model, such as databases that store documents, tweets and so on

A

NoSQL

30
Q

This is a representation of a body of knowledge as a set of domain-specific concepts

A

Ontology

31
Q

This is a data movement in which data sets are made available to the public for use without charge.

A

Open Data

32
Q

This means applications in which the source code is available to the general public for use or modification

A

open source

33
Q

this is identification of patterns in data via algorithms

A

pattern recognition

34
Q

this is a programming language used in the Hadoop framework

A

Pig

35
Q

this is the use of existing data sets and algorithms to predict the probability that a future event will occur

A

predictive analytics

36
Q

this is a movement to incorporate data acquisition about self into all aspects of a person’s daily living

A

Quantified self

37
Q

This is an open source programing lanquage used for statistical computation, most commonly used to develop statistical software

A

R

38
Q

This is a system in which treatments, therapies, and medications are recommended based on patient data

A

recommender systems

39
Q

this is the use of algorithms to understand human feelins

A

sentiment analysis

40
Q

this is data that is organized in a predetermined structure

A

structured data

41
Q

this is data that does not prescribe to a predetermined structure, such as free text

A

unstructured data

42
Q

What system of the processing of the brain presented by Daniel Kahneman represents the automatic and intuitive thinking process?

A

System 1

43
Q

What system of the processing of the brain presented by Daniel Kahneman represents the thinking process that requires effort and attention?

A

System 2

44
Q

Why is it important when developing visualizations of healthcare data to insure that they invoke system 1 processes?

A

You do not want the viewers spending time trying to figure out what the data is representing, you want them to understand immediately

45
Q

What other factors should be considered when developing data visualizations?

A

colorblind palate
make sure they can render in any platform

46
Q
A