Lesson 6.3 Advanced Analytics Flashcards

1
Q

What is the “trough of Disillusionment?”

A

phase of the hype cycle where our perceptions did not meet reality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

This is a centralized repository that allows you to store all your structured and unstructured data in its natural state and in its entirety.

A

data lake

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

This is an application of artificial intelligence (AI) that provides
systems with the ability to automatically learn and improve from experi-
ence without being explicitly programmed.

A

Machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

This is anything that happens at a clearly defined time and that can be specifically recorded

A

event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

these usually include data about the type of activity, when the activity occurred as well as it’s location and cause

A

event objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

this is a constant and continuous flow of event objects that navigate into and around companies from thousands of connected devices, medical internet of things and any other sensors

A

stream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

this is the final act of analyzing all of this data

A

processsing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

This is a step-by step set of instructions for carrying out a process for problem-solving

A

algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

this is data in a data set that does not match an expected or projected pattern

A

anomaly detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

this is The theory and development of computer systems able to
perform tasks that normally require human intelligence,
such as visual perception, speech recognition, decision-
making, and translation between languages

A

artificial intelligence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

This is Identifying data in a data set that is similar and grouping it
together to understand the similarities as well as the
differences within a data set

A

clustering analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

this is an analysis of data to determine a positive or negative relationship

A

correlation analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

this is A subset of machine learning, utilizing a hierarchical level of
artificial neural networks to carry out the process of
machine learning

A

deep learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

This is a process in which data is extracted from a source, then transformed and loaded into a data warehouse

A

extract, transform, and load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

this is an open source framework for the storage and processing of Big Data across a distributed file system

A

Hadoop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

This is a column-oriented data store allowing for fast access to data stored in HDFS

A

HBase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

This is a file system for the storage of data across many computers

A

Hadoop Distributed File SystemHDFS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

This is the use of super computers to rapidly solve complex problems

A

High performance computing (HPC)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

This is a Hadoop data system that facilitates the interrogation of data stored in HDFS using structured query language (SQL)

20
Q

This is a database management system that stores data in memory not on a disk, resulting in fast processing

21
Q

this is the name for medical devices connected to the internet via sensors

A

Medical Internet of Things

22
Q

This is a process in which software learns during data processing and becomes more accurate over time

A

machine learning

23
Q

This is the processes of breaking up problems into pieces that are then distributed across multiple computers on the same network or cluster

24
Q

This means data about data, information about stored data elements

25
this is an open source, reliable, high performance, scalable, document database
MongoDB
26
This is extracting information from text
natural language processing
27
this is an open source graph database
Neo4j
28
this is a data flow management application
NiFi
29
These are databases that do not use the relational model, such as databases that store documents, tweets and so on
NoSQL
30
This is a representation of a body of knowledge as a set of domain-specific concepts
Ontology
31
This is a data movement in which data sets are made available to the public for use without charge.
Open Data
32
This means applications in which the source code is available to the general public for use or modification
open source
33
this is identification of patterns in data via algorithms
pattern recognition
34
this is a programming language used in the Hadoop framework
Pig
35
this is the use of existing data sets and algorithms to predict the probability that a future event will occur
predictive analytics
36
this is a movement to incorporate data acquisition about self into all aspects of a person's daily living
Quantified self
37
This is an open source programing lanquage used for statistical computation, most commonly used to develop statistical software
R
38
This is a system in which treatments, therapies, and medications are recommended based on patient data
recommender systems
39
this is the use of algorithms to understand human feelins
sentiment analysis
40
this is data that is organized in a predetermined structure
structured data
41
this is data that does not prescribe to a predetermined structure, such as free text
unstructured data
42
What system of the processing of the brain presented by Daniel Kahneman represents the automatic and intuitive thinking process?
System 1
43
What system of the processing of the brain presented by Daniel Kahneman represents the thinking process that requires effort and attention?
System 2
44
Why is it important when developing visualizations of healthcare data to insure that they invoke system 1 processes?
You do not want the viewers spending time trying to figure out what the data is representing, you want them to understand immediately
45
What other factors should be considered when developing data visualizations?
colorblind palate make sure they can render in any platform
46