Lesson 6.3 Advanced Analytics Flashcards
What is the “trough of Disillusionment?”
phase of the hype cycle where our perceptions did not meet reality
This is a centralized repository that allows you to store all your structured and unstructured data in its natural state and in its entirety.
data lake
This is an application of artificial intelligence (AI) that provides
systems with the ability to automatically learn and improve from experi-
ence without being explicitly programmed.
Machine learning
This is anything that happens at a clearly defined time and that can be specifically recorded
event
these usually include data about the type of activity, when the activity occurred as well as it’s location and cause
event objects
this is a constant and continuous flow of event objects that navigate into and around companies from thousands of connected devices, medical internet of things and any other sensors
stream
this is the final act of analyzing all of this data
processsing
This is a step-by step set of instructions for carrying out a process for problem-solving
algorithm
this is data in a data set that does not match an expected or projected pattern
anomaly detection
this is The theory and development of computer systems able to
perform tasks that normally require human intelligence,
such as visual perception, speech recognition, decision-
making, and translation between languages
artificial intelligence
This is Identifying data in a data set that is similar and grouping it
together to understand the similarities as well as the
differences within a data set
clustering analysis
this is an analysis of data to determine a positive or negative relationship
correlation analysis
this is A subset of machine learning, utilizing a hierarchical level of
artificial neural networks to carry out the process of
machine learning
deep learning
This is a process in which data is extracted from a source, then transformed and loaded into a data warehouse
extract, transform, and load
this is an open source framework for the storage and processing of Big Data across a distributed file system
Hadoop
This is a column-oriented data store allowing for fast access to data stored in HDFS
HBase
This is a file system for the storage of data across many computers
Hadoop Distributed File SystemHDFS
This is the use of super computers to rapidly solve complex problems
High performance computing (HPC)