chapter 1 Flashcards
Define Data Analytics
the process of evaluating data with the purpose of drawing conclusions to address business questions.
What is used to analyze data to give organizations the information they need to make sound and timely decisions?
technologies, systems, practices, methodologies, databases, statistics, and applications.
Patterns are discovered from…
past archives
What is an analytics mindset?
recognizing when and how data analytics can address accounting questions
what is Data scrubbing and data preparation
comprehend the process needed to extract (query), clean, and prepare the data before analysis
Define data quality
recognize what is meant by data quality, be it completeness, reliability, or validity
descriptive data analysis
perform basic analysis to understand the quality of the underlying data and their ability to address the business question
data analysis through data manipulation
demonstrate ability to sort, rearrange, merge, and reconfigure data in a manner that allows enhanced analysis
problem solving through statistical data analysis
identify and implement an approach that will use statistical data analysis to draw conclusions and make recommendations on a timely basis
data visualization and data reporting
report results of analysis in an accessible way to each varied decision maker and his or her specific needs
what is the objective of data extraction
to identify and obtain the data from the appropriate source
what is the objective of transforming data
to validate the data for completeness and integrety
what is the objective of loading data
to load the data into the appropriate tool for analysis
what are the five steps of the ETL process
determine the purpose and scope of the data request, obtain the data, validate the data for completeness and integrity, clean the data, load the data for data analysis.
Define classification
an attempt to assign each unit in a population into a few categories
define Regression
a data approach that attempts to estimate or predict, for each unit, the numerical value of some variable using some type of statistical model.
define similarity matching
a data approach that attempts to identify similar individuals based on data known about them
define clustering
an attempt to divide individuals into groups in a useful or meaningful way
define co-occurrance grouping
a data approach that attempts to discover associations between individuals based on transactions involving them (i.e. when amazon says customers who bought this also bought…
define profiling
a data approach that attempts to characterize the “typical” behavior of an individual, group, or population by generating summary statistics about the data (mean, median, stnd deviation)
define link prediction
a data approach that attempts to predict a relationship between 2 data items (i.e. facebook sees you have 20 mutual friends w someone, suggests them as a friend)
define structured data
data that are stored in a database or spreadsheet and are readily searchable