M1 Flashcards

1
Q

[ ] are fundamental components in various fields, providing tools for data interpretation and decision-making.

A

Statistical analysis and Modeling (SAM)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

[ ] primarily focuses on the collection, analysis, interpretation, presentation, and organization of data.

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

[ ] provides foundational tools for understanding data distributions, variability, and relationships through methods such as hypothesis testing and regression analysis.

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

[ ] encompasses a broader scope, integrating statistical methods with advanced computational techniques to derive insights from data.

A

Analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

[ ] emphasizes predictive modeling, data mining, and the application of algorithms to inform strategic decisions and optimize processes.

A

Analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

A [ ] is an interlinked set of activities that an organization performs to convert inputs to outputs that are valuable to a market.

A

value chain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Insights that prescribe direct and meaningful actions then [ ].

A

drive decision making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is the value of data to organizations and their market is realized?

A

When actions or decisions are implemented from them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the five main data sources?

A

(TransData - ContractSub - Surve - DataPool - Unstruct)
- Transactional Data
- Contractual, Subscription, or Account Data
- Surveys
- Data Poolers
- Unstructured Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

This type of data source consists of structured, detailed information capturing the key characteristics of a transaction.

A

Transaction Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

This type of data source includes information about the type of product combined with customer characteristics.

A

Contractual, Subscription, or Account Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

This type of data source are questionnaires aimed at extracting sociodemographic and behavioral data from a particular group of people.

A

Surveys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

These are companies that gather data in particular settings or for particular purposes and sell them to interested customers looking to enrich or extend their data sources.

A

Data Poolers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

This refers to information that does not reside in a traditional row-column database in the world of big data.

A

Unstructured Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the phases of Data Analytics?

A

(Bu - Du - Dp - M - E - D)
- Business Understanding
- Data Understanding
- Data Preparation
- Modelling
- Evaluation
- Deployment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

[ ] is knowing what the study is for or identifying a business task.

A

Business Understanding

17
Q

[ ] is when you select the related data from many available databases to correctly describe a given business task; identifying relevant data for the problem description.

A

Data Understanding

18
Q

[ ] is also known as data preprocessing.

A

Data Preparation

19
Q

[ ] is to filter, aggregate, and fill-in (impute) missing values.

A

Data Preparation

20
Q

[ ] uses mathematical formulations to convert different measurements into a unified numerical scale.

A

Data Transformation

21
Q

Transforming numerical to numerical scales [ ].

A

shrinks or enlarges the data

22
Q

Transforming categorical to numerical values can be [ ].

A

ordinal (less, moderate, strong) or nominal (red, yellow, blue).

23
Q

What are the two major categories of modeling?

A
  • Predictive Modeling
  • Descriptive Modeling
24
Q

[ ] predicts the value of an attribute based on the values of other attributes.

A

Predictive Modeling

25
Q

[ ] derives patters that summarizes the underlying relationships in the data.

A

Descriptive Modeling

26
Q

[ ] summarizes the general characteristics or features of a target class of data.

A

Characterization

27
Q

What are the methods of Characterization and Discrimination?

A
  • data summaries based on stat measures and plots.
  • user-controlled data summarization using OLAP, EXCEL, Spreadsheet, SQL, Python, etc.
28
Q

What are the outputs of Characterization and Discrimination?

A
  • pie charts, bar charts, curves, crosstabs
  • Characteristic Rules
29
Q

[ ] compares the general features of the target class data objects against the general features of objects from one or multiple contrasting classes.

A

Discrimination

30
Q

[ ] is the process of finding a model or function that describes and distinguishes classes or concepts.

A

Classification

31
Q

[ ] is a statistical methodology that is most often used for numeric prediction.

A

Regression Analysis

32
Q

[ ] is when objects are grouped based on the principle of maximizing the intraclass similarity and minimizing the interclass similarity.

A

Clustering

33
Q

[ ] detects objects in data that do not follow norms and its methods may include statistical tests or using distance measures.

A

Outlier Analysis

34
Q

What are the two things to consider in the Data Interpretation stage?

A
  • How to recognize business value from knowledge patters discovered.
  • How to visualize the results to properly interpret patterns.
35
Q

A pattern is interesting if:

A
  • easily understood by humans
  • valid on new or test data with some degree of certainty
  • potentially useful
  • novel
36
Q

What are the two primary techniques of descriptive analytics?

A
  • Data Aggregation
  • Data Presentation