Intro to Data Analytics Flashcards

1
Q

Which role is responsible for project initiation and providing the requirements for a project?

A

Project sponsor

2
Q

Which job position is primarily responsible for designing and constructing data pipelines within the field of data analytics?

A

Data engineer

3
Q

Which role in a data analytics project helps data scientists shape data for analysis?

A

Data engineer

4
Q

Which role in a data analytics project provides expertise for analytical techniques?

A

Data scientist

5
Q

Which skills are required by data scientists for converting unstructured data to structured data in data analytics projects?

A

Data wrangling skills

6
Q

Which skill must a business intelligence analyst possess to collect and organize data?

A

Data preparation

7
Q

Which project-related activity typically takes up the majority of a data analyst’s time?

A

Data preparation

8
Q

Which software do business intelligence analysts use to perform their responsibilities?

A

Microsoft Excel

9
Q

What is a skill required of a data engineer?

A

Maintaining databases

10
Q

Which group of stakeholders comprises professionals such as line managers?

A

Business users

11
Q

Which stakeholder is primarily responsible for ensuring the desired quality of the project?

A

Project managers

12
Q

Which stakeholder has access to essential tables or storage systems and guarantees the highest levels of security in the data repository?

A

Database administrator

13
Q

Why is it important to establish failure criteria for a data analytics project in the discovery phase?

A

It helps the team determine when it is best to accept the conclusions.

14
Q

Which stakeholder extracts and transforms data during the discovery phase?

A

Data engineer

15
Q

A person has been assigned to manage a project to implement a company-wide customer relationship management (CRM) system. The CRM system aims to centralize customer details, automate sales processes, and improve customer service. What skills are crucial for the project team members working on the CRM system implementation?

A

Data analysis, system integration, and training

16
Q

Why is formulating an initial hypothesis an integral part of the discovery phase of the data analytics lifecycle?

A

It guides the subsequent data collection, processing, and analysis activities.

17
Q

Who should be included as stakeholders in an analytics project?

A

Anyone who will benefit from the project

18
Q

Who offers suggestions on ideas to test as the team formulates hypotheses during the discovery phase of a data analytics project?

A

Data scientists

19
Q

Which activity occurs during the data preparation phase of the data analytics lifecycle?

A

Understanding of data

20
Q

Which data visualization is most suitable for understanding the trend and progression of a variable over time in the data preparation phase?

A

Line charts

21
Q

Which task is commonly performed to identify and address data quality issues during the data preparation phase?

A

Conducting data profiling
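
As a rough illustration of what data profiling can look like, the sketch below summarizes one column: how many values it has, how many are missing, how many are distinct, and its range. The function name `profile_column` and the use of `None` to mark missing values are assumptions made for this example, not part of the card.

```python
def profile_column(values):
    """Summarize one column; None marks a missing value (an assumption here)."""
    present = [v for v in values if v is not None]
    return {
        "count": len(values),                     # total rows
        "missing": len(values) - len(present),    # rows with no value
        "distinct": len(set(present)),            # unique observed values
        "min": min(present) if present else None,
        "max": max(present) if present else None,
    }

# Hypothetical column of order quantities with one gap.
quantities = [3, None, 3, 7]
print(profile_column(quantities))
```

A high missing count or an implausible minimum or maximum in such a summary is the kind of quality issue profiling is meant to surface before cleaning begins.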

22
Q

Which common data cleaning task is used to address the missing data in a data set?

A

Imputation
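
One simple form of imputation is mean imputation, where each missing value is replaced by the average of the observed values. The sketch below assumes missing entries are represented as `None`; the function name `impute_mean` and the sample data are hypothetical.

```python
from statistics import mean

def impute_mean(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute: no observed values")
    fill = mean(observed)
    return [fill if v is None else v for v in values]

# Hypothetical customer ages with two gaps.
ages = [34, None, 29, 41, None]
print(impute_mean(ages))
```

Mean imputation is only one option; median or model-based imputation may suit skewed data better.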

23
Q

Which task is typically performed to handle outliers during the data preparation phase?

A

Truncating extreme values
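
One way to truncate extreme values is to clip them to fences derived from the interquartile range (IQR). This is a minimal sketch under that assumption; the function name `clip_to_iqr_fences`, the multiplier `k=1.5`, and the sample data are illustrative choices, not something the card prescribes.

```python
from statistics import quantiles

def clip_to_iqr_fences(values, k=1.5):
    """Clip values to [Q1 - k*IQR, Q3 + k*IQR]; k=1.5 is a common convention."""
    q1, _, q3 = quantiles(values, n=4)  # quartiles of the data
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [min(max(v, low), high) for v in values]

# Hypothetical daily sales with one extreme entry.
sales = [12, 14, 15, 13, 16, 14, 250]
print(clip_to_iqr_fences(sales))
```

Values inside the fences pass through unchanged, while the extreme entry is pulled back to the upper fence instead of being dropped.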

24
Q

A data analyst at a retail company is provided with a large dataset containing sales transactions, customer information, and product details. The analyst is tasked with preparing the data for analysis and modeling. Which activity would the analyst perform during the data preparation phase?

A

Exploring available data to understand its characteristics and suitability

25
Q

Which activity is performed during the model planning phase of a data analysis project?

A

Selecting relevant features for modeling

26
Q

Which programming language is primarily used for statistical analysis and data manipulation in the model planning phase?

A

R

27
Q

Which classification model is based on the concept of probability and assigns class labels to instances based on the likelihood of belonging to a particular class?

A

Naive Bayes
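
To make the idea concrete, here is a small sketch of a Naive Bayes classifier for categorical features. It scores each class by its prior probability times the per-feature conditional probabilities (treated as independent) and picks the highest score. The function names, the add-one (Laplace) smoothing, and the toy weather data are all assumptions for illustration.

```python
from collections import Counter, defaultdict
from math import log

def train_naive_bayes(rows, labels):
    """Count class frequencies and per-class feature-value frequencies."""
    class_counts = Counter(labels)
    feature_counts = defaultdict(lambda: defaultdict(Counter))
    vocab = defaultdict(set)  # distinct values seen for each feature index
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            feature_counts[label][i][value] += 1
            vocab[i].add(value)
    return class_counts, feature_counts, vocab

def predict(model, row):
    """Return the class with the highest log-probability score."""
    class_counts, feature_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, float("-inf")
    for label, count in class_counts.items():
        score = log(count / total)  # log prior
        for i, value in enumerate(row):
            seen = feature_counts[label][i][value]
            # Add-one smoothing keeps unseen values from zeroing the score.
            score += log((seen + 1) / (count + len(vocab[i])))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical training data: (outlook, temperature) -> play outside?
rows = [("sunny", "hot"), ("sunny", "mild"), ("rainy", "mild"), ("rainy", "cool")]
labels = ["no", "no", "yes", "yes"]
model = train_naive_bayes(rows, labels)
print(predict(model, ("rainy", "cool")))
```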

28
Q

Which tool is used to connect users to relational databases and data warehouse appliances in the model planning phase?

A

SAS/ACCESS

29
Q

Which regression model is commonly used for predicting a continuous numerical outcome based on a set of input features?

A

Linear regression
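
For a single input feature, linear regression reduces to fitting a line by ordinary least squares. The sketch below computes the slope and intercept directly from the data; the function name `fit_line` and the sample numbers are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: advertising spend versus units sold.
spend = [1, 2, 3, 4]
units = [3, 5, 7, 9]
slope, intercept = fit_line(spend, units)
print(slope * 5 + intercept)  # predicted units for a spend of 5
```

With several input features the same idea applies, but the fit is usually done with a numerical library rather than by hand.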

30
Q

Which phase of the data analytics life cycle involves running analytical software packages on small datasets to test and refine models?

A

Model execution phase

31
Q

Which step is typically performed after executing the model in the model execution phase?

A

Result analysis

32
Q

Which testing procedure is used for evaluating the performance of a model in the data analytics life cycle?

A

Cross-validation
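
A common variant is k-fold cross-validation: the data is split into k folds, and each fold takes one turn as the test set while the rest is used for training. The sketch below uses a deliberately trivial "model" (predict the training mean) so the fold logic stays visible; the function names and data are assumptions for illustration.

```python
def k_fold_indices(n, k):
    """Yield (train, test) index lists for k contiguous folds over n rows."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n) if i < start or i >= start + size]
        yield train, test
        start += size

def cross_validate_mean_model(ys, k):
    """Average test mean-squared error of a predict-the-mean baseline."""
    errors = []
    for train, test in k_fold_indices(len(ys), k):
        prediction = sum(ys[i] for i in train) / len(train)
        mse = sum((ys[i] - prediction) ** 2 for i in test) / len(test)
        errors.append(mse)
    return sum(errors) / len(errors)

# Hypothetical target values.
targets = [10, 12, 11, 13, 12, 14]
print(cross_validate_mean_model(targets, k=3))
```

In practice the rows are usually shuffled before splitting, which this sketch omits for clarity.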

33
Q

What is the role of SPSS Modeler in the model execution phase of the data analytics life cycle?

A

It is used for applying the trained model to new data for predictions.

34
Q

How does the communication of results tie to the operationalize phase of data analytics?

A

It implements data-driven insights into business functions.

35
Q

Which data visualization tool in the communicate results phase is used to create web-based visualizations?

A

D3.js

36
Q

Which measure assesses the validity of a correlation between two variables during the communicate results phase?

A

P-value
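
One way to see what a p-value for a correlation means is a permutation test: shuffle one variable many times and count how often the shuffled data shows a correlation at least as strong as the observed one. This is a sketch of that idea, not the only way to obtain a p-value; the function names, the number of permutations, the seed, and the data are all hypothetical.

```python
import random

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

def permutation_p_value(xs, ys, n_permutations=2000, seed=0):
    """Two-sided p-value: share of shuffles with |r| >= the observed |r|."""
    rng = random.Random(seed)  # fixed seed for repeatable results
    observed = abs(pearson_r(xs, ys))
    shuffled = list(ys)
    extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs(pearson_r(xs, shuffled)) >= observed:
            extreme += 1
    # Add one to both counts so the estimate is never exactly zero.
    return (extreme + 1) / (n_permutations + 1)

# Hypothetical data: study hours versus exam score.
hours = [1, 2, 3, 4, 5, 6, 7, 8]
scores = [52, 55, 61, 64, 70, 71, 78, 83]
print(permutation_p_value(hours, scores))
```

A small p-value suggests the observed correlation would be unlikely if the two variables were unrelated; it does not by itself show that one causes the other.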