Intro - Discover Data Analysis Flashcards

1
Q

Problem: why do businesses need Data Analysts?

A
  • Businesses still fail to treat data as a strategic asset and struggle to use their data in a productive way
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How can businesses get the most out of their data?

A
  • Unlock data through accurate storytelling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can it be used to drive competitive advantage?

A
  • Enables a business to make faster decisions on more precise information
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Data Analysis? How does this work?

A
  • Processing - Data analysis is the process of identifying, cleaning, transforming, and modelling data to discover meaningful and useful information.
  • Selling Story - The data is then crafted into a story through reports for analysis to support the critical decision-making process.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is descriptive analytics?

A
  • Describes and summarises historical data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Diagnostic analytics? What is the process?

A
  • Diagnostic analytics answer questions about why events happened
  • Diagnostic techniques supplement findings from descriptive statistics to uncover the cause of events (e.g. why these events became better or worse)
    (1) Identify anomaly
    (2) Collect data related to anomaly
    (3) Use statistical techniques to discover relationships in these patterns
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Predictive analytics?

A
  • Predictive analytics techniques use historical data to identify trends and determine if they are likely to occur again in the future
  • Usually one outcome
  • Includes statistical and machine learning techniques
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Prescriptive analytics?

A
  • Prescriptive analytics help answer questions about which actions should be taken to achieve a goal or target.
  • Analyses past data to estimate the likelihood of different outcomes (multiple outcomes)
  • Uses machine learning techniques
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Cognitive analytics?

A
  • Similar to random forest that learn from past data to improve decisions (e.g. weak learners make strong learners)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why are there so many roles in data?

A
  • Used to be roles with large remits such as Business Analyst and BI developers
  • Large multi-stage projects and the expansion of Big Data and IoT has led to more specialised data roles
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Business analyst?

A
  • Business Analysts are closer to business objectives and are specialists in interpreting data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Data analyst

A
  • A data analyst enables businesses to maximize the value of their data assets through visualization and reporting tools such as Microsoft Power BI
  • Responsibilities - designing and building scalable and effective data models, enabling and implementing the advanced analytics capabilities into reports for analysis, and managing Power BI assets.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data engineer

A
  • Provides Database: Data engineers provision and set up data platform technologies that are on-premises and in the cloud
  • Manage data types - Data engineers also manage the flow of structured and unstructured data from multiple sources
  • Wrangling - Data engineers process and wrangle large amounts of data so data scientists and data analysts can focus producing insights
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Data scientist

A
  • Data scientists perform advanced analytics to extract value from data
  • Can use descriptive (e.g. EDA) and predictive analytics (e.g. machine learning)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Database administrator

A
  • Operates Database - A database administrator implements and manages the operational aspects of cloud-native and hybrid data platform solutions that are built on Microsoft Azure data services and Microsoft SQL Server.
  • Monitors - A database administrator monitors and manages the overall health of a database and the hardware that it resides on
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Prepare?

A
  • Ultimate goal = to ensure data is accurate
  • Data preparation is the process of profiling, cleaning, and transforming your data to get it ready to model and visualize.
  • Data Integrity - correcting inaccurate data, identify missingness, improving readability
  • Also involves (a) performance implications of connecting data (b) security assurances
17
Q

Model?

A
  • Ultimate goal = create relationships between tables
  • Data modelling is the process of determining how your tables are related to each other by defining and creating relationships between the tables.
  • Good models help: (a) reports more accurate (b) deliver faster insights (c) create better reports (d) faster performance
18
Q

Visualize?

A
  • Ultimate goal = to solve business problems through data storytelling
19
Q

Analyse?

A
  • Ultimate goal - create insights (summary + synthesis) from the report to drive more meaningful insights
  • Analytical capabilities - Power BI has analytical capabilities to identify trends, patterns and predict outcomes
20
Q

Manage?

A
  • Ultimate goal - overseeing the sharing and distribution of Power BI assets
  • Responsibilities = collaboration (e.g. knowledge sharing) , security (e.g. no leaks), reduce silos (e.g. many datasets brought into Power BI)
21
Q

Explain the dependencies between tasks?

A
  • Modelling depends on Prepare to be in a proper state