Data Analysis Flashcards
Help answer questions about what has happened based on historical data
Descriptive Analytics
Summarize large datasets to describe outcomes to stakeholders
Descriptive Analytics
By developing key performance indicators (KPIs), these strategies can help track the success or failure of key objectives.
Descriptive Analytics
Generate reports to provide a view of an organization’s sales and financial data.
Descriptive Analytics
Identify anomalies in the data.
Diagnostic Analytics
Answer questions about why events happened.
Diagnostic Analytics
Supplement basic descriptive analytics, and they use the findings from descriptive analytics to discover the cause of these events.
Diagnostic Analytics
Collect data that’s related to these anomalies.
Diagnostic Analytics
Use statistical techniques to discover relationships and trends that explain these anomalies.
Diagnostic Analytics
Help answer questions about what will happen in the future.
Predictive Analytics
Use historical data to identify trends and determine if they’re likely to recur.
Predictive Analytics
Help answer questions about which actions should be taken to achieve a goal or target.
Prescriptive Analytics
Use insights from predictive analytics, organizations can make data-driven decisions.
Prescriptive Analytics
Rely on machine learning strategies to find patterns in large datasets.
Prescriptive Analytics
Attempt to draw inferences from existing data and patterns, derive conclusions based on existing knowledge bases, and then add these findings back into the knowledge base for future inferences, a self-learning feedback loop.
Cognitive Analytics
Help you learn what might happen if circumstances change and determine how you might handle these situations.
Cognitive Analytics
Closer to the business and is a specialist in interpreting the data that comes from the visualization.
Business Analyst
Enable businesses to maximize the value of their data assets through visualization and reporting tools such as Microsoft Power BI.
Data Analyst
Responsible for profiling, cleaning, and transforming data.
Data Analyst
Design and build scalable and effective data models, and enabling and implementing the advanced analytics capabilities into reports for analysis.
Data Analyst
Turn raw data into relevant and meaningful insights.
Data Analyst
Responsible for the management of Power BI assets, including reports, dashboards, workspaces, and the underlying datasets that are used in the reports.
Data Analyst
Implement and configure proper security procedures, in conjunction with stakeholder requirements, to ensure the safekeeping of all Power BI assets and their data.
Data Analyst
Work with data engineers to determine and locate appropriate data sources that meet stakeholder requirements.
Data Analyst
Works with the data engineer to identify new processes or improve existing processes for collecting data for analysis.
Data Analyst
Provision and set up data platform technologies that are on-premises and in the cloud.
Data Engineer
Manage and secure the flow of structured and unstructured data from multiple sources.
Data Engineer
Ensure that data services securely and seamlessly integrate across data services.
Data Engineer
Use of on-premises and cloud data services and tools to ingest, egress, and transform data from multiple sources.
Data Engineer
Collaborate with business stakeholders to identify and meet data requirements. They design and implement solutions.
Data Engineer
Perform advanced analytics to extract value from data.
Data Scientist
Might work in the realm of deep learning, performing iterative experiments to solve a complex data problem by using customized algorithms.
Data Scientist
Implements and manages the operational aspects of cloud-native and hybrid data platform solutions that are built on Microsoft Azure data services and Microsoft SQL Server.
Database administrator
Responsible for the overall availability and consistent performance and optimizations of the database solutions.
Database administrator
Monitors and manages the overall health of a database and the hardware that it resides on
Database administrator
Responsible for managing the overall security of the data, granting and restricting user access and privileges to the data as determined by business needs and requirements.
Database administrator
Process of profiling, cleaning, and transforming your data to get it ready to model and visualize.
Data Preparation
The process of taking raw data and turning it into information that is trusted and understandable. It involves, among other things, ensuring the integrity of the data, correcting wrong or inaccurate data, identifying missing data, converting data from one structure to another or from one type to another, or even a task as simple as making data more readable.
Data Preparation
Understanding how you’re going to get and connect to the data and the performance implications of the decisions.
Data Preparation
Process of determining how your tables are related to each other. This process is done by defining and creating relationships between the tables. From that point, you can enhance the model by defining metrics and adding custom calculations to enrich your data.
Data Modeling
Prepare, Model, Visualize, Analyze and Manage
Data Analyst Tasks
Has a direct effect on the performance of your report and overall data analysis.
Data Modeling
Bring your data to life
Data Visualization
Goal is to solve business problems.
Data Visualization
Reports should be designed with accessibility in mind from the outset so that no special modifications are needed in the future.
Data Visualization
Understanding and interpreting the information that is displayed on the report.
Data Analyzation
Organizations can drill into the data to predict future patterns and trends, identify activities and behaviors, and enable businesses to ask the appropriate questions about their data.
Advanced Analytics
AI integrations within Power BI can take your analysis to the next level. Integrations with Azure machine learning, cognitive services, and built-in AI visuals will help to enrich your data and analysis.
Data Analyzation
Help reduce data silos within your organization.
Manage Data
Reduce data silos with the use of shared datasets, and it allows you to reuse data that you have prepared and modeled.
Manage Data
Microsoft Windows desktop application called Power BI Desktop
Power BI
Online SaaS (Software as a Service) service called the Power BI service
Power BI
Mobile Power BI apps that are available on any device, with native mobile BI apps for Windows, iOS, and Android.
Power BI
Visualizations, Datasets, Reports, Dashboards, and Tiles
Power BI Building Blocks
Visual representation of data, like a chart, a color-coded map, or other interesting things you can create to represent your data visually.
Visualizations
Collection of data that Power BI uses to create its visualizations.
Datasets
Combination of many different sources, which you can filter and combine to provide a unique collection of data for use in Power BI.
Datasets
Collection of visualizations that appear together on one or more pages.
Reports
Let you create many visualizations, on multiple pages if necessary, and let you arrange those visualizations in whatever way best tells your story.
Reports
Single visualization on a report or a dashboard.
Tile
Collection of preset, ready-made visuals and reports that are shared with an entire organization.
App
Shows you the available sources of data in the Power BI service.
Canvas
Identify each unique, non-null data row.
Primary Keys
Reference rows in a different table
Foreign Keys
Contain observational or event data values: sales orders, product counts, prices, transactional dates and times, and quantities.
Fact Tables
Contain the details about the data in fact tables: products, locations, employees, and order types.
Dimension Tables
Edit the name and description of the column.
Model - General Tab
Add synonyms that can be used to identify the column when you are using the Q&A feature.
Model - General Tab
Add a column into a folder to further organize the table structure.
Model - General Tab
Hide or show the column.
Model - General Tab
Change the data type.
Model - Formatting Tab
Format the date.
Model - Formatting Tab
Sort by a specific column.
Model - Advanced Tab
Assign a specific category to the data.
Model - Advanced Tab
Summarize the data.
Model - Advanced Tab
Determine if the column or table contains null values.
Model - Advanced Tab
Source data tables are mature and ready for immediate use. Identify company holidays. Separate calendar and fiscal year. Identify weekends versus weekdays.
Source Data Date Table
Dates = CALENDAR(DATE(2011, 5, 31), DATE(2021, 5, 31))
DAX Date Table
The CALENDAR() function returns a contiguous range of dates based on a start and end date that are entered as arguments in the function.
DAX Date Table
The CALENDARAUTO() function returns a contiguous, complete range of dates that are automatically determined from your dataset.
DAX Date Table
MonthNum = MONTH(Dates[Date])
DAX Date Table
WeekNum = WEEKNUM(Dates[Date])
DAX Date Table
DayoftheWeek = FORMAT(Dates[Date].[Day], “DDDD”)
DAX Date Table
Use M-language, the development language that is used to build queries in Power Query.
Power Query Date Table
Form through natural segments in your data.
Hierarchies
The process of viewing multiple child levels based on a top-level parent.
Flatten the hierarchy
Create multiple columns in a table to show the hierarchical path of the parent to the child in the same record.
Flatten the hierarchy
Path() returns a text version of the hierarchical path which can be split into multiple columns and turned into a hierarchy
Flatten the hierarchy
Have multiple valid relationships with fact tables, meaning that the same dimension can be used to filter multiple columns or tables of data. As a result, you can filter data differently depending on what information you need to retrieve.
Role-playing dimensions
Requires complex DAX functions.
Role-playing dimensions
Detail represented within your data
Data Granularity
Describes a relationship in which you have many instances of a value in one column that are related to only one unique corresponding instance in another column.
Many-to-one or one-to-many cardinality
Describes the directionality between fact and dimension tables.
Many-to-one or one-to-many cardinality
Is the most common type of directionality and is the Power BI default when you are automatically creating relationships.
Many-to-one or one-to-many cardinality
Describes a relationship in which only one instance of a value is common between two tables.
One-to-one cardinality
Requires unique values in both tables.
One-to-one cardinality
Is not recommended because this relationship stores redundant information and suggests that the model is not designed correctly. It is better practice to combine the tables.
One-to-one cardinality
Describes a relationship where many values are in common between two tables.
Many-to-many cardinality
Does not require unique values in either table in a relationship.
Many-to-many cardinality
Is not recommended; a lack of unique values introduces ambiguity and your users might not know which column of values is referring to what.
Many-to-many cardinality
Only one table in a relationship can be used to filter the data. For instance, Table 1 can be filtered by Table 2, but Table 2 cannot be filtered by Table 1.
Single cross-filter direction
For a one-to-many or many-to-one relationship, the cross-filter direction will be from the “one” side, meaning that the filtering will occur in the table that has unique values.
Single cross-filter direction
One table in a relationship can be used to filter the other. For instance, a dimension table can be filtered through the fact table, and the fact tables can be filtered through the dimension table.
Bi-direction Cross Filter
You might have lower performance when using bi-directional cross-filtering with many-to-many relationships.
Bi-direction Cross Filter
One-to-one relationships filter option.
Bi-direction Cross Filter
Many-to-many relationships filter options
Bi-direction Cross Filter; Single cross-filter direction