Analytics Flashcards
In analytics, there are 4 types of analysis you can perform. What are their names?
-Descriptive analytics
-Diagnostic analytics
-Predictive analytics
-Prescriptive analytics
What are Descriptive Analytics?
Descriptive analytics focuses on analyzing past and present data to determine what has happened and what is happening now.
What are Diagnostic Analytics?
Diagnostic analytics focuses on analyzing data to determine why something happened.
What are Predictive Analytics?
Predictive analytics focuses on using existing data to determine what might happen in the future.
What are Prescriptive Analytics?
Prescriptive analytics is similar to predictive analytics, but instead of only predicting what might happen, it also suggests actions to take and describes their likely consequences.
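The four types above can be told apart with a small sketch. This is only an illustration on made-up monthly sales figures; the function names, the promotion flag, and the naive forecast rule are all assumptions chosen for the example, not a standard method.

```python
from statistics import mean

# Hypothetical monthly sales and a flag for months with a promotion running.
sales = [100, 110, 120, 90, 140, 150]
promo = [False, False, False, False, True, True]

def describe(values):
    """Descriptive: summarize what happened."""
    return {"mean": mean(values), "min": min(values), "max": max(values)}

def diagnose(values, flags):
    """Diagnostic: look for a reason by comparing flagged and unflagged months."""
    flagged = [v for v, f in zip(values, flags) if f]
    unflagged = [v for v, f in zip(values, flags) if not f]
    return mean(flagged) - mean(unflagged)

def predict_next(values):
    """Predictive: naive forecast, last value plus the average monthly change."""
    changes = [b - a for a, b in zip(values, values[1:])]
    return values[-1] + mean(changes)

def prescribe(forecast, target):
    """Prescriptive: turn the forecast into a suggested action."""
    return "run promotion" if forecast < target else "keep current plan"

summary = describe(sales)
promo_lift = diagnose(sales, promo)
forecast = predict_next(sales)
action = prescribe(forecast, target=170)
```

Each function answers a different question: what happened, why, what might happen next, and what to do about it.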
Explain Amazon CodeWhisperer
Amazon CodeWhisperer is an AWS AI service that generates and comments code using LLM technology.
True or False: Amazon CodeWhisperer can detect security vulnerabilities in your code
True
What are the 5 big Vs of big data and what do they mean?
-Volume: The amount of data being ingested
-Variety: The number and types of data sources
-Velocity: The speed with which new data is processed and stored
-Veracity: The degree to which the data can be trusted
-Value: The amount of information that can be extracted from the data
What are the 3 most frequent characterizations of data based on their format?
-Structured Data
-Semi-structured Data
-Unstructured Data
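As a rough illustration of the three formats, the sketch below parses one made-up sample of each using only the Python standard library. The sample values are invented; the point is that structured data has a fixed schema, semi-structured data carries its own flexible structure, and unstructured data has no schema at all.

```python
import csv
import io
import json

# Structured: fixed columns, every row follows the same schema (CSV here).
structured = "id,name,amount\n1,Ana,10.5\n2,Bo,7.0\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing, nested, fields may vary per record (JSON here).
semi_structured = '{"id": 1, "name": "Ana", "tags": ["vip"], "address": {"city": "Lisbon"}}'
record = json.loads(semi_structured)

# Unstructured: free text with no schema; only generic operations apply directly.
unstructured = "Ana called on Monday to complain about a late delivery."
word_count = len(unstructured.split())
```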
What are the 4 data processing velocities?
-Scheduled
-Periodic
-Near real-time
-Real-time
What is AWS Lake Formation?
A service that simplifies ingesting, cleaning, cataloging, transforming, and securing data on S3 Data Lakes.
AWS’ main data warehousing solution is called Amazon __________
Redshift
Complete the following statement regarding ETL on AWS:
When looking at a standard, simplified ETL pipeline on AWS, one should use ________. For customized processes, however, one should use __________.
-Glue
-EMR
What are the 4 main functions of a Data Lake?
-Ingest and store data
-Catalog data for searches
-Secure and protect data
-Allow analytics and insights to be run
What are the 6 stages of an analytics pipeline?
-Data Source
-Ingestion
-Data Store
-Cataloging and processing stage
-Search and analytics stage
-Visualization stage
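The six stages above can be sketched as a chain of small functions. This is a toy in-memory model with invented event data and hypothetical function names; in a real AWS pipeline each stage would be a managed service rather than a Python function.

```python
from collections import Counter

def data_source():
    # Stage 1: raw events as they arrive (made-up sample data).
    return ["click,home", "click,cart", "view,home", "click,home"]

def ingest(raw_events):
    # Stage 2: bring the raw events into the pipeline as split records.
    return [line.split(",") for line in raw_events]

def store(records):
    # Stage 3: persist the records (a dict stands in for the data store).
    return {"events": records}

def catalog_and_process(data_store):
    # Stage 4: record the schema, then shape the data for querying.
    schema = {"events": ["action", "page"]}
    processed = [dict(zip(schema["events"], r)) for r in data_store["events"]]
    return schema, processed

def search_and_analyze(processed):
    # Stage 5: answer a question, here clicks per page.
    return Counter(e["page"] for e in processed if e["action"] == "click")

def visualize(counts):
    # Stage 6: present the result as a plain-text bar chart.
    return "\n".join(f"{page:<5} {'#' * n}" for page, n in sorted(counts.items()))

def run_pipeline():
    _schema, processed = catalog_and_process(store(ingest(data_source())))
    return visualize(search_and_analyze(processed))

report = run_pipeline()
```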
What are the 3 main challenges in maintaining a data lake?
-Data governance
-Data quality
-Security
What are AWS Lake Formation’s 4 main features?
-Automate building the data lake environment (collecting, moving, cleansing data, etc)
-Store metadata from raw and processed datasets
-Orchestrate ETL jobs, crawlers and triggers using AWS Glue
-Centralize access control to the data lake
The Lake Formation security model consists of 3 security roles used to manage the lake. What are they?
-Lake Formation Administrator
-Database Creator
-Table Creator
What permissions does the Lake Formation administrator have?
-Has full read access to resources
-Has data location permissions
-Can grant or revoke access to resources, including for themselves
-Can create databases
-Can grant permission to create databases
What permissions does the Lake Formation Database Creator have?
-Has all database permissions on databases that they create
-Has permissions on tables that they create
-Can use console or API to designate database creators
What permissions does the Lake Formation Table Creator have?
-Has permissions on tables that they create
-Can grant permissions on tables that they create
-Can view databases containing the tables that they create
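As a sketch of what granting permissions on a table can look like in practice, the example below builds the request for the Lake Formation `GrantPermissions` API as exposed by boto3 (`grant_permissions`). The account ID, role name, database, and table are hypothetical placeholders, and the helper names `build_table_grant` and `grant` are made up for this example. The request is only built here; sending it assumes boto3 is installed and AWS credentials are configured.

```python
def build_table_grant(principal_arn, database, table, permissions, grantable=()):
    """Build keyword arguments for a table-level grant_permissions call."""
    return {
        "Principal": {"DataLakePrincipalIdentifier": principal_arn},
        "Resource": {"Table": {"DatabaseName": database, "Name": table}},
        "Permissions": list(permissions),
        # Permissions the principal may pass on to others.
        "PermissionsWithGrantOption": list(grantable),
    }

def grant(request):
    """Send the request to Lake Formation (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builder works without boto3 installed
    client = boto3.client("lakeformation")
    return client.grant_permissions(**request)

# Hypothetical principal and resource names, for illustration only.
request = build_table_grant(
    principal_arn="arn:aws:iam::111122223333:role/AnalystRole",
    database="sales_db",
    table="orders",
    permissions=["SELECT"],
)
# grant(request)  # uncomment to run against a real AWS account
```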
The AWS service for data mesh is called Amazon __________
DataZone