Azure Data Lists Flashcards
1
Q
Data architectures
A
- Lambda architecture
- Kappa architecture
2
Q
Lambda architecture layers
A
Batch layer
Speed layer
Serving layer
3
Q
Data warehouse workload types
A
- Relational
- Non-relational
- Batch
- Streaming
4
Q
Main phases of a data stream flow
A
- Production
- Acquisition
- Aggregation and transformation
- Storage
5
Q
Time window aggregation types
A
- Tumbling window
- Hopping window
- Sliding window
- Session window
6
Q
Data stream concepts
A
- Watermarks
- Consumer groups
- Time window aggregations
7
Q
Batch processing scenarios
A
- Data set transformation and preparation
- ETL and ELT workloads
- Machine learning model training
- Applying machine learning models on data sets for scoring
- Report generation
8
Q
Azure batch Processing Services
A
- Azure Synapse Analytics
- Azure Data Lake Analytics
- Azure HDInsight
- Azure Databricks
9
Q
Batch processing tools
A
- Azure Synapse Analytics
- Azure Data Lake Analytics
- Azure HDInsight
- Azure Databricks
- Apache Hive
- Apache Pig
- Apache Spark
10
Q
Analytical data stores
A
- Azure Synapse Analytics
- Spark SQL
- HBase
- Apache Hive
11
Q
Five V’s of big data
A
- Volume
- Velocity
- Variety
- Veracity
- Value
12
Q
Analytics techniques
A
- Descriptive analysis
- Diagnostic analysis
- Predictive analysis
- Prescriptive analysis
13
Q
TDSP phases
A
- Business needs
- Data discovery and acquisition
- Model development
- Model deployment
14
Q
Common TDSP roles
A
- Subject matter expert
- Data engineer
- Data scientist
- Application developer
15
Q
MLOps best practices
A
- Exploratory data analysis (EDA)
- Data Prep and Feature Engineering
- Model training and tuning
- Model review and governance
- Model inference and serving
- Model deployment and monitoring
- Automated model retraining
16
Q
Azure Data Factory runtime types
A
- Azure
- Self-hosted
- SSIS (SQL Server Integration Services)
17
Q
Azure Data Factory transformation types
A
- External services
- Mapping data flows (uses Apache Spark code, run on Azure Databricks)
- Wrangling data flows (Power Query editor in Microsoft Power BI)
18
Q
Azure Data Factory external services for transformations
A
- Azure SQL Database
- Azure Synapse Analytics
- Azure Databricks
- Azure HDInsight
- Azure Functions
- SQL Server Integration Services (SSIS)
19
Q
Azure Stream Analytics features
A
- Provisioned or on-demand SQL Server pools
- Provisioned or on-demand Spark pools
- Stream processing capabalitiies through window aggregations
- ML models aggregation through the PREDICT statement
- Azure DevOps integration
- Data Factory-like pipelines development experience
- Power BI report editor integration
20
Q
Macro-layers for analytics
A
- Analytical access
- Reporting access
- Dashboarding access