Module 2dc - Exploring Azure Core Products - Data Analytics Flashcards
What is Azure Synapse Analytics?
Azure’s Big Data service that brings together data warehousing and data analytics
What are four (4) features of Azure Synapse Analytics?
- Query data through Serverless or Provisioned Resources
- Query data at scale
- Has it’s own dashboard like IoT and ML
- Supports BI and ML data needs
What is Azure HDInsight?
A fully managed, open-source analytics service. It’s the ORIGINAL and oldest Azure Big Data integration service
Name a few Big Data scenarios/high level use cases that Azure HDInsight supports
- ETL
- Data Warehousing
- ML
- IoT
What Big Data Analytics frameworks does Azure HDInsight support?
Supports most all popular open-source frameworks for Big Data Analytics:
- Apache Spark
- Hadoop
- Kafka
- HBase
- Storm
- Microsoft’s Machine Learning Services
What is Azure Databricks?
Databricks is a 3rd party that Microsoft purchased and added to Azure!
What technology does Azure Databricks support for project collaboration?
Support for setting up Apache Spark environments for collaborating on shared projects
What does Azure Databricks support w.r.t. Machine Learning Platforms?
TensorFlow, PyTorch and scikit-learn
What language support comes with Azure Databricks?
Python, Scala, R, Java and SQL
What is Azure Data Lake Analytics?
An on-demand analytics service for simplifying big data.
Allows you to write queries to transform data and extract insights
Azure Data Lake Analytics is a pay-as-you-go Service. You only pay when your job runs (T/F)?
True
Azure Data Lake Analytics limits scale based on power settings. User sets power and adjusts accordingly (T/F)?
False. Though it’s based on a power setting, the scaling is actually unlimited. Power controls the degree of scale.
Azure Data Lake Analytics requires some initial VM hardware configurations for setting up internal Scale Sets as well as Update Sets (T/F)?
False. No deploying, configuring or tuning hardware. All taken care by Azure!