Model, query, and explore data in Azure Synapse Flashcards

1
Q

function used to read the data that are stored in files in a data lake

A

OPENROWSET

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

difference between dedicated SQL pool and serverless SQL pool

A

serverless used to perform SQL queries on files; dedicated defines a relational database in which data can be stored and queried

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

external database object that encapsulates the connection info to a file location in a data lake store

A

DATA SOURCE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how to analyze parquet file using Spark

A

load the parquet file directly into a Spark dataframe (no need to first load data into a serverless SQL pool)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

HASH vs ROUND_ROBIN

A

HASH provides good read performance for a large table by distributing records across compute nodes based on the hash key

ROUND_ROBIN distributes data evenly, but does not optimize queries on commonly used distribution key fields

How well did you know this?
1
Not at all
2
3
4
5
Perfectly