CHAPTER 6: Building on Subqueries, Common Table Expressions, and Unions Flashcards

Question

Q: What must match between the queries in a UNION statement?

Answer 1

A: Each query must return the same number of columns, and the columns must have compatible data types.

Answer 2

SELECT col1, col2 FROM table1 UNION SELECT col1, col2 FROM table2;

Answer 3

A: UNION ALL avoids the overhead of removing duplicates, which improves performance, especially with large datasets.

Answer 4

A: The ORDER BY clause must be placed at the end of the entire UNION query, not after individual queries. Example: SELECT col1 FROM table1 UNION SELECT col1 FROM table2 ORDER BY col1;

Answer 5

A: The column names and data types are determined by the first query in the UNION.

Answer 6

A: SQL Server will throw an error if the column counts differ or if incompatible data types cannot be implicitly converted.

Answer 7

SELECT CustomerID FROM Sales.Customers UNION SELECT CustomerID FROM Sales.ArchivedCustomers;

Answer 8

SELECT ProductID, Name FROM Products_Current UNION ALL SELECT ProductID, Name FROM Products_Discontinued;

Answer 9

A: SQL Server will throw an error indicating a mismatch in the number of columns.

Answer 10

A: SQL Server follows precedence rules, and incompatible types (e.g., mixing strings and integers) can cause errors or unexpected implicit conversions.

Answer 11

A: Combining production and archived data or displaying data from different divisions of a company as a unified result.

Answer 12

SELECT CustomerID, OrderDate FROM Orders WHERE Year(OrderDate) = 2023 UNION SELECT CustomerID, OrderDate FROM Orders WHERE Year(OrderDate) = 2022;

Answer 13

A: EXCEPT returns rows from the first query that are not present in the second query, removing duplicates by default.

Answer 14

SELECT BusinessEntityID FROM HumanResources.Employee EXCEPT SELECT BusinessEntityID FROM Person.Person;

Answer 15

A: EXCEPT removes duplicates from the results unless explicitly overridden by using additional query constructs.

Answer 16

A: Use EXCEPT when you need to find rows in one query that do not exist in another query, often for identifying mismatches or gaps.

Answer 17

A: Both queries must return the same number of columns, and their data types must be compatible.

Answer 18

A: INTERSECT returns rows that exist in both queries, removing duplicates by default.

Answer 19

SELECT BusinessEntityID FROM HumanResources.Employee INTERSECT SELECT BusinessEntityID FROM Person.Person;

Answer 20

A: INTERSECT automatically removes duplicate rows, ensuring only unique common rows are returned.

Answer 21

A: Use INTERSECT when you want to find common rows between two queries, such as overlapping records between two datasets.

Answer 22

A: The columns in both queries must match in number and have compatible data types.

Answer 23

A: EXCEPT returns rows from the first query that are not in the second query, while INTERSECT returns rows that are present in both queries.

Answer 24

A: Yes, but the ORDER BY clause must be placed after the entire EXCEPT or INTERSECT operation, not within the individual queries.

Answer 25

A: EXCEPT can identify rows in one table that are missing in another, such as detecting missing foreign key references.

Answer 26

A: INTERSECT can be used to find rows that are common across datasets, ensuring data alignment between two sources.

Answer 27

A: A derived table is a subquery that appears in the FROM clause of a query and is treated like a temporary table.

Answer 28

A: Derived tables isolate part of a query's logic, making the query modular and easier to understand and manage.

Answer 29

SELECT FROM (SELECT FROM ) AS ;

Q: Must a derived table be aliased?

A: Yes, a derived table always requires an alias to reference its columns in the outer query.

Q: Can a derived table contain multiple tables or a WHERE clause?

A: Yes, a derived table can include joins, multiple tables, and a WHERE clause.

Q: Can a derived table include an ORDER BY clause?

A: Only if the TOP keyword is used.

Q: Can a derived table reference columns not included in its SELECT list?

A: No, all columns needed for joins or output in the outer query must be explicitly included in the derived table's SELECT list.

Q: Provide an example of a derived table in an INNER JOIN.

SELECT c.CustomerID, s.SalesOrderID FROM Sales.Customer AS c INNER JOIN ( SELECT SalesOrderID, CustomerID FROM Sales.SalesOrderHeader ) AS s ON c.CustomerID = s.CustomerID;

Q: How can derived tables be combined with joins?

A: Derived tables can be used with any type of join, such as INNER JOIN, LEFT JOIN, CROSS JOIN, or FULL JOIN.

Q: Can derived tables reference a CTE?

A: Yes, a derived table can reference a CTE defined earlier in the query.

Q: Can a derived table contain another derived table?

A: Yes, derived tables can be nested within other derived tables for complex queries.

Q: What is one limitation of derived tables compared to other T-SQL techniques?

A: Derived tables cannot define their own CTEs within their scope.

Q: When is it beneficial to use a derived table?

A: Use derived tables to encapsulate logic, simplify joins, or perform intermediate calculations.

Q: What happens if a required column is omitted from the derived table's SELECT list?

A: The query will fail because the outer query cannot access columns that are not explicitly selected in the derived table.

Q: What is a Common Table Expression (CTE)?

A: A CTE is a temporary result set defined within a query that exists only for the duration of the query.

Q: How does a CTE differ from a derived table?

A: Unlike a derived table, a CTE is defined separately using the WITH keyword and can be referenced multiple times in the main query.

Q: What happens to a CTE after the main query runs?

A: A CTE goes out of scope and cannot be reused once the query has completed execution.

Q: What is the basic syntax for defining a CTE?

WITH AS (SELECT FROM

) SELECT FROM ;

Q: How do you define column aliases in a CTE?

A: Specify the aliases after the CTE name: WITH CTE_Name ([Column1], [Column2]) AS ( SELECT Column1, Column2 FROM Table ) SELECT * FROM CTE_Name;

Q: What is the required placement of the WITH keyword in a query?

A: The WITH keyword must be the first statement in the batch or be preceded by a semicolon if other statements appear before it.

Q: Can a CTE include joins, WHERE clauses, or expressions?

A: Yes, a CTE can include joins, filters, expressions, and even reference other tables or views.

Q: Provide an example of a CTE used for an INNER JOIN.

WITH Orders AS ( SELECT SalesOrderID, CustomerID, TotalDue + Freight AS Total FROM Sales.SalesOrderHeader ) SELECT c.CustomerID, o.SalesOrderID, o.Total FROM Sales.Customer AS c INNER JOIN Orders AS o ON c.CustomerID = o.CustomerID;

Q: What happens if a column alias is not defined in a CTE?

A: The column name from the CTE query is used, but defining aliases upfront can improve readability, especially for expressions.

Q: Can a CTE be reused across multiple queries?

A: No, a CTE is valid only for the query in which it is defined.

Q: Can a CTE reference another CTE?

A: Yes, one CTE can reference another within the same WITH clause.

Q: Is it possible to use ORDER BY in a CTE?

A: Yes, but only when the TOP keyword is used.

Q: How do you ensure the CTE is correctly referenced in complex queries?

A: Always use descriptive aliases and verify that required columns are included in the CTE's SELECT list.

Q: What advanced capabilities make CTEs superior to derived tables in some scenarios?

A: CTEs can be recursive, allowing them to solve hierarchical and iterative problems.

Q: What is one major use case for CTEs in advanced queries?

A: CTEs are commonly used for breaking down complex logic into manageable parts or simplifying nested queries.

Q: What is the main difference between UNION and UNION ALL?

A: UNION removes duplicate rows, while UNION ALL includes all rows, including duplicates.

Q: Why does UNION perform worse than UNION ALL?

A: UNION uses additional resources to eliminate duplicates, often employing a Hash Match operation to aggregate results.

Q: When should you use UNION ALL instead of UNION?

A: Use UNION ALL if you are certain there are no duplicates or duplicates are acceptable in the results.

Q: How does SQL Server process duplicate elimination for UNION?

A: It uses the Hash Match Aggregate operator, which can be resource-intensive for large datasets.

Q: Compare the estimated costs for UNION and UNION ALL from the example.

UNION query cost: 2.62912 UNION ALL query cost: 0.806502 UNION took ~77% of the resources in the batch.

Q: How can you view the performance difference between UNION and UNION ALL?

A: Enable Include Actual Execution Plan in SQL Server Management Studio or use Explain in Azure Data Studio.

Q: Why doesn’t wrapping a UNION ALL in a DISTINCT query improve performance?

A: The optimizer processes it like a UNION query internally, performing the same duplicate elimination steps.

Q: When should you use DISTINCT with UNION ALL?

A: Use DISTINCT in individual queries if duplicates exist within the individual datasets but not across them.

Q: Can using UNION ALL in a CTE or derived table help optimize performance?

A: No, unless the underlying queries ensure no duplicates. Otherwise, using DISTINCT or UNION negates performance gains.

Q: What is a best practice for deciding between UNION and UNION ALL?

A: Always prefer UNION ALL if you don’t need to eliminate duplicates, as it improves performance by avoiding unnecessary operations.

Q: What is a common operation used by SQL Server to remove duplicates in a UNION query?

A: The Hash Match Aggregate operation.

Q: How can you combine improved performance and duplicate elimination if necessary?

A: Use DISTINCT within individual queries and combine them with UNION ALL.

Answer 30