SQL Flashcards

Question

What is an Alias in SQL?

Answer 1

An alias is a feature of SQL that is supported by most, if not all, RDBMSs. It is a temporary name assigned to the table or table column for the purpose of a particular SQL query. In addition, aliasing can be employed as an obfuscation technique to secure the real names of database fields. A table alias is also called a correlation name . An alias is represented explicitly by the AS keyword but in some cases the same can be performed without it as well. Nevertheless, using the AS keyword is always a good practice.

Answer 2

A view in SQL is a virtual table based on the result-set of an SQL statement. A view contains rows and columns, just like a real table. The fields in a view are fields from one or more real tables in the database.

Answer 3

Normalization represents the way of organizing structured data in the database efficiently. It includes creation of tables, establishing relationships between them, and defining rules for those relationships. Inconsistency and redundancy can be kept in check based on these rules, hence, adding flexibility to the database.

Answer 4

Denormalization is the inverse process of normalization, where the normalized schema is converted into a schema which has redundant information. The performance is improved by using redundancy and keeping the redundant data consistent. The reason for performing denormalization is the overheads produced in query processor by an over-normalized structure.

Answer 5

Normal Forms are used to eliminate or reduce redundancy in database tables. The different forms are as follows: First Normal Form A relation is in first normal form if every attribute in that relation is a single-valued attribute. If a relation contains composite or multi-valued attribute, it violates the first normal form. Second Normal Form A relation is in second normal form if it satisfies the conditions for first normal form and does not contain any partial dependency. A relation in 2NF has no partial dependency, i.e., it has no non-prime attribute that depends on any proper subset of any candidate key of the table. Often, specifying a single column Primary Key is the solution to the problem Third Normal Form A relation is said to be in the third normal form, if it satisfies the conditions for second normal form and there is no transitive dependency between the non-prime attributes, i.e.,all non-prime attributes are determined only by the candidate keys of the relation and not by any other non-prime attribute. Boyce-Codd Normal Form A relation is in Boyce-Codd Normal Form if satisfies the conditions for third normal form and for every functional dependency, Left-Hand-Side is super key. In other words, a relation in BCNF has non-trivial functional dependencies in the form X –> Y, such that X is always a super key

Answer 6

DELETE statement is used to delete rows from a table. TRUNCATE command is used to delete all the rows from the table and free the space containing the table. DROP command is used to remove an object from the database. If you drop a table, all the rows in the table is deleted and the table structure is removed from the database.

Answer 7

If a table is dropped, all things associated with the tables are dropped as well. This includes - the relationships defined on the table with other tables, the integrity checks and constraints, access privileges and other grants that the table has. To create and use the table again in its original form, all these relations, checks, constraints, privileges and relationships need to be redefined. However, if a table is truncated, none of the above problems exist and the table retains its original structure.

Answer 8

The TRUNCATE command is used to delete all the rows from the table and free the space containing the table. The DELETE command deletes only the rows from the table based on the condition given in the where clause or deletes all the rows from the table if no condition is specified. But it does not free the space containing the table.

Answer 9

An aggregate function performs operations on a collection of values to return a single scalar value. Aggregate functions are often used with the GROUP BY and HAVING clauses of the SELECT statement. Following are the widely used SQL aggregate functions: AVG() - Calculates the mean of a collection of values. COUNT() - Counts the total number of records in a specific table or view. MIN() - Calculates the minimum of a collection of values. MAX() - Calculates the maximum of a collection of values. SUM() - Calculates the sum of a collection of values. FIRST() - Fetches the first element in a collection of values. LAST() - Fetches the last element in a collection of values. Note: All aggregate functions described above ignore NULL values except for the COUNT function. A scalar function returns a single value based on the input value. Following are the widely used SQL scalar functions: LEN() - Calculates the total length of the given field (column). UCASE() - Converts a collection of string values to uppercase characters. LCASE() - Converts a collection of string values to lowercase characters. MID() - Extracts substrings from a collection of string values in a table. CONCAT() - Concatenates two or more strings. RAND() - Generates a random collection of numbers of given length. ROUND() - Calculates the round off integer value for a numeric field (or decimal point values). NOW() - Returns the current data & time.FOR MAT() - Sets the format to display a collection of values.

Answer 10

The user-defined functions in SQL are like functions in any other programming language that accept parameters, perform complex calculations, and return a value. They are written to use the logic repetitively whenever required. There are two types of SQL user-defined functions: Scalar Function: As explained earlier, user-defined scalar functions return a single scalar value. Table Valued Functions: User-defined table-valued functions return a table as output. Inline: returns a table data type based on a single SELECT statement. Multi-statement: returns a tabular result-set but, unlike inline, multiple SELECT statements can be used inside the function body.

Answer 11

OLTP stands for Online Transaction Processing, is a class of software applications capable of supporting transaction-oriented programs. An essential attribute of an OLTP system is its ability to maintain concurrency. To avoid single points of failure, OLTP systems are often decentralized. These systems are usually designed for a large number of users who conduct short transactions. Database queries are usually simple, require sub-second response times and return relatively few records.

Answer 12

OLTP stands for Online Transaction Processing, is a class of software applications capable of supporting transaction-oriented programs. An important attribute of an OLTP system is its ability to maintain concurrency. OLTP systems often follow a decentralized architecture to avoid single points of failure. These systems are generally designed for a large audience of end users who conduct short transactions. Queries involved in such databases are generally simple, need fast response times and return relatively few records. Number of transactions per second acts as an effective measure for such systems. OLAP stands for Online Analytical Processing, a class of software programs which are characterized by relatively low frequency of online transactions. Queries are often too complex and involve a bunch of aggregations. For OLAP systems, the effectiveness measure relies highly on response time. Such systems are widely used for data mining or maintaining aggregated, historical data, usually in multi-dimensional schemas.

Answer 13

Collation refers to a set of rules that determine how data is sorted and compared. Rules defining the correct character sequence are used to sort the character data. It incorporates options for specifying case-sensitivity, accent marks, kana character types and character width. Below are the different types of collation sensitivity: Case sensitivity: A and a are treated differently. Accent sensitivity: a and á are treated differently. Kana sensitivity: Japanese kana characters Hiragana and Katakana are treated differently. Width sensitivity: Same character represented in single-byte (half-width) and double-byte (full-width) are treated differently.

Answer 14

A stored procedure is a subroutine available to applications that access a relational database management system (RDBMS). Such procedures are stored in the database data dictionary. The sole disadvantage of stored procedure is that it can be executed nowhere except in the database and occupies more memory in the database server. It also provides a sense of security and functionality as users who can't access the data directly can be granted access via stored procedures.

Answer 15

A stored procedure which calls itself until a boundary condition is reached, is called a recursive stored procedure. This recursive function helps the programmers to deploy the same set of code several times as and when required. Some SQL programming languages limit the recursion depth to prevent an infinite loop of procedure calls from causing a stack overflow, which slows down the system and may lead to system crashes.

Answer 16

Creating empty tables with the same structure can be done smartly by fetching the records of one table into a new table using the INTO operator while fixing a WHERE clause to be false for all records. Hence, SQL prepares the new table with a duplicate structure to accept the fetched records but since no records get fetched due to the WHERE clause in action, nothing is inserted into the new table.

Answer 17

SQL pattern matching provides for pattern search in data if you have no clue as to what that word should be. This kind of SQL query uses wildcards to match a string pattern, rather than writing the exact word. The LIKE operator is used in conjunction with SQL Wildcards to fetch the required information. Using the % wildcard to perform a simple search The % wildcard matches zero or more characters of any type and can be used to define wildcards both before and after the pattern. Omitting the patterns using the NOT keyword Use the NOT keyword to select records that don't match the pattern. Matching a pattern anywhere using the % wildcard twice Search for a student in the database where he/she has a '%K%' in his/her name. Using the _ wildcard to match pattern at a specific position The _ wildcard matches exactly one character of any type. It can be used in conjunction with % wildcard. Matching patterns for specific length The _ wildcard plays an important role as a limitation when it matches exactly one character. It limits the length and position of the matched results

Answer 18

A window function performs a calculation across a set of table rows that are somehow related to the current row. This is comparable to the type of calculation that can be done with an aggregate function

Answer 19

The main advantage of using Window functions over regular aggregate functions is: Window functions do not cause rows to become grouped into a single output row, the rows retain their separate identities and an aggregated value will be added to each row.

Answer 20

Relative - LAG(column, n) returns column 's value at the row n rows before the current row - LEAD(column, n) returns column 's value at the row n rows after the current row Absolute - FIRST_VALUE(column) returns the rst value in the table or partition - LAST_VALUE(column) returns the last value in the table or partition

Answer 21

- PARTITION BY splits the table into partitions based on a column's unique values - The results aren't rolled into one column - Operated on separately by the window function - ROW_NUMBER will reset for each partition - LAG will only fetch a row's previous value if its previous row is in the same partition

Answer 22

1. ROW_NUMBER() always assigns unique numbers, even if two rows' values are the same 2. RANK() assigns the same number to rows with identical values, skipping over the next numbers insuch cases 3. DENSE_RANK() also assigns the same number to rows with identical values, but doesn't skip overthe next numbers

Answer 23

-Paging: Splitting data into (approximately) equal chunks - Uses - Many APIs return data in "pages"to reduce data being sent - Separating data into quartiles or thirds (top middle 33%, and bottom thirds) to judgeperformance Enter NTILE -NTILE(n) splits the data into n approximately equal pages

Answer 24

The definition of a window used with a window function can include a frame clause. A frame is a subset of the current partition and the frame clause specifies how to define the subset. ROWS BETWEEN -ROWS BETWEEN [START] AND [FINISH] --n PRECEDING : n rows before the current row --CURRENT ROW :the current row --n FOLLOWING : n rows after the current row Examples -ROWS BETWEEN 3 PRECEDING AND CURRENT ROW -ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING -ROWS BETWEEN 5 PRECEDING AND 1 PRECEDING RANGE BETWEEN - RANGE BETWEEN [START] AND [FINISH] - -Functions much the same as ROWS BETWEEN - -RANGE treats duplicates in OVER 's ORDER BY subclause as a single entity ROWS BETWEEN is almost always used over RANGE BETWEEN

Answer 25

Moving average (MA): Average of last n periods Example: 10-day MA of units sold in sales is the average of the last 10 days' sold units - Used to indicate momentum/trends - Also useful in eliminating seasonality

Answer 26

Moving total: Sum of last n periods Example: Sum of the last 3 Olympic games' medals -Used to indicate performance; if the sum is going down, overall performance is going down

Answer 27

Transforms a table by making columns out of the unique values of one of its columns. Easier to scan, especially if pivoted by a chronologically ordered column

Answer 28

-ROLLUP is a GROUP BY subclause that includes extra rows for group-level aggregations -GROUP BY Country, ROLLUP(Medal) will count all Country - and Medal -level totals,then count only Country -level totals and ll in Medal with null s for these rows - ROLLUP is hierarchical, de-aggregating from the leftmost provided column to the right-most - -ROLLUP(Country, Medal) includes Country -leveltotals - -ROLLUP(Medal, Country) includes Medal -leveltotals - -Both include grand totals Use ROLLUP when you have hierarchical data (e.g., date parts) and don't want all possible group-level aggregations

Answer 29

- CUBE is a non-hierarchical ROLLUP - It generates all possible group-level aggregations - -CUBE(Country, Medal) counts Country -level, Medal -level, and grand totals Use CUBE when you want all possible group-level aggregations

Answer 30

- COALESCE() takes a list of values and returns the first non- null value, going from left to right - COALESCE(null, null, 1, null, 2) ? 1 - Useful when using SQL operations that return null s - -ROLLUP and CUBE - -Pivoting - -LAG and LEAD

Answer 31

STRING_AGG(column, separator) takes all the values of a column and concatenates them, with separator in between each value It is useful when you want to reduce the number of rows that are returned.

Answer 32

Text data types --CHAR , VARCHAR and TEXT Numeric data types --INT and DECIMAL Date / time data types --DATE , TIME , TIMESTAMP , INTERVAL Arrays

SQL Flashcards

(56 cards)