Instructor Created Flashcards

1
Q

What is the purpose of affinity grouping?

A

To evaluate relationships or associations between data elements that demonstrate some kind of affinity between objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is estimation defined in data analysis?

A

A process of assigning some continuously valued numeric value to an object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a key benefit of the estimation process?

A

Results can be ranked by score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does classification involve?

A

Organizing data into predefined classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the goal of the classification process?

A

To build a model that can accurately classify new records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is clustering in data mining?

A

The task of dividing a large collection of entities into smaller groups based on similarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the main process in data mining?

A

Assemble information, prepare it for mining, apply algorithms, and analyze results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does data mining rely on?

A

Using one set of data for training and another for testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is data type conversion?

A

Parsing strings representing values and transforming them into the proper form for the target machine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is data cleansing?

A

Correcting known data errors and automating corrections

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of integration in data processing?

A

To represent linkage between different tables and maintain metadata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does referential integrity checking ensure?

A

That referential integrity constraints are not violated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are derivations in data processing?

A

Transformations based on business rules applied during data movement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between denormalization and renormalization?

A

Denormalization breaks data into a simpler form, while renormalization restores a structured form

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is aggregation in data processing?

A

Populating summaries or cube dimensions in the staging area

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the purpose of audit information?

A

To provide a reference for integrity checking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is null conversion?

A

Transforming different forms of nulls from disparate systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the two key questions in the extraction phase of ETL?

A
  1. What data should be extracted?
  2. How should that data be extracted?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a data mart?

A

A subject-oriented data repository for decision support and BI needs of a specific department

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the ETL process stand for?

A

Extract, Transform, Load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the main steps in the ETL process?

A
  • Get data from the source location
  • Map data into a suitable model
  • Validate and clean data
  • Apply transformations
  • Move data to the repository
  • Load data into the warehouse
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the financial value associated with increased profitability?

A

Derived from lowered costs or increased revenues

Financial value is crucial for evaluating business performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What does productivity value refer to?

A

Increased throughput with decreased workloads and high-quality outcomes

It emphasizes efficiency in processes like manufacturing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is trust value in a business context?

A

Greater customer, employee, or supplier satisfaction and confidence in forecasting

Trust value also includes better management reports and decision-making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What does risk value encompass?

A

Improved visibility into credit exposure and confidence in capital investments

It also involves maintaining auditable records.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Define a data warehouse.

A

A centralized repository of information

It is essential for data analysis and reporting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

How is a data warehouse organized?

A

Around relevant subject areas important to the organization

This organization supports targeted data analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the primary use of a data warehouse?

A

For analysis and not for transaction processing

This distinction is critical for understanding its role in data management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What characteristic does the data in a data warehouse have?

A

Nonvolatile

Nonvolatility means that data remains stable and is not frequently changed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is the target location for integrating data from multiple sources?

A

A data warehouse

It consolidates data from both internal and external sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Fill in the blank: A data warehouse provides a platform for different _______ to submit queries about enterprise information.

A

[consumers]

Consumers can be both human and automated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the purpose of MySQL Workbench?

A

A visual database design and management tool for MySQL databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What are the features of MySQL Workbench?

A

Provides a user-friendly interface for:

  1. database modeling
  2. SQL development
  3. server configuration
  4. user administration
  5. backupand data recovery
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

In which use case is MySQL Workbench ideal?

A

Managing MySQL databases graphically, creating visual representations of database schemas, and managing complex SQL statements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is the purpose of the MySQL Command-Line Client?

A

A command-line interface for interacting with MySQL databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What are the features of the MySQL Command-Line Client?

A

Allows users to execute SQL queries, manage databases, and perform administrative tasks directly through the terminal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Who is the target user for MySQL Command-Line Client?

A

Users who prefer command-line operations or are working on servers without a graphical interface.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is SQL Server Management Studio (SSMS) used for?

A

A management tool for Microsoft SQL Server databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What are the features of SQL Server Management Studio (SSMS)?

A

Offers a graphical interface for SQL Server database development, management, and administration, including querying and performance monitoring.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is phpMyAdmin?

A

A web-based application to manage MySQL databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What are the features of phpMyAdmin?

A

Offers a browser interface for executing SQL queries, managing tables, and handling database administration tasks.

42
Q

What is the key difference in database compatibility between MySQL Workbench, phpMyAdmin, and SQL Server Management Studio?

A
  1. MySQL Workbench and phpMyAdmin are specifically for MySQL databases
  2. SQL Server Management Studio is exclusively for Microsoft SQL Server.
43
Q

What is a primary key?

A

A unique identifier for each record in a table.

44
Q

What are the characteristics of a primary key?

A
  • Cannot contain null values
  • Must contain unique values
45
Q

What type of relationship does a primary key establish between tables?

A

Establishes a one-to-one or one-to-many relationship with foreign keys in other tables.

46
Q

What is a foreign key?

A

A column or a set of columns in one table that references the primary key in another table.

47
Q

What are the characteristics of a foreign key?

A
  • Can contain null values
  • Can have duplicate values
48
Q

What type of relationship does a foreign key establish between tables?

A

Establishes a many-to-one relationship with the table containing the primary key.

49
Q

What is a composite key?

A

A combination of two or more columns in a table that together uniquely identify a record.

50
Q

What are the characteristics of a composite key?

A
  • May allow null values in some columns
  • Always consists of multiple columns
51
Q

What type of relationship does a composite key establish between tables?

A

Establishes relationships where each part of the composite key must be considered in conjunction with others.

52
Q

What is a candidate key?

A

A column or set of columns that can qualify as a potential primary key for a table.

53
Q

What are the characteristics of a candidate key?

A
  • Can contain null values
  • Must contain unique values
54
Q

What SQL statement is used to order the result set?

A

ORDER BY

55
Q

What is the purpose of the WHERE clause in SQL?

A

Filters records based on a specified condition.

56
Q

What SQL statement is used to group rows that have the same values in specified columns?

A

GROUP BY

57
Q

What is the purpose of the LIMIT clause in SQL?

A

Restricts the number of rows returned by the query.

58
Q

What does the IN operator do in SQL?

A

Checks if a column’s value exists within a list of specified values.

59
Q

What does the BETWEEN operator do in SQL?

A

Checks if a value falls within a specified inclusive range.

60
Q

What is the purpose of the EXISTS operator in SQL?

A

Evaluates whether a subquery returns any rows.

61
Q

What does the ALL operator do in SQL?

A

Checks if all values in a subquery meet a specific condition.

62
Q

What is the syntax of the UPDATE statement in SQL?

A

UPDATE table_name SET column1 = value1, column2 = value2, ... WHERE condition;

63
Q

What is the function of the COMMIT command in SQL?

A

Saves all changes made in the current transaction to the database permanently.

64
Q

What does the ROLLBACK command do?

A

Undoes all changes made in the current transaction.

65
Q

What is the function of the SAVEPOINT command?

A

Creates a savepoint within a transaction allowing for partial rollbacks.

66
Q

What is the UNDO command in SQL?

A

Not a standard SQL command; behavior is generally covered by the ROLLBACK command.

67
Q

What is the purpose of the BETWEEN operator?

A

Filters results within a specific range (inclusive).

68
Q

What is the LIKE operator used for in SQL?

A

Used for pattern matching in SQL.

69
Q

What is the function of the BETWEEN operator in SQL?

A

The BETWEEN operator is used to filter results within a specific range (inclusive).

It checks if a column’s value is within a specified lower and upper limit.

70
Q

What types of values can the BETWEEN operator be used with?

A

Numeric, date, or text values.

71
Q

Provide an example of a query using the BETWEEN operator.

A

SELECT * FROM products WHERE price BETWEEN 50 AND 100;

72
Q

What is the function of the LIKE operator in SQL?

A

The LIKE operator is used for pattern matching in SQL.

73
Q

What wildcard character represents zero or more characters in the LIKE operator?

A

%

74
Q

What wildcard character represents a single character in the LIKE operator?

A

_

75
Q

Provide an example of a query using the LIKE operator.

A

SELECT * FROM employees WHERE last_name LIKE ‘Smi%’;

76
Q

What is the function of the IN operator in SQL?

A

The IN operator is used to specify multiple values in a WHERE clause.

77
Q

What types of values can the IN operator be used with?

A

Numeric, character, or date values.

78
Q

Provide an example of a query using the IN operator.

A

SELECT * FROM customers WHERE country IN (‘USA’, ‘Canada’, ‘Mexico’);

79
Q

What is the function of the EXISTS operator in SQL?

A

The EXISTS operator is used to check for the existence of any records in a subquery.

80
Q

When is the EXISTS operator often used?

A

Frequently used with subqueries to filter records based on conditions in related tables.

81
Q

Provide an example of a query using the EXISTS operator.

A

SELECT * FROM products p WHERE EXISTS (SELECT * FROM orders o WHERE o.product_id = p.product_id);

82
Q

What does the EXISTS operator return if the subquery returns one or more records?

A

True

83
Q

What does the BETWEEN operator do in SQL?

A

Filters data within a specified inclusive range.

84
Q

What operators can achieve the same effect as the BETWEEN operator?

A

> = and <= (Greater Than or Equal To, Less Than or Equal To)

85
Q

Provide an example of using >= and <= instead of BETWEEN.

A

SELECT * FROM employees WHERE salary >= 50000 AND salary <= 100000;

86
Q

What does the > and < operators do in relation to the BETWEEN operator?

A

Similar to BETWEEN, but does not include the boundaries.

87
Q

Provide an example of using > and < instead of BETWEEN.

A

SELECT * FROM employees WHERE salary > 50000 AND salary < 100000;

88
Q

When should you use the BETWEEN operator?

A

When you want to specify a clear range with inclusive boundaries.

89
Q

What is the role of the GROUP BY keyword in SQL?

A

Used to group rows that have the same values in specified columns into aggregate data.

90
Q

Which SQL functions are commonly used with GROUP BY?

A

COUNT, SUM, AVG

91
Q

Provide an example of a query using GROUP BY.

A

SELECT department, COUNT(*) FROM employees GROUP BY department;

92
Q

What is the function of the ORDER BY keyword in SQL?

A

Used to sort the result set of a query by one or more columns.

93
Q

What options can you specify when using ORDER BY?

A

Ascending (ASC) or descending (DESC) order.

94
Q

Provide an example of a query using ORDER BY.

A

SELECT name, salary FROM employees ORDER BY salary DESC;

95
Q

Is SORT BY a defined keyword in standard SQL?

A

No

96
Q

What should you use instead of SORT BY in standard SQL?

A

ORDER BY

97
Q

What is FILTER BY used for in SQL?

A

Typically used in contexts involving window functions.

98
Q

Can FILTER BY be used as a standalone keyword in standard SQL?

A

No

99
Q

Provide an example of a query using FILTER BY.

A

SELECT department, COUNT(*) FILTER (WHERE salary > 50000) FROM employees GROUP BY department;

100
Q

What is the key difference between GROUP BY and ORDER BY?

A

GROUP BY aggregates data based on one or more columns, while ORDER BY sorts the results of the query.