Normalization Flashcards

Data cleaning, First Normal Form, Second Normal Form, Third Normal Form

1
Q

data cleaning

A

fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or incomplete data within a dataset
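A minimal sketch of what this can look like in practice, assuming records arrive as dictionaries with hypothetical `name` and `email` fields (neither field name comes from the card):

```python
def clean_rows(rows):
    """Trim and normalize fields, drop incomplete rows, and remove duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        # Fix formatting: trim whitespace and lowercase the email.
        name = (row.get("name") or "").strip()
        email = (row.get("email") or "").strip().lower()
        # Remove incomplete records.
        if not name or not email:
            continue
        # Remove duplicates (same name and email after normalization).
        key = (name, email)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({"name": name, "email": email})
    return cleaned


raw = [
    {"name": " Ada ", "email": "ADA@example.com"},
    {"name": "Ada", "email": "ada@example.com"},
    {"name": "Bob", "email": None},
]
cleaned = clean_rows(raw)
```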

2
Q

First Normal Form

A

Table format, no repeating groups, and primary key (PK) identified
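As an illustration, the sketch below flattens a repeating group into one row per value. The student/phone data and the choice of `(student_id, phone)` as primary key are assumptions made for the example.

```python
# Unnormalized: the "phones" field holds a repeating group.
unnormalized = [
    {"student_id": 1, "name": "Ada", "phones": ["555-0100", "555-0101"]},
    {"student_id": 2, "name": "Bob", "phones": ["555-0200"]},
]


def to_1nf(rows):
    """Emit one row per phone so every field holds a single value."""
    flat = []
    for row in rows:
        for phone in row["phones"]:
            flat.append(
                {"student_id": row["student_id"], "name": row["name"], "phone": phone}
            )
    return flat


first_nf = to_1nf(unnormalized)

# Assumed primary key: (student_id, phone). It must be unique per row.
keys = [(r["student_id"], r["phone"]) for r in first_nf]
```

The result is in 1NF only: `name` still depends on `student_id` alone, which is only part of the assumed key.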

3
Q

Second Normal Form

A

1NF and no partial dependencies
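A hedged example, assuming an enrollment table keyed by `(student_id, course_id)` in which `student_name` depends on `student_id` alone (a partial dependency). All names here are illustrative.

```python
# 1NF table with composite key (student_id, course_id).
# student_name depends only on student_id: a partial dependency.
enrollments_1nf = [
    {"student_id": 1, "course_id": "DB101", "student_name": "Ada", "grade": "A"},
    {"student_id": 1, "course_id": "CS102", "student_name": "Ada", "grade": "B"},
    {"student_id": 2, "course_id": "DB101", "student_name": "Bob", "grade": "A"},
]

# Move the partially dependent attribute into its own table keyed by student_id.
students = {r["student_id"]: r["student_name"] for r in enrollments_1nf}

# What remains depends on the whole composite key.
enrollments = [
    {"student_id": r["student_id"], "course_id": r["course_id"], "grade": r["grade"]}
    for r in enrollments_1nf
]
```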

4
Q

Third Normal Form

A

2NF and no transitive dependencies
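A small sketch, assuming an employee table where `emp_id` determines `dept_id` and `dept_id` in turn determines `dept_name` (a transitive dependency). The table and column names are made up for the example.

```python
# 2NF table keyed by emp_id.
# emp_id -> dept_id -> dept_name is a transitive dependency.
employees_2nf = [
    {"emp_id": 10, "emp_name": "Ada", "dept_id": "D1", "dept_name": "Research"},
    {"emp_id": 11, "emp_name": "Bob", "dept_id": "D1", "dept_name": "Research"},
    {"emp_id": 12, "emp_name": "Cy", "dept_id": "D2", "dept_name": "Sales"},
]

# Move the transitively dependent attribute into a table keyed by dept_id.
departments = {r["dept_id"]: r["dept_name"] for r in employees_2nf}

# Employees keep only attributes that depend directly on emp_id,
# plus dept_id as a foreign key.
employees = [
    {"emp_id": r["emp_id"], "emp_name": r["emp_name"], "dept_id": r["dept_id"]}
    for r in employees_2nf
]
```

After the split, each department name is stored once instead of once per employee.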

5
Q

Boyce-Codd normal form (BCNF)

A

every determinant is a candidate key (special case of 3NF)
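One way to test this rule is with attribute closures. The sketch below is an assumption-laden illustration, not a full normalization tool: it flags any non-trivial functional dependency whose determinant does not determine every attribute (that is, is not a superkey). The relation and dependencies are hypothetical.

```python
def closure(attrs, fds):
    """Return every attribute determined by attrs under the given dependencies."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def bcnf_violations(all_attrs, fds):
    """List non-trivial dependencies whose determinant is not a superkey."""
    return [
        (lhs, rhs)
        for lhs, rhs in fds
        if not set(rhs) <= set(lhs) and closure(lhs, fds) != set(all_attrs)
    ]


# Hypothetical relation: each instructor teaches exactly one course.
attrs = ("student", "course", "instructor")
fds = [
    (("student", "course"), ("instructor",)),
    (("instructor",), ("course",)),
]
violations = bcnf_violations(attrs, fds)
```

Here the relation satisfies 3NF but not BCNF, because `instructor` determines `course` without being a key.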

6
Q

Fourth normal form (4NF)

A

3NF and no independent multivalued dependencies
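To make this concrete, the sketch below assumes an employee whose skills and spoken languages are independent of each other. Storing both in one table forces every combination to be listed; splitting them removes that redundancy. The data is invented for the example.

```python
# Skills and languages are independent facts about an employee,
# so one table must list every skill/language combination.
employee_facts = [
    {"emp": "Ada", "skill": "SQL", "language": "English"},
    {"emp": "Ada", "skill": "SQL", "language": "French"},
    {"emp": "Ada", "skill": "Python", "language": "English"},
    {"emp": "Ada", "skill": "Python", "language": "French"},
]

# Split into one table per independent multivalued fact.
emp_skills = sorted({(r["emp"], r["skill"]) for r in employee_facts})
emp_languages = sorted({(r["emp"], r["language"]) for r in employee_facts})

# Joining the two tables on emp rebuilds the original rows, so nothing is lost.
rejoined = {
    (emp, skill, language)
    for emp, skill in emp_skills
    for emp2, language in emp_languages
    if emp == emp2
}
```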

7
Q

What is normalization?

A

process for evaluating and decomposing each table (relation) into multiple tables (relations) to minimize data redundancies

8
Q

Why is normalization required?

A

When we build a database, the raw data is often redundant, inconsistent, and lacking in data integrity.

Example: spreadsheets don't enforce data types or ranges.
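By contrast, a database table can reject bad values at insert time. The sketch below uses Python's built-in `sqlite3` module with a `CHECK` constraint; the `person` table, the `age` column, and the 0 to 150 range are assumptions chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE person (
        person_id INTEGER PRIMARY KEY,
        age INTEGER NOT NULL
            CHECK (typeof(age) = 'integer' AND age BETWEEN 0 AND 150)
    )
    """
)


def try_insert(age):
    """Return True if the row is accepted, False if a constraint rejects it."""
    try:
        conn.execute("INSERT INTO person (age) VALUES (?)", (age,))
        return True
    except sqlite3.IntegrityError:
        return False


# A valid age, an out-of-range age, and a value of the wrong type.
results = [try_insert(30), try_insert(-5), try_insert("abc")]
```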

9
Q

Each table should consist of what two things?

A

primary key, set of mutually independent attributes

10
Q

Order of the normal-form checks

(least strict -> most strict)

A

1NF -> 2NF -> 3NF -> 4NF -> 5NF

11
Q

Fifth Normal Form (5NF) and domain-key normal form are defined but not . . .

A

generally enforced
