4.5 Database definitions Flashcards

1
Q

What is data consistency?

A

Data consistency refers to the accuracy, reliability, and uniformity of data across a database or system. It ensures that data is correct and coherent throughout its lifecycle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is data redundancy?

A

Data redundancy occurs when a piece of data is stored in multiple places across a database or system.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is data independence?

A

Data independence refers to the seperation of data from the applications or systems that use it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a relational database?

A

A relational database organises data into tables which consist of rows and columns. (records and attributes). Tables have relationships with each other that are established through keys.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is data normalisation?

A

Data normalisation is the process of organising data in a database to reduce redundancy, improve data integrity, and make the database more efficient. It involves breaking down tables into smaller, more manageable structures while ensuring that relationships between data are preserved.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the features of first normal form?

A
  1. It contains only atomic values
  2. Each column has a unique name
  3. All entries in a column are of the same data type
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the features of second normal form?

A
  1. It is already in 1NF
  2. It has no partial dependencies - all non-key attributes (columns) depend on the entire primary key, not just part of it.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the features of third normal form?

A
  1. Data is already in 2NF
  2. All non-key attributes depend only on the primary key.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

3 Advantages of normalisation

A
  1. Resulting database will take up less storage space
  2. Information retrieval will be more efficient because data is structured effectively.
  3. Less redundancy means less inconsistencies in data because data will only need to be entered once.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

3 disadvantages of normalisation

A
  1. It is a complex process to create the database structure.
  2. Can generate more tables than an unnormalised database which will mean a more complex database.
  3. It is necessary to assign more relationships to interact with larger numbers of tables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is validation?

A

Validation ensures that data/ input is sensible and reasonable. It does not check the accuracy of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is verification?

A

Verification is used to ensure that the data entered exactly matches the original source.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

2 Methods of Verification

A
  1. Double entry- Entering the data twice and comparing the two copies.
  2. Proofreading data- Someone checking the data entered against the original document.
  3. Referential integrity- For two tables that are linked together, records that include a foreign key can only exist if there is a corresponding primary key.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a data dictionary?

A

A file containing descriptions of the structure and attributes of data items stored in a database. It is a tool used by data managers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is data mining?

A

Data mining refers to the process of discovering patterns in large data sets. (Big data) It combines AI, statistics and database systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is big data?

A

Big data is a term associated with data sets that are so complex that traditional databases and other processing applications are unable to capture, curate, manage and process them within an acceptable time frame.

17
Q

What are the 3 stages of data mining?

A

Big data gathering
Big data storage
Big data Processing and Analysis

18
Q

Explain big data gathering.

A

Consumer companies actively scan social media websites to decipher user preferences, choices and perceptions towards their brands.

19
Q

Explain big data storage

A

Using big data may require an organisation to store data sets in the range of terabytes to many petabytes. Big data practitioners such as Google and Facebook all run what are known as hyperscale computing environments which consist of a vast number of servers each with Direct Attached Storage (DAS) - essentially lots of hard drives or flash storage devices.

20
Q

Explain big data processing and analysis.

A

Big data processing techniques analyse data sets at terabyte or even petabyte scale. Some methods are cluster analysis, anomaly detection, summarisation.