Chapter 4 - Preparing data for Analysis Flashcards
What are relational databases?
What are the parts of a relational database?
Best means of storing data in one place:
- Tables: Data organzied into fields and rows
- Field: Columns that contain descriptive characteristics about each observation
- Records: The rows that contain each observation
What are primary keys of relational databases?
Why are they important?
Unique identifier of each record that allows connection to other tables (think customer ID)
- Every table must have a primary key to ensure unique listings.
What are foreign keys of relational databases?
Primary key used in a separate table to create relationships between the tables.
- Not required for every table
What are relational database data dictionaries?
How do they differ from data dictionaries of chapter 3?
Table that
- relates each attribute (transaction ID, date, etc..) to their respective tables and
- explains useful info about each attribute
Contain more fields and desriptive elements of each attribute.
Use this slide to understand relational database data dictionaries.
Note the table field and primary/foreign key field.
What are entity relationship diagrams?
Graphical representation of each table in a database that shows the fields contained within,
- Connects tables through separate diagrams:
What are the advantages of using relational databases for storing data.
Puts all data in one place and ensures:
- Preventive internal controls
- Security of data entry
- Reduced redundancy costs
- Version control
What are the three characteristics that ensure data integrity (faithful representation)?
Data that is:
- free from error
- Complete (no missing info).
- Neutral (not biased)
Why is it better to connect files to a relational data base rather than extracting and creating different version?
Ensures info is up to date.
Extracting data makes data out of date as soon as new transactions are posted and damages the representation.
How do relational databases improve internal controls?
Relational databases ensures that parent data must exist before creating new data:
a foreign key cannot be made without first creating the parent key.
How do relational databases improve security?
Data entry can be verified and show who has entered and viewed new data.
How do relational databases reduce redundancy costs?
Ensures that there is a unique listing of each observation in only one location.
- maintains one version of the truth.
Use this slide to show how to use excel to connect to a database:
Use this slide to show how to refresh connections to databases in excel:
From data tab:
- Select “refresh all” dropdown
- “connection properties…”
What is Structured Query Language (SQL) and why is it useful for extracting data?
Universale database language.
Used to create, update, and delete records in relational databases.
Necessary for connecting to databases too large for excel to handle.