Part 4 - A World Built of Data Flashcards
What is SEO? Why is it important?
Search Engine Optimisation - used to make your website appear higher up the list when people search it in google. The higher links get the most traffic with the top link getting over 30% of all users.
What is SQL? What is it used for?
Structured Query Language is a text based code which allows you to search databases.
What is Cashing?
Many pages on the internet are stored in databases and pulled up when a user runs a query (knowingly or unknowingly). Cashing is where the page stores popular queries as a page to save repeatedly running the same queries.
What is the difference between a database and a spreadsheet?
Databases are meant for storing and querying large amounts of data.
While there is some cross over in functionality, spreadsheets are designed for handling numeric data (such as company accounts) and to make calculations or manipulations on lesser amounts of data
What is an issue with storing data in a flat database?
If a key value is changed then every record holding this value must be manually changed. E.g. if the date of an occasion is adjusted then each persons calendar appointment would have to be adjusted manually in the database
What are Entities and Attributes?
Entities are a “thing” which can be entered into relational databases. Tangible entities such as a person or intangible entities such as a course or an invitation can be within fields of a database. Attributes are fields under the entity so attributes for a person might be height, weight and age. Each person has the same attributes in the database but with different values
What are the benefits of a relational database and normalisation?
If a student is on multiple courses then you don’t have to have separate records or fields for each attribute with empty fields in some areas.
If there is an administrative change then you can make the change without having to find every occurrence in the database
What is sampling?
A method for drawing conclusions from a small cross section of data to avoid having to examine the entire data set
How is Big Data defines?
Using the 3 Vs -
Volume: large amounts of data needs to be processed
Velocity: there is a relentless demand for data to be processed at an increasing pace
Variety: the data being acquired can take any form - not only numeric or text but video or audio
Veracity: ensuring correctness and trustworthiness of the data