Introduction to Modelling Flashcards
(42 cards)
what is metadata
data about data
what is data
unprocessed information
what is information
data associated together
what is knowledge
understanding information
what sort of software manages data
file formats for particular applications .xls .doc .mp4 .jpg
specialist data management applications eg covid tracker
group project last year
4 ways of adding structure to data files
delimited text field
fixed length field
length-based field
identified field
what is a delimeter text field
choosing a special character eg comma or question mark that will not appear as a legitimate character within the info field and this will separate the individual data entries eg. comma separated file csv
What is a fixed length field
use a fixed length for each information field eg 20 characters, padding out when length is less than fixed lenght
what is the disadvantage of delimeter text
the character cannot be used legitimately in the information
what is length based field
writing the length of the information field before the information so we know exactly how much space it takes up
what is identified field
write the name of the information field and then value both represented as delimited text fields
what are the two types of approaches of turning data into information
structured and unstructured
what is a structured way of turning data into information
deliberately associate data together into information eg excel, data bases, datawarehouses
examples of structured approaches of turning data into information
excel
databases
datawarehouses
what is an unstructured way of turning data into information
loosely managed data together to serve a specific information need eg search engine
example of an unstructured approach of turning data into information
search engines
example of structured querying
SQL eg select exact criteria about data types
What does SQL stand for
structured query language
example of unstructured querying
keyword based, phrase based, search engine
examples of structured results
exact, do need to estimate relevance, returns complete set of data that matches query criteria
example of unstructured results
unsure of relevance, we must estimate relevance ourselves egGoogle page
DBs stands for
databases
ACID stands for
atomic
consistent
isolated
durable
different type of data base models
relational
networked
hierarchal
onject-orientated