Chap 2 Flashcards
What is data science?
It's a multidisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured, semi-structured, and unstructured data.
What is a data scientist?
It's a person who engages in a systematic activity to acquire knowledge from data.
What is the role of data scientists?
They perform research toward a more comprehensive understanding of products, systems, or nature, including the physical, mathematical, and social realms.
What skill set do data scientists need?
A strong background in:
1. statistics and linear algebra
2. programming knowledge
3. data warehousing, mining, and modeling, to build and analyze algorithms
What is an algorithm?
It's a set of instructions designed to perform a specific task.
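As a small illustration of this card, the Python sketch below spells out an algorithm for one specific task: finding the largest value in a list. The function name `find_largest` and the sample numbers are made up for the example.

```python
def find_largest(values):
    """Return the largest item in a non-empty list, one step at a time."""
    largest = values[0]          # Step 1: start with the first item
    for value in values[1:]:     # Step 2: look at each remaining item
        if value > largest:      # Step 3: keep whichever is bigger
            largest = value
    return largest               # Step 4: report the result


print(find_largest([3, 9, 4, 7]))  # prints 9
```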
What is data?
Data can be described as unprocessed facts and figures. It can exist in any form.
What is information?
It's data that has been processed and given meaning; decisions and actions are based on it.
What is data processing?
It's the restructuring of data by people or machines to increase its usefulness and add value for a particular purpose.
What are the basic steps of data processing?
- input
- processing
- output
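The three steps can be sketched in a few lines of Python. This is only an illustration: the scores are hypothetical, and `process` is a made-up helper that turns raw data into information (an average).

```python
# Input: raw, unprocessed exam scores stored as text (hypothetical values)
raw_scores = ["72", "85", "90", "64"]


def process(scores):
    """Processing: convert the text to numbers and compute the average."""
    numbers = [int(s) for s in scores]
    return sum(numbers) / len(numbers)


# Output: the processed result, ready to support a decision
average = process(raw_scores)
print(f"Average score: {average}")  # prints "Average score: 77.75"
```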
What are some material forms of data?
numbers
text
symbols
images
sound
What are the 2 categories of data forms?
qualitative = descriptions
quantitative = numeric records
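A rough way to see the difference in Python, assuming a hypothetical survey record. The `category` helper is a simplification: it treats any number as quantitative, although in practice some numbers (ID codes, for example) are really descriptions.

```python
# Hypothetical survey record mixing both categories of data
record = {
    "favorite_color": "blue",     # qualitative: a description
    "feedback": "very helpful",   # qualitative
    "age": 21,                    # quantitative: a numeric record
    "height_cm": 172.5,           # quantitative
}


def category(value):
    """Classify a value as quantitative (numeric) or qualitative (anything else)."""
    # bool is excluded because Python treats True/False as integers
    is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
    return "quantitative" if is_number else "qualitative"


for field, value in record.items():
    print(field, "->", category(value))
```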
What is a data type?
It's what tells the interpreter (or compiler) how the programmer intends to use the data.
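A short Python sketch of why the type matters: the same `+` operator behaves differently depending on whether the values are numbers or text. The variable names are chosen only for this example.

```python
as_numbers = 2 + 3       # integers: + means addition
as_text = "2" + "3"      # strings: + means joining the text together

print(as_numbers)  # prints 5
print(as_text)     # prints 23
```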
What are the different data types from a computer programming perspective?
- integers
- booleans
- characters
- strings
- floats
- alphanumeric strings
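The types in this list map onto Python's built-in types roughly as follows. The variable names and values are illustrative only; note that Python has no separate character type, so a character is just a string of length 1.

```python
count = 42              # integer: a whole number
is_valid = True         # boolean: True or False
grade = "A"             # character: a 1-character string in Python
name = "Ada"            # string: a sequence of characters
price = 19.99           # float: a real (floating-point) number
student_id = "CS2024"   # a string may also mix letters and digits

# Show each value next to the name of its type
for value in (count, is_valid, grade, name, price, student_id):
    print(repr(value), type(value).__name__)
```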
What are the 3 common types of data?
structured
semi-structured
unstructured
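One way to picture the three types is to store the same fact in each form. This is a sketch with made-up sample data, using CSV for structured, JSON for semi-structured, and free text for unstructured data.

```python
import csv
import io
import json

# Structured: fixed columns, so every row follows the same layout
structured = "name,age\nAda,30\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing keys (JSON), but fields may vary per record
semi_structured = '{"name": "Ada", "age": 30, "tags": ["student"]}'
record = json.loads(semi_structured)

# Unstructured: free text with no predefined fields
unstructured = "Ada, who is 30, wrote to say she enjoyed the course."

print(rows[0]["age"])    # the csv module returns every field as text
print(record["age"])     # JSON keeps the number as a number
print(len(unstructured.split()), "words")  # free text has to be analyzed as a whole
```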
What is structured data?
It's data that can be easily organized, stored, and transferred in a defined data model.
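A database table is a common example of a defined data model. The sketch below uses Python's built-in `sqlite3` module with a hypothetical `students` table; the names and ages are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # temporary in-memory database

# The data model: every row must have these two typed columns
conn.execute("CREATE TABLE students (name TEXT NOT NULL, age INTEGER NOT NULL)")
conn.executemany(
    "INSERT INTO students (name, age) VALUES (?, ?)",
    [("Ada", 30), ("Grace", 25)],
)

# Because the structure is known in advance, querying is simple
older = conn.execute(
    "SELECT name FROM students WHERE age >= ? ORDER BY name", (26,)
).fetchall()
print(older)  # prints [('Ada',)]
conn.close()
```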