Chapter 5 Flashcards
four v’s of big data
volume, velocity, variety, and veracity
data volume
amount of data created and stored by an organization
data velocity
pace at which data is created and stored
data variety
different forms data can take
data veracity
quality or trustworthiness of data
analytics mindset is ability to
ask right questions; extract, transform, and load relevant data; apply appropriate data analytic technique; interpret and share results with stakeholders
asking right questions is the 1st step of analytics mindset: establishing objectives that are smart
specific, measurable, achievable, relevant, timely
etl process
extracting, transforming, and loading data
structured data
data that is highly organized and fits into fixed fields
unstructured data
data that has no uniform structure
semi structured data
organized in some ways but not fully organized to be inserted into a relational database
data warehouses store
structured data
data lake
collection of structured, semi structured, and unstructured data in a single location
dark data
info the organization has collected and stored that would be useful for analysis but is not analyzed and is ignored
data swamps
data repositories that arent accurately documented so the stored data cant be properly identified and analyzed