Chapter 2 - Data Flashcards
Why is the type of data important to data mining?
The type of data determines which tools and techniques can be used to analyze it.
Why is data quality important?
Improving data quality typically improves the quality of the resulting analysis.
What is a data set?
A collection of data objects
What is a data object?
record, point, vector, pattern, event, case, sample, observation, or entity
What are attributes?
A property or characteristic of an object that may vary from one object to another
What is a **measurement scale **when referring to data mining?
A rule (function) that associates a numerical or symbolic value with an attribute of an object.
Describe the process of measurement when referring to data mining.
Using a measurement scale to associate a value with a particular attribute of a specific object.
What are the 4 properties (operations) of numbers that are typically used to describe attributes?
- Distinctness = and != 2. Order , and => 3. Addition + and - 4. Multiplication x and /
What are the 4 types of attributes?
- Nominal 2. Ordinal 3. Interval 4. Ratio
What is a nominal type of attribute?
The values of a nominal attribute are just different names that provide only enough information to distinguish one object from another.
What is an ordinal type of attribute?
The values of an ordinal attribute provide enough information to order objects.
What is an interval type of attribute?
The differences between values of an interval attribute are meaningful (a unit of measurement exists).
What is a ratio type of attribute?
The ratio and differences are both meaningful in a ratio attribute.
What type of attribute is the following: zip codes
Nominal
What type of attribute is the following: employee ID numbers
Nominal
What type of attribute is the following: eye colour
Nominal
What type of attribute is the following: gender
Nominal
What type of attribute is the following: hardness of minerals {good, better, best}
Ordinal
What type of attribute is the following: grades
Ordinal
What type of attribute is the following: street numbers
Ordinal
What type of attribute is the following: calendar dates
Interval
What type of attribute is the following: temperature in Celsius or Fahrenheit
Interval
What type of attribute is the following: monetary quantities
Ratio
What type of attribute is the following: counts
Ratio
