lec 3(done) Flashcards
Data objects
represents an entity.
ex.
1-Sales database: customers, store items, sales
2-Medical database: patients, treatments
3-University database: students, professors, courses
Attribute:
a data field, representing a characteristic or feature of a data object.
Attribute Types:
1-Nominal
2-Binary
3-Ordinal
4-Numeric
nominal attribute:
-Each value represents some kind of category, so nominal attributes are also called categorical.
-The values do not have any meaningful order.
ex.
-Hair_color = {black, blond, brown, grey, red, white}
-marital status, occupation, ID numbers, zip codes
Binary Attributes:
Nominal attribute with only two categories or states (0 and 1)
Symmetric binary: both outcomes equally important
Example: gender: {male, female}
Asymmetric binary: outcomes not equally important.
Example: medical_test {positive, negative}
Ordinal Attributes:
Values have a meaningful order (ranking) but magnitude between successive values is not known.
Examples:
Size = {small, medium, large}
grades, army rankings, satisfaction level
Numeric Attributes
Quantitative (integer or real-valued)
1-Interval-Scaled
-The differences between values are meaningful
-No true zero-point
-Values have order and can be positive, 0, or negative
Examples: temperature in C˚or F˚, calendar dates
2-Ratio-Scaled
-The differences and ratios between values are meaningful
-True zero-point
-Values are ordered
Examples: age, counts, height, weight, speed, monetary quantities
Attribute type with operations
slide 9
Discrete Attribute:
1-Has a finite or countably infinite set of values
2-Sometimes, represented as integer variables
Examples: hair color,zip codes, smoker, anddrink size
Continuous Attribute:
1-Has real numbers as attribute values
2-Typically represented as floating-point variables
Examples: temperature, height, or weight
Measuring the central tendency:
Mean, Median, Mode, Midrange
Measuring the dispersion of data:
1-Variance 2-Standard Deviation 3-Range 4-Quantiles 5-Interquartile Range 6-Five-Number Summary 7-Boxplots.
Graphic displays of basic statistical descriptions of data:
1-Quantile Plot
2-Quantile-Quantile Plot
3-Histograms
4-Scatter plots
Arithmetic mean
Weighted arithmetic mean
Trimmed mean
slide 13
Median:
Middle value if odd number of values, or average of the middle two values otherwise.