lec 3(done) Flashcards

1
Q

Data objects

A

represents an entity.
ex.
1-Sales database: customers, store items, sales
2-Medical database: patients, treatments
3-University database: students, professors, courses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Attribute:

A

a data field, representing a characteristic or feature of a data object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Attribute Types:

A

1-Nominal
2-Binary
3-Ordinal
4-Numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

nominal attribute:

A

-Each value represents some kind of category, so nominal attributes are also called categorical.
-The values do not have any meaningful order.
ex.
-Hair_color = {black, blond, brown, grey, red, white}
-marital status, occupation, ID numbers, zip codes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Binary Attributes:

A

Nominal attribute with only two categories or states (0 and 1)

Symmetric binary: both outcomes equally important
Example: gender: {male, female}

Asymmetric binary: outcomes not equally important.
Example: medical_test {positive, negative}

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Ordinal Attributes:

A

Values have a meaningful order (ranking) but magnitude between successive values is not known.

Examples:
Size = {small, medium, large}
grades, army rankings, satisfaction level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Numeric Attributes

A

Quantitative (integer or real-valued)
1-Interval-Scaled
-The differences between values are meaningful
-No true zero-point
-Values have order and can be positive, 0, or negative
Examples: temperature in C˚or F˚, calendar dates

2-Ratio-Scaled
-The differences and ratios between values are meaningful
-True zero-point
-Values are ordered
Examples: age, counts, height, weight, speed, monetary quantities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Attribute type with operations

A

slide 9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Discrete Attribute:

A

1-Has a finite or countably infinite set of values
2-Sometimes, represented as integer variables
Examples: hair color,zip codes, smoker, anddrink size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Continuous Attribute:

A

1-Has real numbers as attribute values
2-Typically represented as floating-point variables
Examples: temperature, height, or weight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Measuring the central tendency:

A

Mean, Median, Mode, Midrange

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Measuring the dispersion of data:

A
1-Variance
2-Standard Deviation
3-Range
 4-Quantiles
 5-Interquartile Range
 6-Five-Number Summary
 7-Boxplots.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Graphic displays of basic statistical descriptions of data:

A

1-Quantile Plot
2-Quantile-Quantile Plot
3-Histograms
4-Scatter plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Arithmetic mean

Weighted arithmetic mean

Trimmed mean

A

slide 13

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Median:

A

Middle value if odd number of values, or average of the middle two values otherwise.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Mode:

A

Value that occurs most frequently in the data

Data sets with one, two, or three modes are respectively called unimodal , bimodal , and trimodal.

17
Q

Midrange:

A

Average of the smallest and largest values in the set.