Vocabulary Flashcards

0
Q

Data

A

Facts and figures collected, analyzed, and summarized for presentation and interpretation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

Statistics

A

Art of science of collecting, summarizing, analyzing, presenting, and interpreting data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Set

A

All the data collected in a particular study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Elements

A

Entities on which data are collected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Variable

A

A characteristic of interest for elements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Observation

A

Set of measurements obtained for a particular element

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Nominal Scale

A

Scale of measurement for a variable when data are labels or names used to identify an attribute of an element. Nominal data may be nonnumeric or numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Ordinal Scale

A

Scale of measurement for a variable if the data exhibit the properties of nominal data and the order or rank of the data is meaningful. Can be nonnumeric or numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Interval Scale

A

Scale of measurement for a variable if the data demonstrate the properties of ordinal data and interval between values is expressed in terms of a fixed unit of measure. Always numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ratio Scale

A

Scale of measurement for a variable if the data demonstrate all properties of interval data and ratio of two values is meaningful. Always numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Categorical Data

A

Labels/names used to identify an attribute of each element. Use either nominal or ordinal scale of measurement. May be nonnumeric or numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Quantitative Data

A

Numeric values that indicate how much or how many of something. Obtained using either the interval/ratio scale of measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Categorical Data

A

Has categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Quantitative Data

A

Variable with quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Cross-Sectional Data

A

Data collected at the same or approximated the same point in time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Time Series Data

A

Data collected over several time periods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Descriptive Statistic

A

Tabular, graphical and numerical summaries of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Population

A

Set of all elements of interest in a particular study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Sample

A

Subset of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Census

A

Survey to collect data on entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Sample Survey

A

Survey to collect data on a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Statistical Inference

A

Process of using data obtained from a sample to make estimates or test hypotheses about the characteristics of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Frequency Distribution

A

Tabular summary of data showing the number (frequency) of data values in each of several non overlapping classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Relative Frequency Distribution

A

Tabular summary of data showing the fraction or proportion of data values in each of several non overlapping classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Percent Frequency Distribution
Tabular summary of data showing the percentage of data values of each of several non overlapping classes
25
Bar Chart
Graphical device for depicting categorical data that have been summarized in a frequency, relative frequency or percent frequency distribution
26
Pie Chart
Graphical device for presenting data summaries based on subdivision of a circle into sectors that correspond to the relative frequency for each class
27
Class Midpoint
Value halfway between the lower and upper class limits
28
Dot Plot
Graphical device that summarizes data by the number of dots above each data values on the horizontal axis
29
Histogram
Graphical presentation of a frequency distribution, relative frequency distribution, or percent frequency distribution of quantitative data constructed by placing the class intervals on the horizontal axis and the frequencies, relative frequencies, or percent frequencies on the vertical axis
30
Cumulative Frequency Distribution
Tabular summary of quantitative data showing number of data values that are less than or equal to the upper class limit of each class
31
Cumulative Relative Frequency Distribution
Tabular summary of quantitative data showing the percentage of data values that are less than or equal to the upper class limit of each class
32
Ogives
Graph of a cumulative distribution
33
Exploratory Data Analysis
Methods that use simple arithmetic and easy to draw graphs to summarize data quickly
34
Stem-and-Leaf Display
Exploratory data analysis technique that simultaneously rank orders quantitative data and provides insight about the shape of the distribution
35
Crosstabulation
Tabular summary of data for two variables. The classes for one variable are represented by the rows; the classes for the other variable are represented by the columns
36
Simpson's Paradox
Conclusions drawn from two or more desperate cross tabulations that can be reversed when the data are aggregated into a single crosstabulation
37
Scatter Diagram
Graphical presentation of the relationship between two quantitative variables. One variable is shown on horizontal axis and other is shown on vertical axis
38
Trendline
Line that provides an approximation of the relationship between 2 variables
39
Sample Statistic
Numerical value used as a summary measure for a sample
40
Population Parameter
Numerical value used as a summary measure for a population
41
Point Estimator
Sample statistic (Greek letters) that are used to estimate the corresponding population parameter
42
Mean
Measure of central location computed by summing the data values and dividing by the number of observations
43
Median
Measure of central location provided by the value in the middle when the data are arranged in ascending order
44
Mode
A measure of location, defined as the value that occurs with greatest frequency
45
Percentile
Value such that at least p percent of the observation are less than or equal to this value and at least (100-p) percent of the observations are greater than or equal to this value. The 50th percentile is the median
46
Quartiles
Can be used to divide a data set into four parts, when each part containing approximately 25% of the data
47
Range
A measure of variability, defined to be the largest value minus the smallest value
48
Interquartile Range (IQR)
Measure of variability, defined to be the difference between the 3rd and 1st quartiles
49
Variance
Measure of variability based in the squared deviations of the data values about the mean
50
Standard Deviation
Measures of variability computed by taking positive square root of the variance
51
Coefficient of Variation
Measure of relative variability computed by dividing he standard deviation by the mean and multiplying by 100
52
Skewness
Measure of the shape of a data distribution. Data skewed to the left result in negative skewness; a symmetric data distribution results in zero skewness; and data skewed to the right result in positive skewness
53
Z-score
Value computed by dividing the deviation about the mean by the standard deviation. A z-score value and denotes the number of standard deviations the variable is from the mean
54
Chebyshev's Theorem
Theorem that can be used to make statements about the proportion of data values that must be within a specified number of standard deviations of the mean
55
Empirical Rule
Rule that can be used to compute the percentage of data values that must be within one, two, and three standard deviations of the mean for data that exhibit a bell shaped distribution
56
Outlier
Unusually small or large data value
57
Covariance
Measure of linear association between two variables. Positive values indicate a positive relationship; negative values indicate a negative relationship
58
Correlation Coefficient
Measure of linear association between 2 variables that takes on values between -1 and +1. Values near +1 indicate a strong positive linear relationship; values near -1 indicate a strong negative linear relationship; and values near zero indicate the lack of a liner relationship
59
Weighted Mean
Mean obtained by assigning each observation a weight that reflects its importance
60
Grouped Data
Data available in class intervals as summarized by a frequency distribution. Individual values of the original data are not available
61
Probability
A numerical measure of the likelihood that an event will occur
62
Complement of A
The event consisting of all sample points that are not in A
63
Venn Diagram
A graphical representation for showing symbolically the sample space and operations involving events in which the sample space is represented by a rectangle and events are represented as circles within the sample space
64
Union of A and B
The event containing all sample points belonging to A or B or both. The union is denoted A U B
65
Intersection of A and B
The event containing the sample points belonging to both A and B. The intersection is denoted A n B
66
Addition Law
A probability law used to compute the probability of the Union of two events. It is P(A U B) = P(A) + P(B) - P(A n B). For mutually exclusive events, P(A n B) = 0; in this case the addition law reduces to P(A U B) = P(A) + P(B)
67
Mutually Exclusive Events
Events that have no sample points in common; that is, A n B is empty and P(A n B) = 0
68
Conditional Probability
The probability of an event given that another event already occurred. The conditional probability of A given B is P(A|B) = P(A n B)/P(B)
69
Joint Probability
The probability of two events both occurring; that is, the probability of the intersection of two events
70
Marginal Probability
The values in the margins of a joint probability table that provide the probabilities of each event desperately
71
Independent Events
Two events A and B where P(A|B) = P(A) or P(B|A) = P(B); that is the events have no influence on each other
72
Multiplication Law
A probability law used to compute the probability of the intersection of two events. It is P(A n B) = P(B)P(A|B) or P(A n B) = P(A)P(B|A). For independent events it reduces to P(A n B) = P(A)P(B)
73
Random Variable
A numerical description of the outcome of an experiment
74
Discrete Random Variable
A random variable that may assume either a finite number of values or an infinite sequence of values
75
Continuous Random Variable
A random variable that may assume any numerical value in an interval or collection of intervals
76
Probability Distribution
A description of how the probabilities are distributed over the values of the random variable
77
Probability Function
A function, denoted by f(x), that provides the probability that x assumes a particular value for a discrete random variable
78
Discrete Uniform Probability Distribution
A probability distribution for which each possible value of the random variable has the same probability
79
Expected Value
A measure of the central location of a random variable
80
Variance
A measure of the variability, or dispersion, of a random variable
81
Standard Deviation
The positive square root of the variance