Statistics Flashcards

1
Q

Also called a categorical variable. Simple classification. We do not need to count to distinguish one item from another, mutually exclusive.

A

Nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The only discrete-only in scales of measurement.

A

Nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The only continuous alone on scales of measurement or have 0.5 as the smallest unit.

A

Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Cases are ranked or ordered. Represent position in a group where the order matters but not the difference between values.

A

Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

It uses intervals equal in amount measurement where the difference between two values is meaningful.

A

Interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Similar to interval but includes a true zero point and relative proportions on the scale make sense.

A

Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which among the scales of measurement are parametric and non parametric?

A

P- Interval & ratio
NP- Nominal & Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 4 scales of measurement?

A

Nominal
Ordinal
Interval
Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Refers to the analysis of data of an entire population merely using numbers to describe a known data set.

A

Descriptive Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Value in a group of values which is the most typical for the group, or the score at which all the scores are evenly clustered around. The average or midmost score.

A

Measures of Central Tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the measures of central tendency?

A

Mean
Median
Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The average/arithmetic mean. Sum of a set of measurements in the set. Data is interval only.

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Central value of a set value such that the half the observations fall above it and half below it. The middle score in the distribution. Use ordinal and interval data.

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Modal value of a set. Most frequently occurring value. For grouped data, it is the midpoint of the class interval with the largest frequency, uses nominal, ordinal and interval data.

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Measures of how much or how little the rest of the values tend to vary around the central or typical value. Variation or error.

A

Measures of variability/Dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the measures of variability/dispersion?

A

Standard deviation
Variance
Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What level of data does all measures of variability/dispersion use?

A

Interval (some books include ratio)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Square root of variance. Shows the distribution of measurement.

A

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

(Sd)²

A

Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Simplest measure of variation. Difference between the largest and smallest measurement.

A

Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Used to describe the position of a particular observation in relation to the rest of the data set.

A

Measures of Location

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

In measures of location, The pth percentile of a data set is a value such that at least percent of the observation take on this value or less and at least _ percent of the observations take on this value or more.

A

100-p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the measures of location?

A

Percentiles
Quartiles
Deciles
Frequency Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Percentage of the total number of observations that are less than the given value. Identifies the point below which a specific percentage of the cases fall.

A

Percentiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

The data can be divided into 4 parts instead of two. This is what you call the cut points.

A

Quartiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

The data can be divided into 10 parts instead of two or four. This is what you call the cut points.

A

Deciles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

A classification of data that may help in understanding important features of the data may be graphically presented in the form of a histogram, polygon, etc.

A

Frequency Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

This measure of location represents the same 2 elements:
Set of categories that make up the original measurement scale.
A record of the frequency, or number of individuals in each category.

A

Frequency Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

All measures of location use ordinal, interval, and ratio level of data except _ which uses all levels of data.

A

Frequency Distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Measurement of the extent to which pairs of related values on 2 variables tend to change together; gives measure of the extent to which one variable can be predicted from values on the other variable.

A

Measures of correlation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

If one variable increases with the other, the correlation is positive (near _). If the relationship is inverse, it is a negative correlation (near _). A lack of correlation is signified by a value close to _.

A

+1
-1
0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What are the measures of correlation?

A

Pearson’s Product moment correlation
Spearman’s Rho Rank-order
Kendall’s Coefficient of Concordance W
Point-Biserial Coefficient rpb
Phi or Fourfold Coefficient
Lambda

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

A measure of correlation for 2 groups, using interval level of data. Data must be in the form of related pairs of scores. The higher the r , the higher the correlation.

A

Pearson’s Product Moment Correlation (r)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

A measure of correlation for 2 groups, using the ordinal level of data. Data must be in the form of related pairs of scores and is used for ≤ 3. Easy to calculate but non parametric.

A

Spearman’s Rho Rank-order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

A measure of correlation for ≥ 3 groups, using the ordinal level of data. Data must be ≥ 3 sets of ranks. Easy to calculate but non parametric.

A

Kendall’s Coefficient Concordance W

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

A measure of correlation for 2 groups, using the continuous and dichotomous nominal level of data.

A

Point-Biserial Coefficient rpb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

A measure of correlation for 2 groups, using 2 dichotomous nominal level of data.

A

Phi or Fourfold Coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

A measure of correlation for ≥ 2 groups, using nominal (dependent/independent) ) levels of data. It is also known as Guttman’s Coefficient of predictability. Gives an indication of the reduction of errors made in a prediction scheme.

A

Lambda

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

A non parametric measure of the agreement between two rankings.

A

Tau Coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Tests for statistical dependence.

A

Kendall’s Tau Coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

An index of interrater reliability of ordinal data.

A

Coefficient of Concordance (W)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Measurement of the extent to which pairs of related values on 2 variables tend to change together; gives measure of the extent to which values on one variable can be predicted from the values on the other variable.

A

Inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What are the inferential statistics tests?

A

Z-test of one sample mean
T-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Variation of t-test

A

Independent samples
Dependent samples
Proportions/Percentages
Variances
2 correlation coefficients

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What level of data do all tests for inferential statistics use?

A

Interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

A measure for inferential statistics for 1 group.
N ≥30 used to test whether a population parameter is significantly different from some hypothesized value.

A

Z-test of one sample mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

A measure for inferential statistics when n< 30

A

T-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

This kind of t-test is for 2 groups. It assesses whether the means of 2 groups are statistically different from each other.

A

Independent samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

This kind of t-test is for 1 group. It is used when the subjects making up the 2 samples are matched on some variable before being put in the 2 groups or the situation where the 2 groups are the same subjects administered a pretest and post test.

A

Dependent samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

This kind of t-test is for 1 group. It is used to test the hypothesis that an observed proportion is equal to a pre-specified proportion.

A

Proportions/Percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

This kind of t-test uses the F test for equal and unequal.

A

Variances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

This kind of t-test is for 2 groups. It is used to assess the significance of the difference between two correlation coefficients found in 2 independent samples.

A

2 correlation coefficients

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

It is used for problems of predicting one variable from a knowledge of another or possibly several other variables. It is always the regression of the predicted value on the known variable.

A

Regression Equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What are the regression equations?

A

Linear regression of y on x
Linear regression of x on y
Standard error of estimate (SEE)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

Standard deviation of errors of prediction. An indication of the variability about the regression line in the population wherein predictions are being made.

A

Standard error of estimate (SEE)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

Among ANOVA and t-tests, which organizes and directs analysis and has easier interpretation of the results?

A

ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Performing repeated t-tests increases the probability of _?

A

Type I error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

ANOVA needs to be followed by what test?

A

Post hoc test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

What does the post hoc test determine?

A

Which group differs from each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

We should not conduct a post hoc test unless the null is ?

A

Rejected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

A test designed for a situation with equal sample size per group, but can be adapted to unequal sample sizes as well.

A

Tukey’s (Honestly Significant Difference or HSD) Test

62
Q

Descriptive measure of the utility of the regression equation for making predictions square of the linear correlation coefficient.

A

Coefficient of Determination

63
Q

In determining the coefficient of determination, the nearer the value is to _, the useful is the regression equation in making predictions.

A

1

64
Q

Used to test the significance of the differences among means obtained from independent samples (parametric tests) significance where >2 conditions are used, or even when several independent variables are involved.

A

Analysis of Variance (ANOVA)

65
Q

What are the types of ANOVA?

A

One-Way ANOVA
Two-Way ANOVA
Three-Way ANOVA

66
Q

Tests used if 2 or more samples were drawn from the same population by comparing means or if data from several groups have a common mean. There’s 1 IV and 1 DV and an interval level of data.

A

One-Way ANOVA

67
Q

It tests the hypothesis that the means of 2 variables (factors) from 2 or more groups (2 IV, 1 DV) are equal (drawn from population with the same mean)

A

Two-way ANOVA

68
Q

It has a similar purpose with the different kinds of ANOVA, except that the groups here have 3 categories of defining characteristics. It must have 3 IV and 1 DV

A

Three-Way ANOVA

69
Q

It corrects alpha not just for all pair-wise or simple comparisons of means, but also for complex comparisons (contrast of more than 2 at a time) of means.

A

Scheffe’s Test

70
Q

The most popular of the post hoc procedures, most flexible and most conservative but has least statistically powerful procedure.

A

Scheffe’s Test

71
Q

Versatile formula data must be presented in frequencies. It is categorized under non parametric test but can also be used as a parametric test data

A

Chi-square

72
Q

The 2 chi-square tests

A

Goodness of Fit
Independence

73
Q

Also called one-sample or one-variable chi-square. Involves 1 variable of ≥ 2 categories. It compares the distribution of measures for deviation from a hypothesized distribution. Nominal level of data.

A

Goodness of Fit

74
Q

A chi-square test that involves 2 variables consisting of ≥ 2 categories. It determines whether the 2 variables are related or not. Reveals only the relationship but not the magnitude of the relationship.

A

Independence

75
Q

Parametric Test or Non-Parametric test?
Random selection of subjects from a normal population with equal variances.

A

Parametric Tests

76
Q

Parametric Test or Non-Parametric test?
Whether the groups or samples to be compared are independent samples or correlated.

A

Both

77
Q

Parametric Test or Non-Parametric test?
Whether the number of groups to be compared is ≥ 2.

A

Non-parametric Test

78
Q

Parametric Test or Non-Parametric test?
More power, higher power efficiency.

A

Parametric Test

79
Q

Parametric Test or Non-Parametric test?
Simple and easier to calculate.

A

Non-parametric Test

80
Q

Parametric Test or Non-Parametric test?
No need to meet data requirements at all.

A

Non-parametric Test

81
Q

What are some non parametric tests?

A

Median test
Father’s Sign Test
Wilcoxon Rank Sum Test
Mann-Whitney (U) Test
Wilcoxon Signed Ranks Test (T)
Kruskal-Wallis H Test
Friedman Rank

82
Q

Non parametric test that compares the medians of 2 independent sample (uncorrelated). Only considers the number of cases above and below the median. Presented in an ordinal data.

A

Median Test

83
Q

Non parametric test that compares the 2 correlated samples by obtaining the differences between each pair of observation. Consider the signs of the differences between paired observations in their sizes.

A

Fisher’s Sign Test

84
Q

What level of data do non parametric tests have?

A

Ordinal

85
Q

Non parametric test that is used for comparing 2 independent samples using rank data.

A

Wilcoxon Rank Sum Test

86
Q

Non parametric test that is used with independently drawn random samples, the sizes of which need not to be the same.

A

Mann-Whitney (U) Test

87
Q

Non parametric test that is used for correlated samples, the difference, d, between each pair is calculated (data subjected to computation)

A

Wilcoxon Signed Ranks Test (T)

88
Q

Non parametric test that is used to test whether or not a group of independence samples is from the same or different population. Compares 3 or more independent samples with respect to an ordinal variable.

A

Kruskal-Wallis H Test

89
Q

Non parametric test that is used to test whether or not the data is from the sample under 3 different conditions.

A

Friedman Rank

90
Q

The act of assigning numbers or symbols to characteristics of things according to rules.

A

Measurement

91
Q

A set of numbers or symbols whose properties model empirical properties of the objects to which the numbers are assigned.

A

Scale

92
Q

Permits classification. Rank ordering on some characteristic. Have no absolute zero point.

A

Ordinal scales

93
Q

A set of test scores arrayed for recording or study.

A

Distribution

94
Q

A straightforward, unmodified accounting of performance that is usually numerical.

A

Raw score

95
Q

All scores are listed alongside the number of times each score occurred.

A

Simple Frequency Distribution

96
Q

Class intervals replace the actual test scores.

A

Grouped frequency distribution

97
Q

A graph with vertical lines drawn at the true limits of each test score or class interval forming a series of contiguous rectangles.

A

Histogram

98
Q

Expressed by continuous line connecting the points where the test scores or class intervals (X axis) meet frequencies (Y axis).

A

Frequency Polygon

99
Q

If the distribution is normal, the most appropriate measure of central tendency for mean is what data?

A

Interval or ratio

100
Q

There are two scores that occur with the highest frequency. It is theoretically possible for this distribution to have two modes with falls at the high or low end of the distribution.

A

Bimodal distribution

101
Q

An indication of how scores in a distribution are scattered or dispersed.

A

Variability

102
Q

Statistics that describe the amount of variation in a distribution.

A

Measures of variability

103
Q

It is a measure of variability equal to the difference between Q3 and Q1.

A

Interquatile ranges

104
Q

It is equal to the interquartile range divided by 2.

A

Semi-interquatile range.

105
Q

Quarter refers to an _.

A

Interval

106
Q

The dividing points between the 4 quarters in the distribution. It refers to a specific point.

A

Quartiles

107
Q

Q2 and the _ are exactly the same.

A

Median

108
Q

In a perfectly symmetrical distribution, Q1 and Q3 will be exactly the same distance from the _.

A

Median

109
Q

To obtain this, all the deviation scores are summed and divided by the total number of scores.

A

Average deviation

110
Q

A measure of variability equal to the square root of the variance.

A

Standard deviation

111
Q

It is equal to the arithmetic mean of the squares or the differences between the scores in a distribution and their mean.

A

Variance

112
Q

It is the nature and extent to which symmetry is absent.

A

Skewness

113
Q

Relatively few of the scores fall at the high end of the distribution.

A

Positive skew

114
Q

Relatively few of the scores fall at the low end of the distribution.

A

Negative skew

115
Q

Refers to the steepness of a distribution in its center.

A

Kurtosis

116
Q

What are the 3 general types of curves and what does it mean?

A

Platykuric- relatively flat
Leptokurtic- relatively peaked
Mesokurtic- somewhere in the middle

117
Q

Distribution that have _ kurtosis have a high peak and fatter tails compared to a normal distribution.

A

High

118
Q

Distribution that have _ kurtosis values indicate a distribution with a rounded peak and thinner tails.

A

Lower

119
Q

The development of the concept of a normal curve began in the middle of the 18th century with the work of _ and later the Marquis de Laplace.

A

Abraham DeMoivre

120
Q

Through the early 19th century, scientists referred to the Normal curve as the _.

A

Laplace-Gaussian curve

121
Q

He was credited with being the first to refer to the curve as the normal curve.

A

Karl Pearson

122
Q

The distribution of the normal curve ranges from _ to _.

A

Negative infinity to positive infinity.

123
Q

The normal curve is perfectly symmetrical with _ skewness.

A

No skewness

124
Q

A raw score that has been converted from one scale to another scale where the latter scale has some arbitrarily set mean and standard deviation.

A

Standard scores

125
Q

These scores are more easily interpretable than raw scores.

A

Standard scores

126
Q

It results from the conversion of a raw score into a number indicating how many standard deviation units the raw score is below or above the mean of the distribution.

A

Z scores or zero plus or minus one scale

127
Q

T score is devised by W.A Mccall and named it in honor of his professor _.

A

E. L. Thorndike

128
Q

This standard scores system is composed of a scale that ranges from 5 standard deviations below the mean to 5 standard deviations above the mean. Mean=50; Std.= 10

A

T scores or fifty plus or minus ten scale

129
Q

A standard score which has a mean of 5 and std of 2.

A

Stanine

130
Q

The 5th stanine indicates performance in the _.

A

Average range

131
Q

It is an expression of the degree and direction of correspondence between two things.

A

Correlation

132
Q

A number that provides us with the index of the strength of the relationship between two things.

A

Coefficient of Correlation (r)

133
Q

The meaning of correlation coefficient is interpreted by its _ and _.

A

Sign and magnitude

134
Q

The two ways to describe a perfect correlation between two variables are as either _ of _.

A

+1
-1

135
Q

Magnitude is a number anywhere between _ and _.

A

+1
-1

136
Q

He devised the Pearson r .

A

Karl Pearson

137
Q

Can be the statistical tool of choice when the relationship between the variables is linear and when the two variables being correlated are continuous.

A

Pearson r/ Pearson Correlation Coefficient/ Pearson Product-moment Coefficient of Correlation

138
Q

The Spearman Rho was developed by _.

A

Charles Spearman

139
Q

A measure of correlation that is frequently used when the sample size is small (fewer than 30 pairs of measurement) and when both sets of measurements are in ordinal form.

A

Spearman Rho

140
Q

A simple graphic of the coordinate points for the values of the X-variable (horizontal axis) and the Y-variable (vertical axis). They are useful because they provide a quick indication of the direction and magnitude of the relationship between the 2 variables and also reveals the presence of curvilinearity.

A

Scatterplot

141
Q

An eyeball gauge of how curved a graph is.

A

Curvilinearity

142
Q

An extremely atypical point located at a relatively long distance from the rest of the coordinate points in a scatter plot.

A

Outlier

143
Q

A statistic useful in describing sources of test score variability.

A

Variance

144
Q

Variance from true differences.

A

True variance

145
Q

Variance from irrelevant random sources.

A

Error variance

146
Q

The greater the proportion of the total variance to true variance, the more _ the test.

A

Reliable

147
Q

Refers to all the factors associated with the process of measuring some variable other than the variable being measured.

A

Measurement error

148
Q

A source of error in measuring a targeted variable caused by unpredictable fluctuations and inconsistencies of other variables in the measurement process.

A

Random Error

149
Q

A source of error in measuring a variable that is typically constant or proportionate to what is presumed to be the true value of the variable being measured.

A

Systematic error

150
Q

Give a source of variance during test construction.

A

Item sampling/Content sampling

151
Q

A validity coefficient that is used when correlating self rankings of performance.

A

Spearman Rho Rank-order Correlation