Chapter 5: Statistical Data Treatment and Evaluation Flashcards

1
Q

this interval defines a numerical interval around the mean of a set of replicate results within which the population mean can be expected to lie with a certain probability

A

confidence interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

the limits of the interval are called

A

confidence limits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

the probability that the true mean lies
within a certain interval and is often
expressed as a percentage.

A

confidence level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The probability that a
result is outside the confidence interval is often called the

A

significance level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

90% confidence interval , z=

A

1.64

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

95% confidence interval , z=

A

1.96

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

99% confidence interval , z=

A

2.58

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

formula for finding confidence interval when standard deviation population is known or s is a good estimate of the standard deviation population

A

CI for u=x+- z(sigma)
CI for u= mean +- z(sigma)/square root of N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

TRUE or FALSE
It is essential to keep in mind at all times that confidence intervals based on CI for u= mean +- z(sigma)/square root of N apply only in the absence of bias and only if we can assume that s is a good approximation of s.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

studied the limits of the Poisson and binomial distributions,
the sampling distribution of the mean and standard deviation, and several
other topics.

A

W. S. Gossett

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

His most important work on the t test was developed to determine how closely the yeast and alcohol content of various batches of Guinness matched the standard amounts established by the brewery

A

W. S. Gossett

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

is the basis for many decisions made in science and engineering.

A

Hypothesis testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

postulates that two or more observed quantities are the same.

A

null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Specific examples of hypothesis tests that scientists often use include the comparison
of

A

1) the mean of an experimental data set with what is believed to be the true
value,
2) the mean to a predicted or cutoff (threshold) value, and
3) the means or
the standard deviations from two or more sets of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

TRUE or FALSE
The first, the null hypothesis H0, states that m population is equal to true value of mean population. The second, the alternative hypothesis Ha can be stated in several ways.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

it does not matter whether the mean is larger or smaller than the known value

A

two-tailed test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

other alternative hypotheses are mean population > true mean population or vice versa, which indicated the direction of the difference matters and is called

A

one-tailed test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

TRUE or FALSE
The crucial elements of a test procedure are the formation of an appropriate test statistic
and the calculation of a probability value—p-value

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

TRUE or FALSE
The smaller the value of p, the stronger the evidence against H0. Conversely, if the p value is large, there is good evidence that H0 is true, and therefore, it should be accepted

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

For tests concerning one or two means, the test statistic might be the ________ if we have a large number of measurements or if we know s

A

z statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

used for a small number of measurements with unknown s. Also, when in doubt, it is also should be used.

A

t statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

TRUE or FALSE
If the probability p of obtaining
the z (or t) value is very low when assuming H0 is true, reject H0.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

TRUE or FALSE
using the rejection region approach, if z (or t) lies within the rejection region, reject H0.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

TRUE or FALSE
The rejection region approach has fallen
out of favor because the region holds only the chosen significance level

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

what test is applicable If a large number of results are available so that s is a good estimate of s

A

large sample z test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

rejection can occur for results in either tail of the distribution.

A

two-tailed test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

For a small number of results, use a similar procedure to the z test except that the test
statistic is the t statistic and the test is called the

A

small sample t test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

TRUE or FALSE
In testing for bias, we do not know initially whether the difference between the experimental mean and the accepted value is due to random error or to an actual
systematic error. The t test is used to determine the significance of the difference.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

TRUE or FALSE
If the analytical method
had no systematic error, or bias, random errors would give the frequency distribution

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

TRUE or FALSE
t test is used to determine the significance of the difference of experimental mean and the accepted value

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

If it were confirmed by further
experiments that the method always
gave low results, we would say that
the method had a

A

negative bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

TRUE or FALSE
If the absolute value of the test statistic is less than the critical value, the null hypothesis is accepted, and no significant difference between the means has been demonstrated

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

TRUE or FALSE
A test value of t greater than the critical
value indicates a significant difference between the means

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

TRUE or FALSE
Since t is less than or equal to the tcrit, you can conclude that there is a significant difference at % confidence level.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

TRUE or FALSE
When t is greater than the tcrit value, accept the null hypothesis at % confidence level and conclude that there is no significant difference between the experimental and the accepted value.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

TRUE or FALSE
The number of degrees of freeon for binding the critical value of t is N1+N2-2

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

If there is good reason to believe that the standard deviations of the two data sets differ, the___________ must be used.1 However, the significance level for this t test is only approximate, and the number of degrees of freedom is more difficult to calculate.

A

two-sample t test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What statistical test should be used by scientists and engineers who often make use of pair measurements on the same sample in order to minimize sources of variability that are not of interest?

A

t test analyzing paired data

39
Q

What statistical test should be used when two methods for determining glucose in blood serum are to be compared

A

t test analyzing paired data

40
Q

occurs when H0 is
rejected although it is actually true.

A

type I error

41
Q

In some sciences, a type I error is called a

A

false negative

42
Q

occurs when H0 is accepted and
it is actually false.

A

type II error

43
Q

type II error is sometimes termed as a

A

false positive

44
Q

The probability of a type II error is given by the symbol

A

beta (B)

45
Q

Making alpha (a) smaller such that 0.01 instead of 0.05 appears to make sense in order to minimize what type of error

A

type I error

46
Q

TRUE or FALSE
decreasing the type I error rate increases
the type II error rate because they are inversely related to each other.

A

TRUE

47
Q

TRUE or FALSE
an a value of 0.05
(95% confidence level) provides an acceptable compromise.

A

TRUE

48
Q

TRUE or FALSE
If a type I error is much more
likely to have serious consequences than a type II error, it is reasonable to choose
a small value of a.

A

TRUE

49
Q

TRUE or FALSE
a type II error would be
quite serious, and so a larger value of a is used to keep the type II error rate under
control.

A

TRUE

50
Q

TRUE or FALSE
As a general rule of thumb, the largest a that is tolerable for the situation
should be used. This ensures the smallest type II error while keeping the type I error
within acceptable limits.

A

TRUE

51
Q

What statistical analysis requires that the standard deviations of the data sets being compared are equal.

A

t test

52
Q

can be used to test this assumption under the provision that the populations follow the
normal (Gaussian) distribution. It is also used in comparing more than two means and in linear regression analysis

A

F test

53
Q

is based on the null hypothesis that the two population variances under
consideration are equal

A

f test

54
Q

TRUE or FALSE
t and z test is based on the null hypothesis that the two population mean under consideration are equal

A

TRUE

55
Q

is defined as the
ratio of the two sample variances is calculated and compared with the
critical value of F at the desired significance level

A

test statistic F

56
Q

TRUE or FALSE
For a one-tailed test, we test the alternative hypothesis that one variance is greater than
the other. Hence, the variance of the supposedly more precise procedure is placed in the denominator and that of the less precise procedure is placed in the numerator

A

TRUE

57
Q

TRUE or FALSE
If F1 is less than fcrit and p> actual value, we accept the null hypothesis

A

TRUE

58
Q

TRUE or FALSE
If F2 is greater than fcrit and p> actual value, we reject the null hypothesis

A

TRUE

59
Q

Analysis of Variance also stands for

A

ANOVA

60
Q

These methods use a single test to determine whether there is
or is not a difference among the population means rather than pairwise comparisons
as is done with the t test

A

ANOVA

61
Q

take advantage of ANOVA in
planning and performing experiments.

A

Experimental design methods

62
Q

Common characteristics in a
comparison are ________. The values of
the factors are ________. Experimental
results are ________

A

factors; levels; responses

63
Q

the populations have differing values of a common characteristic called

A

factor or treatment

64
Q

The different values of the factor of interest are called

A

levels

65
Q

The comparisons among the various populations are made by measuring a

A

response

66
Q

Often, several factors may be involved, such as in an experiment to determine
whether pH and temperature influence the rate of a chemical reaction. In such
a case, the type of ANOVA is known as a

A

two-way ANOVA

67
Q

The basic principle of ________
is to compare the variations
between the different factor
levels (groups) to that within
factor levels.

A

ANOVA

68
Q

In ANOVA, the factor levels are often called

A

groups

69
Q

TRUE or FALSE
The basic principle of ANOVA is to
compare the between-groups variation to the within-groups variation.

A

TRUE

70
Q

The basic statistical test used for ANOVA is the

A

F test

71
Q

detect difference in several population means by comparing the variances

A

ANOVA

72
Q

it is the average of all the data

A

grand average

73
Q

TRUE or FALSE
The error sum of the squares is related to the individual group variances

A

TRUE

74
Q

TRUE or FALSE
As a rough rule of thumb, the largest s should not be much more than
twice the smallest s for equal variances to be assumed

A

TRUE

75
Q

By dividing the sums of squares by their corresponding degrees of freedom, you
can obtain quantities that are estimates of the between-groups and within-groups
variations. These quantities are called

A

mean square values

76
Q

are sums of squares divided by degrees of
freedom.

A

mean square values

77
Q

is an estimate of the variance due to error

A

mean square due to errors

78
Q

is an estimate of the error variance plus the between-groups variance

A

MSF/ mean square due to factor levels

79
Q

TRUE or FALSE
If the factor effect is significant, MSF is
greater than MSE.

A

TRUE

80
Q

There are several methods to determine
which means are significantly different. One of the simplest is the

A

least significant difference method

81
Q

In this method, a difference is calculated that is judged to be the smallest difference that is significant. The difference between each pair of means is then compared to the least significant difference to determine which means are different

A

least significant difference method

82
Q

is a result that is quite
different from the others in the data
set and might be due to a gross error.

A

outliers

83
Q

is a simple, widely used statistical test for deciding whether a suspected
result should be retained or rejected

A

Q test

84
Q

the absolute value of the difference
between the questionable result xq and its nearest neighbor xn is divided by the
spread w of the entire set to give the quantity Q:

A

Q test

85
Q

TRUE or FALSE
If Q is greater than Qcrit, the questionable result can be rejected with the indicated degree of confidence

A

TRUE

86
Q

TRUE or FALSE
the only valid reason for rejecting
a result from a small set of data is the sure knowledge that a mistake was made in the
measurement process. Without this knowledge, a cautious approach to rejection of an outlier is wise.

A

TRUE

87
Q

is used to compare more than two means and to determine whether any differences are real or the result of random errors.

A

ANOVA

88
Q

can be used to minimize sources of variability that influence both
members of a pair of measurements.

A

paired t test

89
Q

computes the t statistic and the probability of t assuming that H0 is true. If p
is small compared to the significance level a, H0 is rejected. Alternatively, a rejection
region is calculated from critical values of t.

A

t test

90
Q

asserts that the mean is equal to the accepted value

A

null hypothesis

91
Q

We can compare an experimental mean to a known value or an accepted level by a
hypothesis test, the ____________, if the population standard deviation is known.

A

z test

92
Q

is the interval within which the population mean is expected to lie with a certain probability.

A

confidence interval

93
Q
A