Practice Test Questions Flashcards by Paul Hankewicz

If a numerical measure describes an aspect of a sample, is it a statistic or a parameter?

Statistic

How well did you know this?

Not at all

Perfectly

If a variable describes an individual by placing the individual in a category or group, is the variable quantitative or qualitative?

Qualitative

How well did you know this?

Not at all

Perfectly

If data consist of names, labels, or categories with no implied criteria by which the data can be ordered from smallest to largest. What is the highest level Of measurement for the data: nominal, ordinal, interval. or ratio?

Nominal (Trick question since data can be ordered, but isn’t)

How well did you know this?

Not at all

Perfectly

If it makes sense to say that one data measurement in a data set is twice that of another measurement in the set. what is the highest level of measurement for the data: nominal, ordinal, interval, or ratio?

Ratio (Anything scaled by a multiple = Ratio)

How well did you know this?

Not at all

Perfectly

Consider a sample of size n. If every sample of size n has equal chance of selected, what is the type of sample: stratified, systematic. simple random, cluster?

Simple Random

How well did you know this?

Not at all

Perfectly

If a treatment is applied to subjects or objects in a study in order to observe a possible change in the variable of interest. Is the study an observational study or an experiment?

Experiment

How well did you know this?

Not at all

Perfectly

Sudoku is a puzzle consisting of squares arranged in 9 rows and 9 columns. fie 81 squares are further divided into nine 3 x 3 square boxes. The object is to fill in the squares with numerals 1 through 9 so that each column, row, and box contains all nine. However, there is a requirement that each number appear only once in any row, column, or box. Each puzzle already has numbers in some of the squares. Would it appropriate to use a random-number table to select a digit for each blank square?

Because of the requirement that each appear number only once in a given column, row, or box, it would be very inefficient to use a random-number table to select the numbers. It’s better to simply look at existing numbers. list possibilities that meet the requirement, and eliminate numbers that don’t work.

How well did you know this?

Not at all

Perfectly

Alisha wants to do a statistical study to determine how long it takes people to complete a Sudoku puzzle: Her plan is as follows: 1. Download 10 different puzzles from the Internet. 2. Find 10 friends willing to participate. 3. Ask each friend to complete one of the puzzles and time him or herself. 4. Gather the completion times from each friend. Describe some of the problems With Alisha’s plan for the study. Are the results from Alisha’s study anecdotal. or do they apply to the general population?

There are many problems. For instance, it is not clear that the puzzles she wants to download are of the same difficulty. Friends willing to participate may have different levels of experience with the puzzles. Conditions under which the puzzles are solved are not specified. The results of the study would be case by case, therefore anecdotal.

How well did you know this?

Not at all

Perfectly

You are conducting a study of students doing work-study jobs on your campus. Among the questions on the survey instrument are: A. How many hours are you scheduled to work each week? Answer to the nearest hour. B. How applicable is this work experience to your future employment goals? Respond using the following scale: I = not at all, 2 = somewhat, 3 = very. (a) Suppose you take random samples from the following groups: freshman. sophomores, juniors, and seniors. What kind of sampling techniques are +you using (simple random, stratified, systematic, cluster, multistage, convenience)?

Stratified

How well did you know this?

Not at all

Perfectly

Students on your campus with work-study jobs

How well did you know this?

Not at all

Perfectly

You are conducting a study of students doing work-study jobs on your campus. Among the questions on the survey instrument are: A. How many hours are you scheduled to work each week? Answer to the nearest hour. (c) What is the variable for question A? Classify the variable as qualitative or quantitative. What is the level of the measurement?

Hours scheduled; quantitative; ratio.

How well did you know this?

Not at all

Perfectly

You are conducting a study of students doing work-study jobs on your campus. Among the questions on the survey instrument are: B. How applicable is this work experience to your future employment goals? Respond using the following scale: I = not at all, 2 = somewhat, 3 = very. (d) What is the variable for question B? Classify the variable as qualitative or quantitative. What is the level of the measurement?

Rating of applicability of work experience to future employment; qualitative; ordinal.

How well did you know this?

Not at all

Perfectly

You are conducting a study of students doing work-study jobs on your campus. Among the questions on the survey instrument are: B. How applicable is this work experience to your future employment goals? Respond using the following scale: I = not at all, 2 = somewhat, 3 = very. (e) Is the proportion of responses “3 = very” to question B a statistic or a parameter?

Statistic (Trick question, 3 = Very is a Parameter but the Proportion of answers is a Statistic)

How well did you know this?

Not at all

Perfectly

You are conducting a study of students doing work-study jobs on your campus. Among the questions on the survey instrument are: A. How many hours are you scheduled to work each week? Answer to the nearest hour. B. How applicable is this work experience to your future employment goals? Respond using the following scale: I = not at all, 2 = somewhat, 3 = very. (f) Suppose only of the students you selected for the sample respond. What is the nonresponse rate? Do you think the nonresponse rate might introduce bias into the study? Explain.

60%; the people choosing not to respond may have some characteristics, such as not working many hours, that would bias the study

How well did you know this?

Not at all

Perfectly

You are conducting a study of students doing work-study jobs on your campus. Among the questions on the survey instrument are: A. How many hours are you scheduled to work each week? Answer to the nearest hour. B. How applicable is this work experience to your future employment goals? Respond using the following scale: I = not at all, 2 = somewhat, 3 = very. (g) Would it be appropriate to generalize the results of your study to all work-study students in the nation? Explain.

No. The sample frame is restricted to one campus.

How well did you know this?

Not at all

Perfectly

A radio talk show host asked listeners to respond either yes or no to the question, “Is the candidate who spends the most on a campaign the most likely to win?” Fifteen people called in, and nine said yes. What is the implied population?

Population: opinions of all listeners

How well did you know this?

Not at all

Perfectly

variable: opinion of a caller

How well did you know this?

Not at all

Perfectly

Yes. a voluntary response

How well did you know this?

Not at all

Perfectly

The U.S. Department of Justice examined all reported cases of identity theft for U.S. residents aged 16 or older. Their data show that of all the reported incidents of identity theft in a recent year. 40% involved existing credit card accounts. You are to design a simulation of seven reported identity thefts showing which ones involve existing credit card accounts and which ones do not. How would you assign the random digits 0 through 9 to the two categories “Does” and “Does not” involve existing credit card accounts? Use your random-digit assignment and the random-number table to generate the results from a random sample of seven identity thefts. If you do the simulation again, do you expect to get exactly the same results?

Assign the digits so that 4 out of the 10 digits (0 through 9) correspond to the result “Does” and 6 of the digits correspond to the result “Does not” involve existing credit card accounts. One assignment is digits 1, 2, 3 correspond to “Does” while the remaining digits 4, 5, 6. 7, 8, 9 correspond to “Does not” involve such accounts. Starting with line 1, block 1 of Table 1, the sequence gives “Does not,” “Does,” “Does not,” “Does,” “Does.” “Does not.” “Does not.” We cannot necessarily expect another simulation to give the same result. The results depend on the response assigned to the digits and the section of the random. number table used.

How well did you know this?

Not at all

Perfectly

What type of sampling is used in the following? (simple random, stratified, systematic, cluster, or convenience) To conduct a pre-election opinion poll on a proposed amendment to the state constitution. a random sample of 10 telephone prefixes (first three digits of the phone number) was selected, and all households from the phone prefixes selected were called.

Cluster

How well did you know this?

Not at all

Perfectly

What type of sampling is used in the following? (simple random, stratified, systematic, cluster, or convenience) To conduct a study on depression among the elderly, a sample of 30 patients in one nursing home was used.

Convenience

How well did you know this?

Not at all

Perfectly

What type of sampling is used in the following? (simple random, stratified, systematic, cluster, or convenience) To maintain quality control in a brewery, every 20th bottle of beer coming off the production line was opened and tested.

Systematic

How well did you know this?

Not at all

Perfectly

What type of sampling is used in the following? (simple random, stratified, systematic, cluster, or convenience) Subscribers to a new smartphone app that streams songs were assigned numbers. Then a sample of 30 subscribers was selected by using a random-number table. The subscribers in the sample were invited to rate the process for selecting the songs in playlist.

Random

How well did you know this?

Not at all

Perfectly

What type of sampling is used in the following? (simple random, stratified, systematic, cluster, or convenience) To judge the appeal of a proposed television sitcom, a random sample of to people from each of three different age categories was selected and those chosen were asked to rate a pilot show.

Stratified

How well did you know this?

Not at all

Perfectly

What type of graphing is used in the following? (histogram, relative frequency graph, ogive) Shows cumulative frequency (or percent of data) falling at or below each upper class boundary in a frequency table

Ogive

What type of graphing is used in the following? (histogram, relative frequency graph, ogive) Shows number of data falling within each distinct class of a frequency table.

Histogram

What type of graphing is used in the following? (histogram, relative frequency graph, ogive) Shows the relative frequency (or percent) or all data falling within each class of a frequency table

relative frequency

Which type(s) of data can be shown in a histogram? (quantitative, qualitative (also known as category). or both)

quantitative

Which type(s) of data can be shown in a bar graph? (quantitative, qualitative (also known as category). or both)

Both

Which graphical display shows each data value (or truncated data value) in order from smallest to largest: histogram, pie chart, or stem-and-leaf?

Steam and Leaf

If a histogram is skewed left, more or the data falls on which side? right or left?

Right

How are data plotted in a time-series graph: by in order from smallest to largest, or at regular intervals over time

at regular intervals over time

Consider these types of graphs: histogram, bar graph, Pareto chart, pie chart, stem-and-leaf display. (a) Which are suitable for qualitative data? (b) Which are suitable for quantitative data?

(a) Bar graph, Pareto chart, pie chart. (b) All.

A consumer interest group is tracking the percentage of household income spent on gasoline over the past years. Which graphical display would be more useful, a histogram or a time-series graph? Why?

Time series

Describe how data outliers might be revealed in histograms and stem-and-leaf plots.

Any large gaps between bars or stems with leaves at the beginning or end of the data set might indicate that the extreme data values are outliers.

How are dotplots and stem-and-leaf displays similar? how are they different?

Dotplots and stem-and-leaf displays both show all the data values. Stem-and-leaf displays group all data values having the same stem, whereas dotplots group only data values that are exactly the same.

The timeplot gives the number of state and federal prisoners per 100,000 population (a) Estimate the number of prisoners per people for 1980 and for 1997.

140 in 1980 and 440 in 1997

The timeplot gives the number of state and federal prisoners per 100,000 population b) During the period shown. there was increased prosecution of the offenses, longer sentences for conunnn crimes, and reduced access to parole. What does the time-series graph say about the prison population change per 100,000 people?

It is Increasing

The timeplot gives the number of state and federal prisoners per 100,000 population (c) In 1997, the U.S. population was approximately 266,574,000 people. At the rate of 444 prisoners per 100,000 population, about how many prisoners were in the system? The projected U.S. population for the year 2020 is 323,724,000. If the rate of prisoners per 100,000 stays the same as in 1997, about how many prisoners do you expect will be in the system in 2020?

1,183.589; 1437.335.

Driving under the influence of alcohol (DUI) is a serious offense. The following data give the ages of a random sample of 50 drivers arrested while driving under the influence of alcohol. This distribution is based on the age distribution of DUI arrests given in the Statistical Abstract of rhe United States (12th edition). (b) Is it skewed to the right or left?

Slightly right

Consider the following measures of central tendency: mean. median, mode. Match each type to the appropriate description: (i) the central value o fa data set after it has been ordered from smallest to largest (ii) the data value occurring most frequently in a data set (iii) the sum of all the data values in a data set divided by the number of data values in the set

(i) Median. (ii) Mode. (iii) Mean

Consider the statement: For a trimmed mean, we eliminate the bottom 5% of the data from an ordered data set and then compute the mean of the remaining data. Is the statement true or false?

**False**: Eliminate 5% of the data from both the bottom and the top Of the ordered

When we compute a sample standard deviation of a data set Terminology do we subtract the sample mean or the sample median from each of the data

Sample Mean

Consider the following terms: outlier, coefficient of variation, range. Match each term to the appropriate description. (i) The difference between the largest and the smallest data value of a data set (ii) The spread of a data set measured by the standard deviation relative 10 the mean of the data set iii) An unusually targe or small data value in a data set

(i) Range (ii) Coefficient of variation (iii) Outlier

Terminology Consider the following symbols: s,σ,x̅,μ Match each of the symbols to the appropriate name: (i) sample mean (ii) sample standard deviation (ii) population mean (iv) population standard deviation

* *1. x̅ 2. s 3. μ 4. σ**

How is the standard deviaion related to the variance of a data set

The standard deviation is the square root of the variance.

In a box-and-whisker plot, which measure of central tendency is displayed: mean, median, or mode?

Median

Consider the following statement: If you answered 90% of the questions on a test correctly, then your score is in the 90th percentile, T/F? Explain.

False. The 90th percentile refers to a score 90% of the scores fall at or below.

What measures of variation indicate spread about the mean?

Variance and standard deviation,

Which graphic display shows the median and the data spread about the median?

Box and Whisker Plot

Look at the two histograms, each involves the same number of data. The data are the whole numbers, so the height of each bar represents the number of values equal to the corresponding midpoint shown on the horizontal axis. Notice both are symmetric. (b) Which distribution has the larger standard deviation? Why?

The second one since more of the data are farther away from the mean.

Look at the two histograms, each involves the same number of data. The data are the whole numbers, so the height of each bar represents the number of values equal to the corresponding midpoint shown on the horizontal axis. Notice both are symmetric. a) Estimate the mode, median, and mean for each histogram.

For both histograms, mode = 7; median = 7; mean = 7.

Consider the following Minitab display of two data sets. (c) Compare the interquartile ranges of the two sets. How do the middle halves of the data sets compare?

The C1 distribution has a larger interquartile range that is symmetric around the median. The C2 distribution has a very compressed interquartile range with the median equal to Q3.

Consider the following Minitab display of two data sets. (b) Which data set seems more symmetric? Why?

The Cl distribution seems more symmetric because the mean and median are equal, and the median is in the center of the interquartile range. In the C2 distribution, the mean is less than the median.

Consider the following Minitab display of two data sets. (a) What are the respective means? the respective ranges?

For both data sets, mean = 20 and range = 24.

"Radon: The Problem No One Wants to Face" is the title of an article appearing in Consumer Reports. Radon is a gas emitted from the ground that can collect in houses and buildings. At certain levels it can cause lung cancer. Radon concentrations are measured in picocuries per liter (pCi/L). A radon level of 4 pCi/L is considered "acceptable." Radon levels in a house vary from week to week. In one house, a sample of 8 weeks had the following readings for radon level (in pCi/L): 1. 9, 2.8, 5.7, 4.2, 1.9, 8.6, 3.9, 7.2 (c) Based on the data, would you recommend radon mitigation in this house? Explain.

Yes

"Radon: The Problem No One Wants to Face" is the title of an article appearing in Consumer Reports. Radon is a gas emitted from the ground that can collect in houses and buildings. At certain levels it can cause lung cancer. Radon concentrations are measured in picocuries per liter (pCi/L). A radon level of 4 pCi/L is considered "acceptable." Radon levels in a house vary from week to week. In one house, a sample of 8 weeks had the following readings for radon level (in pCi/L): 1. 9, 2.8, 5.7, 4.2, 1.9, 8.6, 3.9, 7.2 (b) Find the sample standard deviation, coeffcient of variation. and range.

s - 2.46; cv- 544%; range = 6.7.

``` mean = 4.53; median = 4.05 mode = 1.9 ```

31 33 34 34 35 35 35 36 38 38 38 39 40 40 40 40 41 41 41 41 41 41 41 42 42 43 44 44 44 45 45 46 46 46 46 47 48 49 49 49 49 50 51 52 52 53 53 53 53 53 55 56 56 57 57 59 62 66 66 68 (b) Get Sample Mean and Sample Standard Deviation

``` Mean = 46.15 s = 8.63 ```

31 33 34 34 35 35 35 36 38 38 38 39 40 40 40 40 41 41 41 41 41 41 41 42 42 43 44 44 44 45 45 46 46 46 46 47 48 49 49 49 49 50 51 52 52 53 53 53 53 53 55 56 56 57 57 59 62 66 66 68 (b) Make a frequency table using five classes. Then estimate the meanand sample standard deviation using the frequency table. Compute a 75% Chebyshev interval centered around the mean.

31 33 34 34 35 35 35 36 38 38 38 39 40 40 40 40 41 41 41 41 41 41 41 42 42 43 44 44 44 45 45 46 46 46 46 47 48 49 49 49 49 50 51 52 52 53 53 53 53 53 55 56 56 57 57 59 62 66 66 68 (a) Make a box-and-whisker plot of the data. Find the interquartile range.

``` Low = 31; Q1 = 40; median = 45; Q3 = 52.5: high = 68; IQR = Q3 - Q1 = 12.5. ```

Practice Test Questions Flashcards

(63 cards)