SAS Statistics Practice Exam Flashcards by Jennifer Joseph

What is the order of steps in both traditional statistical analyses and machine learning approaches to modeling?

Data exploration, variable modification, model fitting, model assessment

How well did you know this?

Not at all

Perfectly

Which task is best suited for deep learning as opposed to other machine learning techniques?

Recognizing the objects in an image or video.

How well did you know this?

Not at all

Perfectly

Which basic principle of statistics reflects the strength of support for results that are subject to small perturbations in input data?

Stability

How well did you know this?

Not at all

Perfectly

You are gathering data on purchase size of fountain drinks ranging from small to extra-large. What is the level of measurement for size?

Ordinal

How well did you know this?

Not at all

Perfectly

Which term has a different meaning than the other three?

Field

How well did you know this?

Not at all

Perfectly

Which terms refer to the same component of an analytical data set? (Choose 2.)

Object, Entity

How well did you know this?

Not at all

Perfectly

Which is an example of probability sampling?

Players are selected for teams by flipping a coin.

How well did you know this?

Not at all

Perfectly

Which type of sampling is used when the population is divided into groups using a categorical variable and then observations are randomly selected from these groups?

Stratified Sampling

How well did you know this?

Not at all

Perfectly

When you draw a sample of 10 balls from a bag of 100 balls of two colors using simple random sampling, what is the probability of each ball being selected?

1/100

How well did you know this?

Not at all

Perfectly

Differences in values can be meaningfully calculated for variables on which scales?

Ratio and Interval

How well did you know this?

Not at all

Perfectly

Which describes p-values in hypothesis testing?

A smaller p-value indicates stronger evidence against the null hypothesis.

How well did you know this?

Not at all

Perfectly

A lighting company claims that their LED light bulbs surpass the typical lifespan of traditional incandescent bulbs, which is 1000 hours. After examining a sample of 50 LED bulbs, the sample mean is 2200 hours, and the sample standard deviation is 200 hours. What statistical test is appropriate for determining if the new LED bulbs, on average, outlast traditional incandescent bulbs?

One-sample t-test

How well did you know this?

Not at all

Perfectly

Which measure quantifies the variability of an estimate, such as the sample mean?

Standard errror

How well did you know this?

Not at all

Perfectly

Which describes a Type II error?

Failing to reject the null hypothesis that is false.

How well did you know this?

Not at all

Perfectly

For an analysis in which the null hypothesis is known to be true, you perform a hypothesis test with α = 0.05. The analysis returns a p-value of 0.023. What does this demonstrate?

Type I error

How well did you know this?

Not at all

Perfectly

For an analysis in which the null hypothesis is known to be false, you perform a hypothesis test with α = 0.05. The analysis returns a p-value of 0.23. What does this demonstrate?

Type II error

How well did you know this?

Not at all

Perfectly

You measure patient blood pressure before and after each patient is administered a drug. How do you test the hypothesis that blood pressure remains the same before and after the drug is administered?

Paired t-test

How well did you know this?

Not at all

Perfectly

What is Pearson’s correlation coefficient (r) used for?

Detect collinearity

How well did you know this?

Not at all

Perfectly

Which of the relationships shown in the scatter plots below can be appropriately interpreted using a Pearson correlation coefficient? (Choose 2.)

Pearson correlation coefficients measure the strength of linear relationship, including no relationship. The Pearson correlation coefficient is not appropriate for non-linear relationships such as quadratic and cyclical.

How well did you know this?

Not at all

Perfectly

In a simple linear regression model where x predicts y, R2 = 0.6. What can you conclude from this information?

60% of the variance in y is explained by x.

How well did you know this?

Not at all

Perfectly

The Pearson’s correlation coefficient between x and y is 0.6. What can you conclude from this information?

The estimated linear association between x and y is 0.6.

How well did you know this?

Not at all

Perfectly

In a simple linear regression model where x predicts y, estimated β0 = 0.6. What can you conclude from this information?

When x=0, the predicted value of y is 0.6.

How well did you know this?

Not at all

Perfectly

In a simple linear regression model where x predicts y, estimated β1 = 0.6. What can you conclude from this information?

A one-unit change in x results in a predicted change in y of 0.6.

How well did you know this?

Not at all

Perfectly

Which displays a Pearson’s correlation coefficient of 0.7?

This graph shows a moderately strong positive linear association. /

How well did you know this?

Not at all

Perfectly

Which two values are divided to get the R-Square of 0.3751?

6593.01614, 17579

Which describe information criteria statistics? (Choose 2.)

A model with a lower value shows better fit than a model with a higher value., They allow comparison between models fitted to the same data.

Which of the following information criteria, due to its larger penalty, favors models with fewer parameters?

SBC

How many parameters are estimated when fitting a linear regression model with 3 continuous predictors and one categorical predictor with 3 levels?

Comparing two multiple linear regression models, which statistic indicates better model fit with larger values?

Adjusted R-square

Performing a multiple linear regression analysis, which variable will be removed in backward elimination?

Which phases of predictive modeling require a known target?

Training a model but not scoring a new data.

Which avoids an overly optimistic assessment of a predictive model's performance?

Assess the model performance on a data set not involved in fitting the model.

In terms of accuracy and precision, which indicates overfitting in a predictive model?

High accuracy and low precision

Which activities use a model to generate predictions for a data set that does not contain a target variable? (Choose 2.)

Model deployment, Model scoring

Which describes explanatory modeling?

It is used to test an already existing set of hypotheses.

An odds ratio of Group A to Group B is 4, with a 95% confidence interval [3,5]. What can be inferred from this?

Group B has 0.25 times the odds of the target event as Group A.

To test an association between two categorical variables for statistical significance, we use which statistic?

Chi-square statistic

What can a chi-square test be used for? (Choose 2.)

Feature selection in machine learning., Test the statistical significance of association between two categorical variables.

In logistic regression model assessment, which indicates the best fit?

Maximize concordant pairs; Minimize discordant and tied pairs

What characterizes a discordant pair? (Choose 2.)

Model cannot sort the event and non-event pair correctly., An observation with the event has a lower predicted probability of having the event than an observation without the event.

Using a cutoff of 35%, which Customer IDs would be classified as non-purchasers? (Choose 2.)

2, 4

A logistic regression is fit using maximum likelihood parameter estimation. How many times is the model refit when used to score new data sets?

Zero

What is the default cut-off for classification allocation, in logistic regression?

0.5

You are using a cutoff probability for logistic discrimination, where cases above the cutoff are allocated to class 1 (event) and cases below the cutoff are allocated to class 0 (non-event). What best describes this allocation rule?

The decision boundary is linear irrespective of the cutoff.

You are using a logistic regression model to score an observation. The predicted logit for the observation is zero (0). What is the predicted probability for this observation?

0.5

Which type of machine learning algorithm would you use if some of the training data has labels but most of it doesn't?

Semi-Supervised Learning

A neural network with no hidden layer, one interval target, and a linear function of inputs resembles which of the following statistical models?

Linear regression

What type of learning are nearest-neighbor mapping, k-means clustering, and singular value decomposition?

Unsupervised

Which machine learning method is used when the cost associated with labeling is too high to allow for a full set of labeled training cases, but partial labeling information is available?

Semi-Supervised Learning

A retail company wants to segment their customers into distinct groups based only on their demographic inputs. Which machine learning method is recommended?

Unsupervised learning

A researcher is conducting a study on the effects of a new medication on blood pressure over a 12-week period. Due to scheduling conflicts, 25 out of 300 participants missed their week 6 check-up. The researcher believes that the likelihood of missing the check-up is related to the participants' work schedules and not their actual blood pressure levels.

Missing at Random (MAR)

Which machine learning algorithm is less sensitive to unevenly scaled data, and may not necessarily require you to scale a set of features in your data?

Decision Tree

Which missing data mechanism in a regression analysis would result in the slope being biased and the R-square value being reduced?

Missing Not at Random (MNAR)

You are analyzing data for a predictive model, in which the target is whether a customer is likely to purchase a product. Some of the values of the variable Customer Spend are missing. You find out that this is due to random fluctuations in server traffic, occasionally observations in the database are corrupted, resulting in a small amount of missingness.

Missing completely at random (MCAR)

You are analyzing data for a predictive model, in which the target is whether a customer is likely to purchase a product. Some of the values of the variable Customer Spend are missing. You find out that this is because spending data is only collected for customers who have purchased the product. What is the most likely data missingness pattern?

Missing not at random (MNAR)

You are analyzing data for a predictive model, in which the target is whether a customer is likely to purchase a product. Some of the values of the variable Customer Spend are missing. You find out that this is due to a power outage that caused some of the data values to be corrupted from a specific region, resulting in a small amount of missingness in your training data.

Missing at random (MAR)

Which are the most interpretable models? (Choose 2.)

Regression, Decision Tree

The observed frequency for Bonus Eligible and Irregular Lot Shape is 31. What is the expected frequency for this cell?

13.6856 (Row Total*Column Total) / Sample Size = (93*44) / 299 = 13.6856

In an online sale, 2500 OrderID's were generated on a given day, out of which 125 values of OrderID variable were unique. What is the cardinality of OrderID variable?

125

If an observation with the target event has the same predicted probability as another observation without the target event, what type of pair is this?

Tie

SAS Statistics Practice Exam Flashcards

Practice for certification (60 cards)