SOA PA Flashcards

1
Q

bar chart

A

geom_bar()

2
Q

box plot

A

geom_boxplot()

3
Q

histogram

A

geom_histogram()

4
Q

scatterplot

A

geom_point()

5
Q

smoothed line

A

geom_smooth()

6
Q

ggplot alpha

A

transparency parameter
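
A minimal sketch tying the geoms above together, using the built-in mtcars data purely as a stand-in dataset (assumes ggplot2 is loaded):

library(ggplot2)
# scatterplot with transparency plus a smoothed trend line
ggplot(mtcars, aes(x = wt, y = mpg)) +
  geom_point(alpha = 0.5) +
  geom_smooth()
# bar chart of a categorical variable; histogram of a numeric one
ggplot(mtcars, aes(x = factor(cyl))) + geom_bar()
ggplot(mtcars, aes(x = mpg)) + geom_histogram(bins = 10)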

7
Q

display separate plots

A

facet_wrap(~ var, ncol = n)

8
Q

two-dimensional grid of plots

A

facet_grid(row_var ~ col_var)
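
A brief sketch of both faceting calls, again with mtcars as a placeholder dataset:

library(ggplot2)
p <- ggplot(mtcars, aes(x = wt, y = mpg)) + geom_point()
p + facet_wrap(~ cyl, ncol = 2)   # separate panels split by one variable
p + facet_grid(cyl ~ gear)        # two-dimensional grid of panels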

9
Q

adjust axes range

A

xlim() & ylim()

10
Q

convert axes to log scales

A

scale_x_log10() & scale_y_log10()

11
Q

edit titles, subtitles, and captions

A

labs(), xlab(), ylab(), ggtitle()

12
Q

display multiple graphs

A

grid.arrange() in gridExtra
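
A sketch combining the axis, label, and layout tools from the last few cards (gridExtra is assumed to be installed; mtcars is again a stand-in):

library(ggplot2)
library(gridExtra)
p1 <- ggplot(mtcars, aes(x = wt, y = mpg)) + geom_point() +
  xlim(1, 6) + ylim(10, 35) +
  labs(title = "Fuel economy vs. weight", x = "Weight", y = "MPG")
p2 <- ggplot(mtcars, aes(x = hp, y = mpg)) + geom_point() +
  scale_x_log10()                  # log-scale x-axis
grid.arrange(p1, p2, ncol = 2)     # display the two plots side by side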

13
Q

numeric var descriptive stats code

A

summary()

14
Q

numeric var distribution displays

A

histograms, box plots

15
Q

correct for skewness

A

log transformation

16
Q

categorical var descriptive stats code

A

table()

17
Q

categorical var graphical displays

A

bar charts

18
Q

numeric v numeric descriptive stats code

A

cor()

19
Q

numeric v numeric graphical display

A

scatterplot

20
Q

numeric v categorical descriptive stats

A

???

21
Q

numeric v categorical graphical display

A

split boxplots, histograms

22
Q

categorical v categorical descriptive stats code

A

table()
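
A sketch of the exploration tools from the last several cards on a generic data frame df; num1, num2, cat1, and cat2 are placeholder column names:

summary(df$num1)                   # descriptive stats for a numeric variable
table(df$cat1)                     # frequency table for a categorical variable
cor(df$num1, df$num2)              # numeric vs. numeric
table(df$cat1, df$cat2)            # categorical vs. categorical (two-way table)
library(ggplot2)
ggplot(df, aes(x = cat1, y = num1)) + geom_boxplot()   # split box plots
ggplot(df, aes(x = log(num1))) + geom_histogram()      # log transform for right skew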

23
Q

discrete var

A

restricted to certain values

24
Q

continuous var

A

can assume any value in theory

25
Q

levels

A

predefined values of a categorical var

26
Q

supervised learning

A

understand relationships of predictors and target var

27
Q

unsupervised learning

A

no target var; solely var relationship extraction

28
Q

numeric target predictive model

A

regression model

29
Q

categorical target predictive model

A

classification model, classifier

30
Q

training/test split

A

70-80%/20-30%
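
One way to sketch the split in base R (the seed and the 70/30 proportions are arbitrary; caret::createDataPartition is a common alternative):

set.seed(123)                          # for reproducibility
n_train   <- floor(0.7 * nrow(df))     # 70% training, 30% test
train_idx <- sample(nrow(df), n_train)
train <- df[train_idx, ]
test  <- df[-train_idx, ]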

31
Q

root mean squared error

A

aggregated prediction errors to measure regression accuracy

32
Q

test classification error rate

A

measures classifier accuracy
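
Both accuracy measures reduce to one-liners; actual, pred, actual_class, and pred_class are placeholder vectors of test-set values:

rmse <- sqrt(mean((actual - pred)^2))        # test RMSE for a regression model
err  <- mean(pred_class != actual_class)     # test classification error rate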

33
Q

cross-validation

A

technique to select hyperparameters

34
Q

hyperparameters

A

parameters that have to be supplied in advance and are not optimized as part of the model training process

35
Q

bias-variance tradeoff

A

a more complex (more flexible) model has lower bias but higher variance than a less flexible model

36
Q

bias

A

difference between the expected value of the fitted model f̂ and the true value of the signal function

37
Q

variance

A

quantifies the amount by which the fitted model f̂(x) would change if a different training set were used

38
Q

irreducible error

A

variance of the noise
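
The three quantities above combine in the usual decomposition of the expected test error at a point x0 (f̂ is the fitted model, ε the noise):

expected test MSE at x0 = Var(f̂(x0)) + [Bias(f̂(x0))]^2 + Var(ε)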

39
Q

more complex model has

A

lower bias but higher variance

40
Q

overfitting

A

when a model is unnecessarily complex, resulting in the misinterpretation of noise as the underlying signal

41
Q

underfitting

A

when a model is too general/basic, resulting in little or no capturing of the signal

42
Q

feature

A

derivations from the original variables that provide an alternative, more useful view of the information contained in the dataset

43
Q

variables

A

raw measurements that are recorded and constitute the original dataset prior to any data transformation

44
Q

feature generation

A

the process of developing new features based on existing variables in the data

45
Q

feature selection

A

the procedure of dropping features with limited predictive power and therefore reducing the dimension of the data

46
Q

combining sparse categories with others

A

ensures that each level has a sufficient number of observations / preserves the differences in the behavior of the target variable among different factor levels

47
Q

simple linear regression

A

regression using one predictor

48
Q

multiple linear regression

A

regression using more than one predictor

49
Q

regression coefficient

A

coefficient of the predictor

50
Q

ordinary least squares

A

choosing the coefficient estimates so as to minimize the sum of the squared differences between the observed target values and the fitted values under the model

51
Q

design matrix

A

contains the values of predictors
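
Putting the last two cards together, a standard closed-form result (stated as a reminder, not derived): with design matrix X and target vector y, ordinary least squares minimizes RSS = sum of (y_i - ŷ_i)^2 and yields the estimates β̂ = (X'X)^(-1) X'y.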

52
Q

residual

A

the discrepancy between the observed target value and the corresponding predicted value on either the training set or test set

53
Q

t-statistic

A

the ratio of the corresponding least squares estimate to its estimated standard error / measures the partial effect of a var on the target var

54
Q

coefficient of determination R^2

A

proportion of the variation of the target var that can be explained by the fitted linear model
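
In symbols, with RSS the residual sum of squares and TSS the total sum of squares of the target: R^2 = 1 - RSS/TSS.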

55
Q

f-statistic

A

assesses the joint significance of the entire set of predictors (null hypothesis: all slope coefficients are zero) against the alternative that at least one of them is nonzero

56
Q

akaike information criterion

A

balances the goodness of fit of a model to the training data with the complexity of the model captured by the number of parameters, which acts as a penalty term penalizing an overfitted model / the smallest AIC provides the best model

57
Q

bayesian information criterion

A

balances the goodness of fit of a model to the training data with the complexity of the model captured by the number of parameters, which acts as a penalty term penalizing an overfitted model / the smallest BIC provides the best model
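
For a model with p parameters, maximized log-likelihood loglik, and n training observations, the usual forms are AIC = -2*loglik + 2p and BIC = -2*loglik + p*ln(n), so BIC applies the heavier per-parameter penalty whenever ln(n) > 2.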

58
Q

model diagnostics

A

quantitative and graphical tools that are used to identify evidence against the model assumptions and, if found, to refine the specification of the model in an effort to improve adequacy

59
Q

residuals vs fitted plot

A

plot of the residuals against the fitted values

60
Q

normal q-q plot

A

plot of the quantiles of the standardized residuals against the theoretical standard normal quantiles and can be used to check the normality of the random errors

61
Q

polynomial regression

A

regression that includes polynomial (power) terms of the predictors to capture a nonlinear relationship between the target and the predictors

62
Q

binarization

A

turns a given categorical predictor into a collection of binary variables, each of which serves as an indicator of one and only one level of the categorical predictor

63
Q

interaction term

A

βX1X2 (the product of two predictors included with its own coefficient)

64
Q

backward selection

A

start with all features and, one at a time, drop the feature whose removal produces the greatest improvement in the model according to a certain criterion, until no further improvement can be made

65
Q

forward selection

A

start the model with just the intercept and augment the model by progressively adding the feature that results in the greatest improvement in the model, until no features can be added to improve the model

66
Q

best subset selection

A

fitting all possible combinations of features and selecting the best model according to a certain criterion

67
Q

regularization/penalization/shrinkage

A

alternative to stepwise selection for feature selection and reducing model complexity

68
Q

lambda

A

regularization parameter

69
Q

regularization penalty

A

captures the size of the regression coefficients

70
Q

ridge regression

A

regularization whose penalty term is the sum of squares of the slope coefficients

71
Q

lasso regression

A

regularization whose penalty term is the sum of the absolute values of the slope coefficients

72
Q

elastic net

A

a combined regularization method of both ridge and lasso regression

73
Q

alpha

A

mixing coefficient
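
In the glmnet parameterization (assumed here), the elastic net penalty is lambda * Σ_j [ (1 - alpha)*beta_j^2/2 + alpha*|beta_j| ], so alpha = 0 gives ridge regression and alpha = 1 gives the lasso.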

74
Q

regularization is used to trade off these 2 desirable characteristics of the coefficient estimates

A

model fit and model complexity

75
Q

desirable model fit

A

we want coefficient estimates that match the training data well in the sense that the training RSS is reasonably small

76
Q

desirable model complexity

A

we want coefficient estimates that are small in absolute value so that the model is less prone to overfitting

77
Q

standardization

A

before performing regularization, it is judicious to standardize the predictors by dividing each by its standard deviation

78
Q

when lambda = 0

A

regularization penalty vanishes and the coefficient estimates are identical to the ordinary least squares estimates

79
Q

when lambda = infinity

A

regularization penalty dominates and the estimates of the slope coefficients have no choice but to all be zero

80
Q

when lambda increases

A

the effect of regularization becomes more severe

81
Q

lasso and elastic net feature selection

A

the coefficients can be forced to 0

82
Q

hyperparameters

A

alpha & lambda

83
Q

hyperparameter tuning

A

cross-validation
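
A hedged sketch of tuning lambda by cross-validation with glmnet; the target and predictor names are placeholders, and alpha = 0.5 is an arbitrary elastic net mix:

library(glmnet)
X <- model.matrix(target ~ ., data = df)[, -1]   # numeric predictor matrix, intercept dropped
y <- df$target
cv_fit <- cv.glmnet(X, y, alpha = 0.5)           # cross-validates over a grid of lambda
coef(cv_fit, s = "lambda.min")                   # coefficients at the best lambda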

84
Q

GLM distributions

A

continuous (positive) data, binary data, count data, aggregate loss data

85
Q

selecting link function

A

appropriateness of predictions, interpretability, canonical link

86
Q

log link

A

ensures positive predictions and is easy to interpret

87
Q

logit link

A

the log link applied to the odds; usually used for binary data so that predicted probabilities stay between 0 and 1

88
Q

weights

A

observations of the target var are averaged by exposure

89
Q

offsets

A

observations are values aggregated over all exposure units
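
A sketch of how family, link, weights, and offsets appear in a glm() call; the variable names are placeholders:

# count target, log link, exposure entering through an offset
freq_mod <- glm(claim_count ~ age + region, family = poisson(link = "log"),
                offset = log(exposure), data = df)
# averaged target (e.g. average severity), weighted by the number of claims
sev_mod <- glm(avg_severity ~ age + region, family = Gamma(link = "log"),
               weights = claim_count, data = df)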

90
Q

deviance

A

goodness-of-fit measure for GLMs which measures the extent to which the GLM departs from the saturated model (the model that fits the data perfectly)

91
Q

deviance residual

A

the signed square root of the contribution of the ith observation to the deviance

92
Q

deviance residual properties:

A

approximately normally distributed even if the target distribution is not normal; no systematic patterns; constant variance

93
Q

Penalized Likelihood measures

A

AIC and BIC

94
Q

confusion matrix

A

tabular display of how the predictions of a binary classifier line up with the observed classes

95
Q

sensitivity

A

the relative frequency of correctly predicting an event of interest when the event does take place, or equivalently, the ratio of TP to the total positive events

96
Q

specificity

A

the relative frequency of correctly predicting a non-event when there is indeed no event, or the ratio of TN to the total negative events

97
Q

classification error rate

A

(FN + FP) / n
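
A sketch computing these measures from a confusion matrix; pred_class and actual are placeholder factors with levels "0" and "1" (caret::confusionMatrix is a common shortcut):

cm <- table(predicted = pred_class, actual = actual)
TP <- cm["1", "1"]; TN <- cm["0", "0"]
FP <- cm["1", "0"]; FN <- cm["0", "1"]
sensitivity <- TP / (TP + FN)         # true positive rate
specificity <- TN / (TN + FP)         # true negative rate
error_rate  <- (FN + FP) / sum(cm)    # classification error rate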

98
Q

ROC curve

A

graphical tool plotting the sensitivity against the specificity of a given classifier for each cutoff in the range [0, 1]

99
Q

area under the curve (AUC)

A

the exact value of the AUC may not mean much for the quantitative assessment of a classifier and in real applications it is often the relative value of the AUC that matters; the higher the better

100
Q

AUC = 1

A

the highest possible value of the AUC, which is attained by a classifier with perfect discriminatory power

101
Q

AUC = 0.5

A

the naive classifier which classifies the observations purely randomly without using the information contained in the predictors
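
One way to sketch the ROC curve and AUC is with the pROC package (assumed installed); pred_prob is a placeholder vector of predicted probabilities:

library(pROC)
roc_obj <- roc(actual, pred_prob)   # sensitivity/specificity across all cutoffs
plot(roc_obj)                       # ROC curve
auc(roc_obj)                        # area under the curve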

102
Q

node

A

point on a decision tree that corresponds to a subset of the training data

103
Q

root node

A

the node at the top of the decision tree representing the full dataset

104
Q

terminal node (leaf)

A

nodes at the bottom of a tree which are not split further

105
Q

binary tree

A

each node has only two children

106
Q

depth

A

number of tree splits needed to go from the tree’s root node to the furthest terminal node

107
Q

every time we make a binary split, there are two inter-related decisions we have to make:

A

the predictor to split on / the corresponding cutoff level, given the split predictor

108
Q

regression tree

A

tree model for a numeric target variable; analogous to linear models, the variability of the target in a particular node is quantified by the residual sum of squares (RSS)

109
Q

classification tree

A

tree model where the target variable can take a discrete set of values

110
Q

entropy

A

value increases with the degree of impurity in the node

111
Q

gini

A

the higher the degree of node impurity, the higher the value of gini
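
For a node with class proportions p_1, ..., p_K: entropy = -Σ_k p_k*log2(p_k) and Gini = Σ_k p_k*(1 - p_k); both equal zero for a pure node and are largest when the classes are evenly mixed.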

112
Q

cost-complexity pruning

A

technique of controlling the complexity of a tree

113
Q

relative training error

A

training error of a tree scaled by the training error of the simplest tree, i.e., the tree with no splits
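
A hedged sketch of fitting and pruning a tree with rpart; the formula and cp values are placeholders, and cp plays the role of the cost-complexity parameter:

library(rpart)
tree <- rpart(target ~ ., data = train, method = "anova",   # "class" for a classification tree
              control = rpart.control(cp = 0.001))
plotcp(tree)                      # cross-validated error across values of cp
pruned <- prune(tree, cp = 0.01)  # cost-complexity pruning at a chosen cp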

114
Q

principal components analysis

A

advanced data analytic technique that transforms a high-dimensional dataset into a smaller, much more manageable set of representative variables that capture most of the information in the original dataset

115
Q

principal components

A

composite variables of the existing variables generated such that they are mutually uncorrelated and collectively simplify the dataset, reducing its dimension and making it more amenable to data exploration

116
Q

loadings

A

weights of the PCs

117
Q

PCA applications

A

data visualization and feature generation

118
Q

drawbacks of PCA

A

target variable is ignored / interpretability
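
A sketch of PCA on the numeric columns of a data frame (df_numeric is a placeholder; centering and scaling are the usual explicit choices):

pca <- prcomp(df_numeric, center = TRUE, scale. = TRUE)
summary(pca)      # proportion of variance explained by each PC
pca$rotation      # loadings
head(pca$x)       # scores: the PCs, usable as new features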

119
Q

k-means clustering

A

assigning each observation in a dataset into one and only one of k predefined clusters

120
Q

hierarchical clustering

A

building a hierarchy of nested clusters without prespecifying the number of clusters; the number of clusters is chosen afterward by cutting the dendrogram
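
A sketch of both clustering methods on standardized numeric data; k = 3 and complete linkage are arbitrary choices:

X  <- scale(df_numeric)                      # standardize the variables first
km <- kmeans(X, centers = 3, nstart = 20)    # k-means with k = 3 clusters
km$cluster                                   # cluster assignments
hc <- hclust(dist(X), method = "complete")   # hierarchical clustering
plot(hc)                                     # dendrogram
cutree(hc, k = 3)                            # cut into 3 clusters afterward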

121
Q

stepAIC()

A

function in the MASS package that automates stepwise model selection, in each step adding (forward selection) or dropping (backward selection) the feature to produce the greatest improvement in the model according to a certain criterion (AIC by default)
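
A sketch of the call; full_model is a placeholder fitted lm/glm object:

library(MASS)
reduced <- stepAIC(full_model, direction = "backward")   # AIC is the default criterion
# passing k = log(nrow(train)) in the call would select by BIC instead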

122
Q

AIC vs BIC

A

the only difference between the two is the size of the penalty applied per parameter (2 for AIC vs. ln(n) for BIC), so BIC penalizes model complexity more heavily