Introduction Flashcards

Question 1

Q

Non-Technical Definition: Statistical Learning

Answer

A

A broad set of tools for understanding and extracting information from data.

Question 2

Q

Non-Technical Definition: Supervised Learning

Answer

A

Methods for predicting an output (response) from one or more inputs (predictors), fitted on data in which the correct output is known for each observation.

Question 3

Q

Non-Technical Definition: Unsupervised Learning

Answer

A

Methods for finding structure in data with inputs but no known or labeled output.

Question 4

Q

Definition: Regression Problem

Answer

A

Predicting a continuous (quantitative) response.

Question 5

Q

Definition: Classification Problem

Answer

A

Predicting a discrete (qualitative) output variable.

Question 6

Q

What is Dimension Reduction?

Answer

A

Summarizing or transforming high-dimensional data into fewer dimensions while retaining key information.

Question 7

Q

Difference Between Classification and Regression

Answer

A

Classification predicts a discrete category (e.g. ‘Up’ or ‘Down’), while regression predicts a numeric value.

Question 8

Q

Key Premise of ISLR #1

Answer

A

Statistical learning methods are broadly useful across many fields, not just statistics.

Question 9

Q

Key Premise of ISLR #2

Answer

A

Statistical learning should not be seen as a ‘black box’; understanding the assumptions and trade-offs is crucial.

Question 10

Q

Key Premise of ISLR #3

Answer

A

We need not master the deep mathematical details to effectively use these methods.

Question 11

Q

Key Premise of ISLR #4

Answer

A

Practical real-world applications are the main focus, with hands-on labs demonstrating methods in R.

Question 12

Q

Notation: n and p

Answer

A

n is the number of observations (distinct data points) in a data set; p is the number of variables (features) available for making predictions.

Question 13

Q

Matrix Representation of Data X

Answer

A

X is an n×p matrix, where each row is an observation and each column is a variable.
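
As a minimal sketch of this layout, assuming a plain Python list of rows stands in for X (the data values and the helper name `entry` are made up for illustration), n and p can be read off the shape, and x_ij is the value of variable j for observation i:

```python
# Hypothetical data: 3 observations (rows), 2 variables (columns).
X = [
    [1.0, 2.0],
    [3.0, 4.0],
    [5.0, 6.0],
]

n = len(X)       # number of observations (rows)
p = len(X[0])    # number of variables (columns)


def entry(X, i, j):
    """Return x_ij using the 1-based indices of the notation."""
    return X[i - 1][j - 1]


print(n, p)            # 3 2
print(entry(X, 2, 1))  # 3.0
```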

Question 14

Q

Definition: Transpose of a Matrix (X^T)

Answer

A

A matrix whose rows are the columns of the original matrix (and vice versa).
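
One way to see this is a short sketch, assuming the matrix is stored as a list of rows (the function name `transpose` is chosen here for illustration). Transposing an n×p matrix gives a p×n matrix, and transposing twice returns the original:

```python
def transpose(X):
    """Return X^T: row j of the result is column j of X."""
    return [list(col) for col in zip(*X)]


X = [[1, 2], [3, 4], [5, 6]]   # 3×2
Xt = transpose(X)              # 2×3

print(Xt)                      # [[1, 3, 5], [2, 4, 6]]
print(transpose(Xt) == X)      # True
```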

Question 15

Q

Notation: y_i

Answer

A

The i-th observation of the response variable we wish to predict.

Question 16

Q

Matrix Multiplication Requirement

Answer

A

You can only multiply A (of size r×d) and B (of size d×s) if the number of columns in A equals the number of rows in B.
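
The shape rule can be checked before multiplying. The sketch below assumes matrices stored as lists of rows; the helper names `shape`, `can_multiply`, and `product_shape` are hypothetical. When A is r×d and B is d×s, the product AB is r×s:

```python
def shape(M):
    """Return (rows, columns) of a matrix stored as a list of rows."""
    return len(M), len(M[0])


def can_multiply(A, B):
    """True when the number of columns of A equals the number of rows of B."""
    return shape(A)[1] == shape(B)[0]


def product_shape(A, B):
    """Return the shape (r, s) of AB, or raise if the shapes do not conform."""
    if not can_multiply(A, B):
        raise ValueError("columns of A must equal rows of B")
    return shape(A)[0], shape(B)[1]


A = [[1, 2, 3], [4, 5, 6]]   # 2×3
B = [[1], [2], [3]]          # 3×1

print(can_multiply(A, B))    # True
print(product_shape(A, B))   # (2, 1)
print(can_multiply(B, A))    # False: B has 1 column, A has 2 rows
```

The last line shows that the order matters: AB can exist even when BA does not.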

Question 17

Q

Formula for (AB)_{ij}

Answer

A

The (i, j) element of AB is the sum of the products of corresponding elements from row i of A and column j of B.
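
In symbols, with A of size r×d and B of size d×s, (AB)_{ij} = a_{i1} b_{1j} + a_{i2} b_{2j} + ... + a_{id} b_{dj}. The sketch below applies that sum directly, assuming matrices stored as lists of rows; the function name `matmul` is chosen for illustration and the code is not tuned for speed:

```python
def matmul(A, B):
    """Multiply A (r×d) by B (d×s) using (AB)_ij = sum over k of a_ik * b_kj."""
    r, d, s = len(A), len(B), len(B[0])
    if len(A[0]) != d:
        raise ValueError("columns of A must equal rows of B")
    return [
        [sum(A[i][k] * B[k][j] for k in range(d)) for j in range(s)]
        for i in range(r)
    ]


A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]

# Top-left entry: row 1 of A times column 1 of B = 1*5 + 2*7 = 19.
print(matmul(A, B))   # [[19, 22], [43, 50]]
```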

Question 18

Q

Distinction Between Bold/Capital vs Lower-Case Font

Answer

A

Bold capitals (e.g., A) denote matrices; lower-case bold (e.g., a) denotes vectors of length n; lower-case normal font (e.g., a) denotes scalars and vectors not of length n, such as feature vectors of length p; capital normal font (e.g., A) denotes random variables, regardless of their dimensions.

Question 19

Q

Linear vs. Non-Linear Methods

Answer

A

Linear methods assume a linear relationship between predictors and response; non-linear methods can capture more complex, flexible relationships.

Question 20

Q

Examples of Non-Linear Approaches

Answer

A

Tree-based methods (bagging, boosting, random forests), support vector machines, and generalized additive models.

(20 cards)