C3 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

classification types

A
  • binary
  • multi-class: each instance has 1 label/class
  • multi-label: each instance has 0 or more labels
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

vector space models

A

(eg. support vector machine)
- each instance is modelled as a vector in a space
- each item in the vector is the weight for one feature

contiguity hypothesis: data points in the same class form a contiguous region (closeness of two points represents similarity)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

support vector machine

A
  • representation of the examples in a vector space
  • divide the examples of the separate categories by a margin as large as possible
  • separating boundary = hyperplane (in an n-dimensional space, the hyperplane has n-1 dimensions)

what if the data is not linearly separable? –> use Kernel Trick: add extra dimension z = x^2 + y^2
learn line k that separates the classes and transform it back to the original space: x^2 + y^2 = k

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

neural networks

A

more powerful classifier than logistic regression:

  • multiple nodes –> multiple functions –> non-linearity
  • multiple layers –> multiple abstractions over the input data

weights are learned automatically (supervised learning using true labels)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

feedforward networks

A

multilayer network in which the units are connected with no cycles (no outputs are passed back to lower layers)
each layer is fully connected, each hidden unit sums over all the input units

output layer gives probability distribution:

  • binary classification: single output node, y is the probability of positive output
  • multi-class classification: one output node for each category, y is the probability of that category
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

training neural networks

A

learn parameters W and b for each layer that make y_estimate for each training observation as close as possible to the true y

  1. loss function to model distance between y_estimate and y
  2. find parameters to minimize this loss function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly