theory lecture 5 Flashcards by e L.

statistics

more theory-based and top-down ideas. it is more model based and focuses on testing hypotheses.

How well did you know this?

Not at all

Perfectly

machine learning

more heuristic and focused on improving performance of a learning agent. it also looks at real-time learning and robotics.

How well did you know this?

Not at all

Perfectly

data mining and knowledge discovery

integrates both theory and heuristics. the focus is on the entire process of knowledge discovery, including data cleaning, learning, and integration and visualisation of results.

How well did you know this?

Not at all

Perfectly

test data

shows how well the machine is learning after the training in supervised learning systems.

How well did you know this?

Not at all

Perfectly

regression

a machine learning model where you try to predict a score.

How well did you know this?

Not at all

Perfectly

association

a type of unsupervised learning where you try to see the data types and how well they associate with each other.

How well did you know this?

Not at all

Perfectly

clustering

a type of unsupervised learning where you eg. try to differentiate dogs and cats.

How well did you know this?

Not at all

Perfectly

ANN

the data is split into three subsets for classification; ~60% training, ~20% validation, and ~20% testing. it is a prediction model that is inspired by the way a brain works with neurons. it is what deep learning is based on.

How well did you know this?

Not at all

Perfectly

overtraining

when you use too much data for training and the algorithm knows everything about the sample, but it may not recognise anything outside of the sample.

How well did you know this?

Not at all

Perfectly

target variable

the variable we are trying to predict based on the attributes in the columns of a table.

How well did you know this?

Not at all

Perfectly

dimensionality of a data set

the sum of the dimensions of the features/attributes.

How well did you know this?

Not at all

Perfectly

curse of dimensionality

when you have too many dimensions and it becomes hard to predict a value.

How well did you know this?

Not at all

Perfectly

CRISP-DM

a model used to show the knowledge discovery process flow. the process is highly repetitive and experimental. you may have to back in steps, eg. if your model is different in practice.

How well did you know this?

Not at all

Perfectly

C&RT

a prediction model. it stands for Classification and Regression Trees.

How well did you know this?

Not at all

Perfectly

Random Forest

a prediction model that combines different trees.

How well did you know this?

Not at all

Perfectly

Boosted Tree

Study These Flashcards

a prediction model that combines trees in a boosting way.

Fusion

Study These Flashcards

a prediction model that combines different algorithms.

1-Away

Study These Flashcards

means the accuracy including a prediction of 1 class away. eg. it predicts 4, but it is actually 5.

SVM

Study These Flashcards

a prediction model, using a line that divides your data.

linear regression

Study These Flashcards

a method used for classification with the formula w0 + w1x + w2y >= 0. it computes w1 from the data to minimise the squared error to ‘fit’ the data. it uses a line to classify data into a class.

decision trees

Study These Flashcards

a method for classification that splits data by drawing multiple horizontal and vertical lines.

confusion matrix

Study These Flashcards

the primary source for accuracy estimation in classification problems. it shows how confused your model is between two classes. you can put your testing data into the matrix to see how many are correct.

precision

Study These Flashcards

given something is positive in a predicted class, how often do you predict it right?

recall

Study These Flashcards

given that the true class is positive, how often do you predict it right?

decision tree

puts your data in a format to split it up. the higher attributes in the tree are more important.

theory lecture 5 Flashcards

(25 cards)