AI REVISION 1 Flashcards
What is the turing test? and examples
The turing test is a test created by Alan turing which was a test that consisted of an interrogator being connected to a person and a machine via a terminal and cannot see either of them, if the machine could trick the interrogator 30% of the time when answering questions the machine was considered intelligent.
Examples -> Siri, chatbot
What is combinatorial explosion?
Occurs in numeric problems where the complexity of the problem rapidly increases caused by the increasing number of possible combinations of inputs
What is machine learning?
Machine learning is a field of study that gives computers ability to learn without being explicitly programmed.
What is supervised learning vs unsupervised learning
Supervised learning relies on labelled input and output training data, whereas unsupervised learning processes unlabelled or raw data.
2 Examples of supervised and unsupervised learning
Supervised learning.
1. HandWriting Recognition
2. Traffic Prediction
Unsupervised Learning
1. AD Targeting (e.g. group similar costumers together)
2. Crime Hotspots (e.g. identify areas with high crime rate and report it to police management.)
What is clustering?
Clustering is an unsupervised technique about grouping similar objects, the input is unlabelled data as mentioned before, and the goal is to find a natural partitioning or groups of similar data points.
What is Association rules?
Association rules is an unsupervised technique that involves the input being a set of transaction records containing items and the goal is to produce dependency rules to predict occurrence of one variable based on occurrences of another variables.
What is data mining?
Is the analysis of large quantities of data to discover, valid, non-obvious, useful, and ultimately humans should be able to understand pattern in data.
How is the data split?
The data is usually split into two groups, training set and test set (used as estimate of how well the model works)
What is model training?
Learn the model parameters
Decision Tree nodes and goal
Internal nodes: Decision Rules on features
Leaf nodes: Predicted class label
Ultimate Goal -> Prediction
Decision tree pros and cons
Pros - 1.Easy to interpret and explain decisions, 2. Reasonable training time, 3. Feature scaling not necessary
Cons - 1. Small variations in data can lead to very different trees, 2. over complex trees can lead to overfitting
What is linear regression?
Supervised learning technique to estimate a relationship between an input variable X, and output variable X.
aim -> find a line of best fit.
Linear regression pros and cons ->
Pros -> 1. easy to interpret
2. easy to train
3. works well even with limited data
Cons ->
1. only applicable if relationship is linear
2. relies on assumptions about the data
What is model evaluation?
How well does the model do on the validation set.
What is model tuning?
adjust model hyperparameters
What is overfitting
the model fits too well to the training data which negatively impacts performance on unseen data.