Fundamentals of AI Flashcards
What is Machine Learning?
A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.
What is Top Down?
Model all different functions and wire all the ‘agents’ together, deduction
What is Bottom Up?
Give the system a lot of data, so it can discover the concepts itself, induction
Three Pillars of Machine Learning
Models and Algorithms, Powerful and cheaper computation, Massive data warehouse
What is Data Mining?
Exploration and analysis of large quantities of data discover valid, novel, useful and understandable patterns in data.
What is Supervised Learning?
Infers a function from labelled training data, each example consisting of input and outputs (Classification and Regression)
What is Unsupervised Learning?
Infers a function to describe hidden structure from unlabelled data (Clustering and Association)
What can be done in Data pre-processing?
Fill in missing data, find outliers, feature selection.
Unsupervised Learning - Clustering
Given: Un-labelled data set and similarities/distance metric
Goal: Find ‘natural’ partitioning, or groups of similar data points
K-means clustering
Choose the number of k clusters and initialise K cluster centroids randomly. Assign each data point to the nearest centroid (based on distance), and update the centroids by the mean of all the data points assigned. Output the final cluster assignments and centroids.
Application of k-clustering
Anomaly detection, Social Media Analysis
Unsupervised Learning - Association
Discover correlation between any two or more variable.
Given: a set of records containing items
Goal: Produce dependency rules to predict occurrence of variable X with variable Y
Categorical data
Learn to predict to which set an instance belongs to based on pre-labeled (classified) instances (Classification)
Continuous data
Finds a linear relationship with the variable X and the variable Y
Supervised Learning: Regression
Based om the given data find the function that minimises its mean squared error to fit the samples