AI & ML Lect6 Flashcards

Question

Use of Anomaly detection

Answer 1

identifying anomalies within the data

Answer 2

Transformation of data form high to low by discarding redundant data, while still retaining meaningful properties of the original data Uses PCA

Answer 3

1. Provide Algo with labeled input and output data to learn 2. Feed the machine new unlabeled info to see if it tags new data approporiately. 3. If not, continue refining the program

Answer 4

Sorting items into categories

Answer 5

1. Input raw data 2. Interpretation 3. Algo 4. Processing 5. Output

Answer 6

1. input raw data 2. enviroment (state of data, agent takes control, selection of algo, best action, reward if correct) 3. output

Answer 7

Feature selection and Feature extraction

Answer 8

identify redundant features and discard

Answer 9

find new set of low dimensional point that represent the original data well

Answer 10

Principal Component Analysis is a method to construct low-dimensional representation of the data by focusing on the principal components

Answer 11

As much variance as possible

Answer 12

Data visualization with a scatter plot

Answer 13

Classification due to its competitive accuracy, and very effecient

Answer 14

Class / decision

Answer 15

Heuristic Algo

Answer 16

Smaller and accurate tree

Answer 17

Attribute that splits examples into subsets that are ideallly all (+) or (-)

Answer 18

A set of decision tress working together to make a single predicition, and allow greater predictive accuracy

Answer 19

A set of decision trees, with each tree having different features

Answer 20

Properties of an input x are likely to be similar to those points in the neighbourhood of x

Answer 21

Find k nearest neighbors of x and find target attribute of x based on corresponding attribute values

Answer 22

Euclidian Distance

Answer 23

Add each training example (x,y) to dataset D

Answer 24

Count the K-nearest neighbors

Answer 25

Classification time eventho accuracy can be quite strong

Answer 26

handwritten character classifications, recommender systems, medical data mining, pattern recognition

Answer 27

Have a set of points, which the regression algo will model relationship between a single feature (explanatory variable x), and a continuous valued response (target variable y)

Answer 28

Find a best fit line such that the cost function is minimized

Answer 29

MSE (Mean squared error), which is average squared diff between an observation's actual and predicted values

Answer 30

k-means algo

Answer 31

A set of examples used for learning a model

Answer 32

Set of examples that can't be used for learning the model but can help tune model parameters. Helps control overfitting.

Answer 33

Used to access the performance of the final model and provide an estimation of the test error.

Answer 34

Train the model on p% of the data Test the model on the other (100-p)% of the data - this data is unseen by the model

Answer 35

Process automation, fraud/security, algo trading, robo-advisory

Answer 36

Replace manual work, automate repetitive tasks, and increase productivity examples include chatbots and JPM's COiN (Contract intelligence)

Answer 37

Fraud detection, financial monitoring, underwriting and credit scoring

Answer 38

1. Unsupervised learning (clustering) 2. If any anomalies detected, trained AI model will separate legitimate and illegitimate transactions. 3. Initial trianing of AI is using supervised learning 4. Trained AI model is then given data, manual review by an expert, feedback (reinforcement learning) to become the final model

Answer 39

1. Monitor trade results in real time and detect patterns 2. Sentiment / news analysis 3. Act to sell, hol and buy stocks 4. Analyze thousands of data sources 5. Squeeze slim advantage over market average → significant profits due to enormous volume 6. Make thousand or million of trades - high-frequency trading

Answer 40

Portofolio Management, and reccomendation of financial products

Answer 41

1. Define hyperplane between classes with support vectors 2. Optimise model to find support vectors that maximize the distance between hyperplane and classes (best line)

Answer 42

Use Kernel Trick - Map data to high dimensional space where they will be linearly separable

AI & ML Lect6 Flashcards

(72 cards)