AI in Drug Discovery Flashcards

Question 1

Q

Where can AI be used in drug discovery?

Answer

A

Question 2

Q

What are the benefits of AI-driven approaches?

Answer

A

Question 3

Q

AI vs. Machine learning vs. Deep learning

Answer

A

AI is any technique where machines attempt to mimic human behaviour
Machine learning is a subset of AI whereby statistics is used to enable machines/algorithms to improve with experience
Deep learning is a subset of ML which makes the computation of multi-layer neural networks feasible

Question 4

Q

Subtypes of machine learning

Answer

A

Unsupervised learning: Unlabelled data is given to an algorithm and it tries to find patterns
Supervised learning: Labelled data is given to an algorithm and it tries to fit or understand how the labels relate to the data
Reinforcement learning: An algorithm tries to interpret data with constant positive and/or negative feedback

Question 5

Q

Properties of unsupervised machine learning

Answer

A

Data is analysed in an unbiased manner
Requires very little manual intervention - less arduous
Can be used to discover anomalies in data
Identifies sets of items that often occur together
Is heavily used for data visualisation and interpretation (the algorithm doesn’t know what the categories are but it sorts them out for you)

Question 6

Q

Disadvantages of unsupervised machine learning

Answer

A

Human interpretation is needed to see if the predicted clustering visualisation makes sense
You cannot easily get precise reasons for why the clusters were assigned in a particular way
Accuracy of the clustering is hard to measure

Question 7

Q

Supervised machine learning subtypes

Answer

A

Classification: The algorithm tries to learn how to predict a label for a sample given its features
Regression: The algorithm tries to learn how to predict a value for a sample given its features

Question 8

Q

Properties of supervised machine learning

Answer

A

The ultimate goal is to be able to take a set of features from unseen data and predict their labels or values
This is done by learning from a previously generated set of data and generating a model that is able to predict labels or values based on the features of the new data
Fitting the data at different iterations and optimising the line of best fit

Question 9

Q

Disadvantages of supervised machine learning

Answer

A

Question 10

Q

Reinforcement learning

Answer

A

The algorithm learns through trial and error by making predictions and receiving positive or negative feedback and adjusting itself to improve
Much slower than other ML types as it involves feedback loops whereby new data is collected or labelled
Reinforcement learning has the potential to learn accurate models with significantly fewer data points than supervised learning
Requires lots of manual interaction

Question 11

Q

Case study: DDR1 inhibitor

Answer

A

A novel DDR1 inhibitor with high patentability was discovered in only 46 days using ML
Existing compounds and their IC50 values against DDR1 were used in a supervised learning model to predict IC50 for 30000 potential compounds
These compounds were reduced to 40 with the best IC50
These 40 were reduced to 6 with the best patentability
Of these 6, 4 were effective and one compound has now completed Phase 2a clinical trials

Brainscape's Knowledge GenomeTM