AI in Drug Discovery Flashcards

Question 1

Q

Why is AI important in drug discovery?

Answer

A

AI accelerates molecular screening, improves prediction accuracy, reduces development time and costs, and complements rather than replaces traditional drug discovery.

Question 2

Q

What are five key benefits of using AI in drug discovery?

Answer

A

Reduced animal use through in silico prediction,
Early toxicity prediction,
Faster synthesis via optimisation,
Reduced screening burden,
Cost reduction from minimising lab work.

Question 3

Q

What is the difference between AI and Machine Learning?

Answer

A

AI refers to systems that mimic human behaviour. Machine Learning (ML) is a subset of AI that learns from data using statistical models.

Question 4

Q

What are the three types of machine learning and their applications?

Answer

A

Unsupervised ML - Clustering molecules;
Supervised ML - Predicting binding affinity;
Reinforcement ML - Molecule optimisation via feedback loops.

Question 5

Q

What are the characteristics and uses of Unsupervised ML in drug discovery?

Answer

A

Unsupervised ML learns patterns in unlabelled data, useful for data visualisation, clustering molecules, and detecting anomalies without predefined categories.

Question 6

Q

What did Aissa et al. 2021 show using unsupervised ML?

Answer

A

Clustering of single cells post-drug treatment into 12 subgroups, autonomously identifying control, tolerant, and sensitive cell populations.

Question 7

Q

How was clustering used with 10,000 molecular fragments in the lab?

Answer

A

Unsupervised ML identified initial hit compounds and helped find structurally similar analogues for drug discovery.

Question 8

Q

List limitations of Unsupervised ML.

Answer

A

Requires human interpretation,
Lacks explainability (black box),
Difficult to assess accuracy.

Question 9

Q

What are the main concepts in Supervised ML?

Answer

A

Supervised ML requires labelled data and can be used for classification (e.g. active/inactive) and regression (e.g. predicting IC50 values).

Question 10

Q

How is binding affinity predicted using Supervised ML?

Answer

A

Drug features are input into a regression model trained on known IC50 values, which is iteratively adjusted to predict affinities for new compounds.

Question 11

Q

What tool is used for receptor activity prediction?

Answer

A

StarDrop (Optibrium), trained on data from known compounds to predict receptor binding of new candidates.

Question 12

Q

List limitations of Supervised ML.

Answer

A

Requires extensive labelled data,
High computational cost,
Difficult to interpret,
Expensive data acquisition.

Question 13

Q

What is the principle behind Reinforcement Learning?

Answer

A

RL uses trial and error with positive/negative feedback to optimise actions and improve molecule design over time.

Question 14

Q

How was RL used in designing DRD2 ligands?

Answer

A

RL model generated new molecules and learned from feedback loops to produce high-affinity DRD2 ligands.

Question 15

Q

What are the pros and cons of Reinforcement Learning?

Answer

A

Pros: Works with fewer data points, optimises through feedback.
Cons: Slower due to repeated cycles, needs manual or simulated feedback.

Question 16

Q

Match ML types to applications in drug discovery.

Answer

A

Unsupervised - Clustering & biomarker discovery;
Supervised - Binding prediction & docking scores;
Reinforcement - Molecule optimisation via feedback.

Question 17

Q

What is DDR1 and why is it important?

Answer

A

DDR1 is a collagen-binding kinase linked to kidney disease, liver cancer, and fibrosis. Selectivity over DDR2 is critical to avoid off-target effects.

Question 18

Q

Summarise the AI-driven workflow to discover DDR1 inhibitors.

Answer

A

Literature & data curation,
Autoencoder generates 30,000 new molecules,
Supervised IC50 prediction,
Docking filters down to 40 molecules,
Patent filtering narrowed to 6 candidates,
In vitro testing 4/6 active.

Question 19

Q

What was the result of the DDR1 AI discovery process?

Answer

A

From 6 candidates, 4 were active in vitro, 1 entered Phase IIa trials, with the whole process taking only 46 days (23 for computation).

Question 20

Q

List four key takeaways from the AI in drug discovery lecture.

Answer

A

AI accelerates drug discovery,
ML types vary in needs and speed,
AI has produced clinical candidates,
AI complements traditional approaches.