AI in Drug Discovery Flashcards

1
Q

Why is AI important in drug discovery?

A

AI accelerates molecular screening, improves prediction accuracy, reduces development time and costs, and complements rather than replaces traditional drug discovery.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are five key benefits of using AI in drug discovery?

A
  1. Reduced animal use through in silico prediction,
  2. Early toxicity prediction,
  3. Faster synthesis via optimisation,
  4. Reduced screening burden,
  5. Cost reduction from minimising lab work.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between AI and Machine Learning?

A

AI refers to systems that mimic human behaviour. Machine Learning (ML) is a subset of AI that learns from data using statistical models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the three types of machine learning and their applications?

A
  1. Unsupervised ML - Clustering molecules;
  2. Supervised ML - Predicting binding affinity;
  3. Reinforcement ML - Molecule optimisation via feedback loops.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the characteristics and uses of Unsupervised ML in drug discovery?

A

Unsupervised ML learns patterns in unlabelled data, useful for data visualisation, clustering molecules, and detecting anomalies without predefined categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What did Aissa et al. 2021 show using unsupervised ML?

A

Clustering of single cells post-drug treatment into 12 subgroups, autonomously identifying control, tolerant, and sensitive cell populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How was clustering used with 10,000 molecular fragments in the lab?

A

Unsupervised ML identified initial hit compounds and helped find structurally similar analogues for drug discovery.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

List limitations of Unsupervised ML.

A
  1. Requires human interpretation,
  2. Lacks explainability (black box),
  3. Difficult to assess accuracy.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the main concepts in Supervised ML?

A

Supervised ML requires labelled data and can be used for classification (e.g. active/inactive) and regression (e.g. predicting IC50 values).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is binding affinity predicted using Supervised ML?

A

Drug features are input into a regression model trained on known IC50 values, which is iteratively adjusted to predict affinities for new compounds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What tool is used for receptor activity prediction?

A

StarDrop (Optibrium), trained on data from known compounds to predict receptor binding of new candidates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

List limitations of Supervised ML.

A
  1. Requires extensive labelled data,
  2. High computational cost,
  3. Difficult to interpret,
  4. Expensive data acquisition.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the principle behind Reinforcement Learning?

A

RL uses trial and error with positive/negative feedback to optimise actions and improve molecule design over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How was RL used in designing DRD2 ligands?

A

RL model generated new molecules and learned from feedback loops to produce high-affinity DRD2 ligands.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the pros and cons of Reinforcement Learning?

A

Pros: Works with fewer data points, optimises through feedback.
Cons: Slower due to repeated cycles, needs manual or simulated feedback.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Match ML types to applications in drug discovery.

A

Unsupervised - Clustering & biomarker discovery;
Supervised - Binding prediction & docking scores;
Reinforcement - Molecule optimisation via feedback.

17
Q

What is DDR1 and why is it important?

A

DDR1 is a collagen-binding kinase linked to kidney disease, liver cancer, and fibrosis. Selectivity over DDR2 is critical to avoid off-target effects.

18
Q

Summarise the AI-driven workflow to discover DDR1 inhibitors.

A
  1. Literature & data curation,
  2. Autoencoder generates 30,000 new molecules,
  3. Supervised IC50 prediction,
  4. Docking filters down to 40 molecules,
  5. Patent filtering narrowed to 6 candidates,
  6. In vitro testing 4/6 active.
19
Q

What was the result of the DDR1 AI discovery process?

A

From 6 candidates, 4 were active in vitro, 1 entered Phase IIa trials, with the whole process taking only 46 days (23 for computation).

20
Q

List four key takeaways from the AI in drug discovery lecture.

A
  1. AI accelerates drug discovery,
  2. ML types vary in needs and speed,
  3. AI has produced clinical candidates,
  4. AI complements traditional approaches.