Lecture 3 - Optimisation and Hypothesis Space Flashcards

Question 1

Q

Map Example of Optimisation

Answer

A

REFER TO SLIDES
But essentially:
- Attempting to find the shortest apth using nodes, then calculating the minimum path

Question 2

Q

Define Hypothesis Space

Answer

A

The set of all models or hypotheses that can be represented using the selected language or representation.

Question 3

Q

What are the key characteristics of Hypothesis Space?

Answer

A

Key characteristics:
* It is determined by your choice of language - language defined later.
* It defines what’s possible to describe in your AI system.
* It includes all theoretical candidate solutions, regardless of whether they are “good” or “bad”.
REFER TO SLIDES FOR EXAMPLE

Question 4

Q

Define Candidate Solution

Answer

A

Definition: A single model within the hypothesis space; a potential solution to the problem.
Think of it as:
* An individual point inside the hypothesis space that is being tested or evaluated.

Question 5

Q

Define Solution Space

Answer

A

Definition: This is often used synonymously with hypothesis space, but in some contexts it refers to the space of all possible outputs or behaviours of the system based on the hypothesis space.

Question 6

Q

What are the three ingredients of optimisation?

Answer

A

As part of optimisation , there are three requirements that are used to ensure proper optimisation takes place, these are:
- Language (Solution Space/Hypothesis Space)
- Model (Candidate Solution)
- Metric (How good is the model?)

Question 7

Q

Define Language

Answer

A

Language: The formal system or structure used to describe possible solutions or hypothesis.
- Examples include:
○ Mathematical equations
○ Matural languages
○ Grammars
○ Logics
○ Finite automata/finite-state machines
○ Computer programs
○ Logic programs
○ Gantt charts
○ PERT charts
○ Simulation languages
○ Popsticks and glue

Question 8

Q

Why is Language considered important?

Answer

A

If you can’t describe it - you can’t model it, language allow you to do this

Question 9

Q

Generation VS Parsing OR Testing Vs Generate

Answer

A

○ Parsing (Testing):
○ Determines if a solution is valid within a language (efficient).
○ Such as the example above
○ Generating: Enumerates all valid solutions (inefficient, often infinite).

Question 10

Q

What is the Expressiveness of a Language?

Answer

A

Expressiveness: Some languages can express more complex ideas or solutions than others.
- When everything in one language (B) can be also be describe in the other language (A), we say A subsumes B
- If something can be be in one language (A) but not in another language (B), we say B does not subsume A

Question 11

Q

What is Chompsky Hierarchy?

Answer

A

Demonstrates a layered structure of language complexity (Type 3 < Type 2 < Type 1 < Type 0), in terms of expressiveness

Question 12

Q

What does the Chompsky Hierarchy look like

Answer

A

Its build on four types Type 3 < Type 2 < Type 1 < Type 0, with Type 3 being the least powerful and Type 0 being the most powerful

Typically it follows this level:
Type 3 are Regular Languages such as strings or regex
Type 2 are Context Free Languages such as matching parenthesis
Type 1 are Context Sensitive Languages such as symbol matching
Type 0 are Recursively Enumerable Languages which is any lanaguage that can be understood by a computer program

Question 13

Q

Why is Chompsky Hierarchy important (why does it matter)?

Answer

A

It gives us an idea of:
○ What kinds of models can be described
○ How complex the models can be
○ What computational resources are needed to test or parse them

Question 14

Q

Define Model

Answer

A

Model: A specific instance of a hypothesis described in the chosen language.
- Essentially this is an abstraction or approximation of the real world
- An instance of all the possible things that can be described in the language
- Where in the context of AI, it is a candidate solution to a problem.

Question 15

Q

Why are Models important - what does it represent?

Answer

A

The model is the subject of evaluation. It represents our best attempt at mimicking or understanding the target system or data.

Question 16

Q

What are some key characteristics of Models (Key Ideas)

Answer

A

Models may not be perfect representations; they approximate.
All real-world systems can be represented as functions.
The model space is sometimes too large to search exhaustively.

Question 17

Q

Define Metric (Evaluation)

Answer

A

Definition: A function used to assess how good a model (hypothesis) is in comparison to the target (or real-world phenomenon).
Also called: Error function, Cost function, Fitness function, Objective function, Penalty function, Utility function

Question 18

Q

Breakdown of Metric and Types of Metrics

Answer

A

Formally:
* A function from hypothesis space to real numbers: f : H -> ℝ
○ f is a function.
○ It takes an input from the set H — the hypothesis space.
○ It outputs a real number (ℝ), which represents a metric or evaluation score (e.g., error, cost, fitness).
=====
Types of Metrics:
- Numerical: e.g., Mean Squared Error (MSE)

Question 19

Q

Why are Metrics important?

Answer

A

We need a way to measure which solution is better. This is critical in optimisation.

Question 20

Q

What is the issues with Metrics, what can be done instead?

Answer

A

Usually we are happy if we can determine relative closeness
Doesn’t need to be meaningful in an absolute sense, only relative to another
Sometimes we don’t know much about the “real” thing
○ Can just assume its infinitely ‘good’
○ Seek the best hypothesis

Question 21

Q

What is Ideal 1 Defintion of Optimisation?

Answer

A

Find a model within the hypothesis space that is indistinguishable from the target (zero error).
- This is the theoretical best-case scenario.
- You’re aiming to find a model that perfectly replicates the real-world target.
- That means: when evaluated by the metric, the error is exactly zero.

Question 22

Q

Why is Ideal 1 Defintion of Optimisation important?

Answer

A

It gives us a goalpost—a target to aim for in optimisation.
Useful for evaluating how expressive your language is: if your representation can’t describe the perfect model, ideal optimisation is impossible.

Question 23

Q

What are the limitations of Ideal 1 Defintion of Optimisation?

Answer

A

In real-world problems:
○ The hypothesis space might not contain the exact real-world model.
○ Real-world data is often noisy or incomplete.
○ Models are approximations, not exact replicas.
○ Ideal optimisation becomes intractable when the space is too large or the model is too complex.

Question 24

Q

What is Ideal 2 Definition of Optimisation?

Answer

A

Find a model in the hypothesis space that is closest (minimal error) to the target.
- You’re no longer aiming for zero error, but the smallest possible error that can be achieved given your hypothesis space.
- Still assumes perfect knowledge of the metric and ability to search the space effectively.

Question 25

Q

Why is Ideal 2 Defintion of Optimisation important?

Answer

A

This is the standard definition of optimisation in most of machine learning.
You’re finding the best approximation that your system can express.
This model is often referred to as the best hypothesis in your hypothesis space.

Question 26

Q

What are the limitations of Ideal 2 Defintion of Optimisation?

Answer

A

Finding the best model in complex or infinite hypothesis spaces is often computationally infeasible (limited by search complexity).
Sometimes has many local minima.

Question 27

Q

What is the Practical Definition of Optimisation?

Answer

A

Find a model within the hypothesis space (determined by the language) that is as close as possible (good enough) to the target within a specified amount of compute (time and space)
- This is what actually happens in real-world AI systems.
- You aim for a model that gives acceptable performance within real-world constraints:
○ Time (e.g., real-time decisions)
○ Memory/space (e.g., embedded devices)
○ Energy (e.g., mobile computing)

Question 28

Q

Why is the Practical Definition of Optimisation important?

Answer

A

Most practical AI systems are resource-limited.
- There’s always a trade-off:
  ○ Accuracy vs speed
  ○ Accuracy vs memory
  ○ Accuracy vs interpretability

Question 29

Q

What are the limitation Practical Definition of Optimisation?

Answer

A

“Good Enough” is Subjective
Local Optima
Limited Search Scope
Adaptability Challenges

Question 30

Q

What is the “Good Enough” is Subjective limitation?

Answer

A

“Good Enough” is Subjective
- How do you know what level of performance is “acceptable”?
- It might depend on:
  ○ Application risk (life-critical vs entertainment)
  ○ User expectations
  ○ Regulatory requirements

Question 31

Q

What is is the Local Optima limitation?

Answer

A

Local Optima
- Often, due to time/resource limits, the system converges to a local optimum, not the global best.
- Especially true in non-convex problems like neural networks.

Question 32

Q

What is the Limited Search Scope limitation?

Answer

A

Limited Search Scope
- If your compute budget is small, you might not explore enough of the hypothesis space.
- You could miss a much better model that’s just out of reach.

Question 33

Q

What is the Adaptability Challenges limitation?

Answer

A

Adaptability Challenges
- In real-time environments, conditions change.
- The model might perform well initially, but degrade over time unless updated (requiring online learning or continual optimisation).

Question 34

Q

What is Online Optimisation?

Answer

A

Online optimisation:
○ Needs to run in real-time (e.g., autonomous vehicles, robot control).
○ Must respond quickly, even if it’s not the best possible model.
○ Design Priorities:
* Prioritise speed over perfection.
* Use lightweight models or approximate solutions.
May use heuristics or simplified models to ensure fast responses.

Question 35

Q

What is Offline Optimisation?

Answer

A

Offline optimisation:
○ Occurs before deployment.
○ Can take longer (e.g., training a model overnight).
○ Focus is on achieving better accuracy, even if it takes more time.
○ Design Priorities:
* Prioritise accuracy and generalisation.
* Use heavyweight models and explore larger hypothesis spaces.
* Perform hyperparameter tuning, ensemble methods, or neural architecture search.