Machine Learning For Trading Exam 1 Flashcards

1
Q

What is Machine Learning according to Tom Mitchel ?

A

Tom Mitchel defines Machine Learning as the study of systems that learn from experience (E) given some set of Tasks (T) evaluated on some performance measure (P). He asserts that such systems are said to learn from experience if their performance on said Tasks, improves with experience (E).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is the probablistic model adopted for studying and designing ML systems ?

A
  1. Because it provides an optimal way to make decisions under uncertainty
  2. It is a shared domain language that cuts across the sciences
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Classification ?

A

Classification is a supervised learning technique where, predictions are made over an unordered set of mutually exclusive labels called classes. These classes represent datapoints grouped together based on some characteristics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is it necessary to carry out an Exploratory Data Analysis before designing a machine learning model and what methods are available for doing such ?

A
  1. Because it helps reveal patterns in data which informs what machine learning method to use
  2. It is useful for spotting potential problems with the data like (label bias or outliers).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Explain Supervised and Unsupervised learning and how they differ

A

Supervised learning is concern with learning a mapping f from a set of inputs x ∈ X to a set of outputs y ∈ Y where x is called the feature, covariate or predictor and y is called the response, label or target. It assumes that for every input featuture x there is a corresponding target value y that has predictive value.

Unsupervised learning on the other hand is concerned with the learning patterns in data without any labelled outputs. This makes is less time consuming and effective since it skips the overhead of labelling targets. On the other hand it is memory inefficient when compared to supervised learning because of the need to store large amounts of training data from which it makes predictions from.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Regression and explain how it is different from Classification

A

Regression is a statisitical learning technique used to predict quantitative values. Classification is a technique that makes qualitiative or categorical predictions. Data is grouped into cateogries or classes based on common characteristics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Clustering ?

A

Clustering is the process of grouping/partitioning datapoints into groups called clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain what the Sharpe Ratio is and what it is used for

A

The Sharpe Ratio is a performance metric that compares a portfolios return with its risk. Its also called the risk adjusted return. It is calculated as the expected value (mean) of the portfolio’s return minus the risk-free rate divided by the standard deviation of the portfolio’s return.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are Beta and Alpha in the CAPM formula

A

Both alpha and beta are performance measures used ing CAPM. Beta measures the volatility of a portfolio relative to the market.

Alpha on the other hand measures the excess returns on a portfolio after adjusting for market risks and other fluctuations. It tells how better or worse a portfolio performed relative to its beta predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the assumptions of the Efficient Market Hypothesis

A
  1. There are large number of investors who are incentivized to seek profit. That is to say they are constantly seeking opportunities where the price of stock is not reflective of its true value.
  2. Information arrives randomly.
  3. Prices adjust quickly
  4. The current price of a stock reflects all available information
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In the context of EMH information comes from a variety of sources. Can you name some sources

A
  1. Price/Volume
  2. Fundamentals (earnings, assets, liabilities etc)
  3. Exogenuous (having an external cause or origin)
  4. Company Insiders
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the forms of the EMH and what do they prohibit ?

A
  1. Weak form: Future asset prices cannot be predicted from historical price and volume data. This suggests that it if the weak form is correct, Technical analysis isn’t possible because technical anaysis relies on price and volume data.
  2. Semi-Strong form: Asset prices adjust immediately to all public information. Including that which may be available through a compay’s fundamentals like (financial disclosure)
  3. Strong EMH: Asset prices adjust to refelect all relevant information include that available to insiders.

The weak form if correct prohibits technical analysis
The semi-strong form if correct prohibits both technical analysis and fundamental analysis
The strong form if correct prohibits technical analysis, fundamental analysis and insider information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an efficient market ?

A

An efficient market is one where information that can affect the price of a stock travels rapidly and causes the prices to adjust accordingly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the adjusted price of a stock ?

A

The adjusted price of a stock is the price after factoring corporate actions like stock splits and dividends issued.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a stock split ?

A

A stock split is when a company increases the amout of shares by spliting existing shares in a bid to reduce the price per share. The total value of all shares remains the same at a reduced price per share but more shares are made available as a result of the split.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are dividends ?

A

Income earned on a stock without selling it. Divideds are paid from the company’s earnings. When a company pays dividends, the price of the stock is reduced, this is because the company makes these payments from its holdings and by doing so reduces its assets.

17
Q

What is the divided yield and how is it calculated ?

A

The income generating potential of a stock. It is measured as the amout of dividends per share divided by the share price.

18
Q

Briefly explain reinforcement learning and how it differs from supervised learning

A

Reinforcement learning solves problems where an agent learns from its environment. The learning is encoded in the form of a policy which specifies what action an agent should take for every possible input from its environment.

It differs from supervised learning in that the agent is not provided any ground truth (labelled data) but an occasional reward or penalty signal for each action the agent takes.

Key downsides to this approach are
* The agent isn’t told what is the best action to take, It just gets occasional signals on if the action it took was good or bad.
* It is difficult to tell which inputs let to an action

19
Q

Explain what a portfolio is

A

A portfolio is a weighted sum of assets. The sum of all weights must equal 1 or 100%. However for leveraged portoflios, this sum can exceed 1.

20
Q

What is the Market Capitalization of a company ?

A

The market capitalization of a company is the number of shares outstanding multiplied by the share price.

21
Q

Allocations in a portoflio are usually cap weighted, How are these weights calculated ?

A

Weights are calculated by dividing the market capitalization of the asset by the total capitalization of all assets in the portfolio

22
Q

What does CAPM say about α ?

A

The expected value of α is always zero. CAPM stipulates that the return on a stock is dependent on the market alone.