Zetta Playbook Flashcards

1
Q

What are two things AI adoption is determined by?

Zetta

A

Both trust and risk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the four different developments in AI? And examples

(Zetta)

A

AI personalisation: Associated with low risk for incorrect prediction (eg. Book recommendations)

AI Augmentation: Still function without, augment on top of (eg. Improved search suggestion tool)

AI automation: Workflow fully automated by AI (eg. Tractable visual diagnose car crash)

AI Creation: Products not exist at all without AI (eg. Invenia models predict demand and supply of electricity)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between systems of record to systems of intelligence?

(Zetta)

A

Old model teams would manually pull data, crunch it, and put in a report

Intelligence: Can access in real time, visualise and analyse on the fly through intuitive products

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does building an intelligent system start with?

Zetta

A

Having the right data

And
-effectively storing you data and complimenting it with data from other sources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The more complex a system the more AI is vulnerable to what?

(Zetta)

A

Hard-coded inefficiencies, more vulnerable to confounding factors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Move from cloud computing era to applications that do what?

A

Help customers to make decisions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Shifted from code to what in intelligence era?

Zetta

A

Unique data and self-learning code

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The data value chain flow involves?

Zetta

A
  1. Acquiring data
  2. Storing the data
  3. Annotating the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Where does the strength of data moats come from?

Zetta

A

Context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Investors talk about what two different types of data?

Zetta

A

Public data Vs proprietary data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Zetta outlines 8 elements to a strong data set. What are they?

A
  1. Accessibility
  2. Time
  3. Cost
  4. Uniqueness
  5. Dimensionality
  6. Breadth
  7. Perishability
  8. Virtuous loop
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What two elements determined how time impacts the strength of a data set?

(Zetta)

A

Can the data be accessed instantly, or does it take a significant amount of time to obtain and process?

How quickly can the data be amassed and used in the model?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What two elements of cost need to be considered in regards to the value of a data set?

(Zetta)

A

How much money is needed to acquire this data?

Does the user of the data need to pay for licensing rights or pay humans to label the data?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does uniqueness determine the strength of a data set? And what are examples?

A

Is similar data widely available to others who could then build a model and achieve the same result?

Such so-called proprietary data might better be termed “commodity data” — for example: job listings, widely available document types (like NDAs or loan applications), images of human faces.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does dimensionality impact the strength of a data set? And what are they relevant for?

(Zetta)

A

How many different attributes are described in a data set?

Are many of them relevant to solving the problem?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What three questions on breadth impact the strength of a data set

A

How widely do the values of attributes vary?

Does the data set account for edge cases and rare exceptions?

Can data or learnings be pooled across customers to provide greater breadth of coverage than data from just one customer?

17
Q

What two elements of perishability determine the strength of a data set?

(Zetta)

A

How broadly applicable over time is this data?

Is a model trained from this data durable over a long time period, or does it need regular updates?

18
Q

How does the virtuous loop play into the strength of a data set? And do what over time?

A

Outcomes such as

  • performance feedback
  • predictive accuracy

can be used as inputs to improve the algorithm

And performance cab compound over time

19
Q

What are one the of the most important ways to differentiate? And how can this be truly defensible?

A

Data sets

What data moats are tied to a particular problem domain, in which unique, fresh, data composes in value as it solves problems for customers

20
Q

What is one fo the biggest costs to overcome in AI for startups?

(Zetta)

A

Cost of annotation or labelling

21
Q

Interesting example: What is Cloud auto ML google doing?

A

Enables developers with limited machine learning expertise to train high-quality models specific to their business needs. Build your own custom machine learning model in minutes.

In sight, language, structure, and platform areas

Paid service

22
Q

What is the Snorkel project out of Stanford? And what three programming operations are used?

A

Snorkel is a system for programmatically building and managing training datasets without manual labeling. In Snorkel, users can develop large training datasets in hours or days rather than hand-labeling them over weeks or months.

Snorkel currently exposes three key programmatic operations:

  1. Labeling data, e.g., using heuristic rules or distant supervision techniques
  2. Transforming data, e.g., rotating or stretching images to perform data augmentation
  3. Slicing data into different critical subsets for monitoring or targeted improvement
    Snorkel then automatically models, cleans, and integrates the resulting training data using novel, theoretically-grounded techniques.