YC - Andrew Flashcards

1
Q

What progress have you made since your application (then you had 2M cases, tens of millions of documents, and nearing MVP)?

A

We have completed our MVP, and identified a number of potential investments, and have reached out. Now iterating

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

You said prototype/beta is coming up from November 1st - where are you at? On track?

A

We’ve already made a couple preliminary models, so this is actually already done. Now we’re just iterating and feature engineering to improve. Now we are pushing towards an investible model Dec 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

You said your advantage is driven by your combination legal and quant finance and your ability to invest in cases below $500K, lower than nearest competitor. Why do these give you an advantage?

A

These advantages allow us to successfully build the technology (and the portfolio). These enable us to access these lower cases and enjoy a better cost structure generally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

You talk a lot about cost-effectiveness in your app - where is this now?

A

With higher precision technological models, we don’t need as much human diligence. Also more efficient capital deployments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do you justify lower rates to LPs?

A

We have higher precision and more efficient capital deployments, leading to higher returns. LP stay in yo lane and enjoy your returns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

You talk about understanding NLP as being how you think about this differently. How would this work? How do you know it’s possible?

A

Abstractly, court documents contain a ton of information about a case. More concretely, I’ve read successful research papers on similar tasks, and we plan on being more sophisticated than what I’ve read.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Is there no one else doing something similar?

A

Legalist and LexShares are the two other players investing in smaller cases. Both are using relatively simple models as a filtering system, and their average case sizes is well above what we’re targeting
Legalist average investment size is $500K, Lexshares is average ~$1.5M, both have been trending up.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is your product different?

A

Quant-driven approach. Lower cost. Smaller case sizes. Higher precision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What competition do you fear most?

A

Legalist if they decide to hire sufficient talent and revamp business model. Same for larger players.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Who are your competitors?

A

Legalist most directly. Other large firms like Burford, Lake Whillans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Who might become competitors?

A

Big players may all become more direct competitors with tech investments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is your growth like?

A

Had 244 customer conversations. We recently identified a segment of the market that seems very promising. Technical things that have improved.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How large is the market that you are going after?

A

(I think this is the same as below…but) $640M/year in recoveries in our narrow NY cases. 10B if you extrapolate US (by population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How many people are in your target market?

A

33K cases per year for contracts in our population in NY. 500K est. in US., and this is the subsegment of contract cases which we are finding traction in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How will you validate?

A

Will try to raise with backtest. If not, we’ll invest smaller amounts with our own money

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What have you learned so far from working on your product?

A

Dataset is highly particular, requiring significant domain knowledge to parse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What’s new about what you make?

A

Quant approach, including NLP. Lower case sizes. Heavier reliance on tech.

18
Q

How does your product work in more detail?

A

3 data sources, advanced ML techniques, quant approach to the whole portfolio

19
Q

What are you going to do next?

A

Build robust model, get funding, invest in cases

20
Q

What part of your project are you going to build first?

A

Model and investing mechanism

21
Q

What is the next step with the product evolution?

A

Iteration on model. More feature engineering. Next are NLP iterations

22
Q

Where is the rocket science here?

A

Quant investing applied to law; litigation domain knowledge pairing

23
Q

Why is your product better than Legalist?

A

We believe Legalist’s models are simpler than ours, as they appear to be focused on building arrays of data largely based on metadata. Our additional tech (focused on language models/NLP) allows us to be more precise in our analysis of case outcomes, which allows us to invest in < 50K investments.

24
Q

Why do you think Legalist’s models are simple?

A

From their technology team, description of thier process in a June 2020 SEC filing, and all the other research we’ve done.

25
Q

How is your solution better?

A

Better models means lower costs, higher precision. Better portfolio management means higher returns on capital and less risk.

26
Q

Do you have proprietary data?

A

No, but as we fund cases we will build a proprietary database

27
Q

How are you analyzing settlement values?

A

Estimating settlement values based on similar cases, using full data picture at the time of settlement

28
Q

Demo?

A

We don’t have a demo because there is no user interface. We’re happy to show you anything you want to see about our model / backend.

29
Q

Algothmic Bias

A

We actually see this as an opportunity. First level, debias the corpus, but in the best version we can actually confront the counterfactual: what if disadvantaged plaintiffs had been given funding?

30
Q

What has been your progress over 6 months?

A

AH: Over the past four months on the technology side we’ve built scripts for blah blah blah. And during that time, we’ve diligenced hundreds of specific cases that have been identified by our algortihms, and reached out to the lawyers and plaintiffs in those cases.

31
Q

Why would you stick together?

A

We are each emotionally invested in solving this problem for personal reasons, we’ve already handled adversity, and through working together we have developed a deep level of respect for each other. Hell, we all moved to NYC during a pandemic to be together! (If the q is directed at Andrew or wrt ClassWise, mention the dedication levels of the founders.)

32
Q

Tell us about a tough problem you solved?

A

Technical stuff.

33
Q

Tell us something surprising you have done?

A

Living together. We used GPT3 generated phrases to generate our website.

34
Q

What’s an impressive thing you have done?

A

Current tech pipeline to get to multiple input model

35
Q

Why did your team get together?

A

We are also data guys, and this is one of the coolest problems out there. But more are really passionate about fairness as we have been personally impacted by the high cost of legal proceedings.

36
Q

Why did you quit ClassWise? Why will you stick this one out?

A

ClassWise failed fast. Weren’t even able to help schools for free. Team didn’t include domain experts and also wasn’t as dedicated as Patronus.

37
Q

Funders preferred explainability over predictability. What are the implications of building something like this for the legal system where results are unexplainable?

A

Explainability functionality. Counterfactual. See a lot of this as an opportunity. Partial dependence plot. LIME. I don’t think we’ll get this question

38
Q

How would you build a moat?

A

Building a pipeline of law firms. Over time we’ll also have proprietary data. (And also it’s hard)

39
Q

Bottom-up cases

A

Each case is avg 20K/year after paying our investors. So 50 cases = 1M. 5000 cases is 100M. And there are 200K cases every year.

40
Q

First hire

A

Engineer for expanding into new states and helping with model

41
Q

Tell me about your quant trading background

A

Built a new quant trading desk in a new asset. Involved data science, algo development, product management, portfolio optimization.

42
Q

If legalist decides to compete with you, why will you win?

A

The founding team. Sure, they have the resources to make hires, but it matters that the founding team has serious deficits in Data Science, Litigation, Ops, and Finance. That’s a lot to make up for.