Part 1: The Tech Lead Flashcards

Question

In the Nine types of data science projects, what does Monitoring and Rollout refer to?

Answer 1

Accept tracking capabilities and interpret A/B test results with a recommendation for launch decision.

Answer 2

Define metrics to guide business operations and provide visibility on dashboards.

Answer 3

Produce a report highlighting insights on a given area of the business/product.

Answer 4

Deploy machine learning models or APIs with A/B testing to assess effectiveness.

Answer 5

Improving modelling capability through enriching data features with better accuracy and coverage.

Answer 6

Align on single source of truth across different business units for critical business operations/metrics.

Answer 7

Demonstrate productivity gain for partners or the data science team for improving infrastructure/moving to cloud etc.

Answer 8

Specific compliant data infrastructure and document processes for peers to follow to get legal approval.

Answer 9

- Customer of the project is not clearly defined - Stakeholders are not included in the decision process - Project goals and impact are not clarified and aligned to company strategy - Affected partners are not informed - Value of the project is not clearly defined - Delivery mechanism is not defined - Metrics of success are not aligned - Company strategy changes after project definition - Data quality is not sufficient for the success of the project

Answer 10

The project must have a clear customer with a challenge to be resolved. The customer must be able to receive the solution and assess whether of not it have solved the challenge.

Answer 11

A project must have clearly defined: - Inputs: Data it will need. - Outputs: What resolves a customers problem i.e. a risk score from a ML model. - Metrics for success: The anticipated lift in a business relevant metric.

Answer 12

This outlines the following: - Technology choices - Feature engineering and modelling strategy - Configurations, tracking, and testing

Answer 13

The milestone and phases used to execute and deliver a project. This is split in to the following components: - Phases of execution - Synchronization cadence

Answer 14

New data source—Data availability, correctness, and completeness may be at risk. Partner re-org—Disruption of prior alignment with partners; re-alignment is required. New data-driven feature—Tracking bugs, data pipeline issues, population drifts from product feature updates, and integration issues must be discovered and resolved. Existing product—Feature upgrades can change the meaning of metrics and signals. Solution architecture dependencies—Technology platform updates can disrupt the operations of DS capabilities.

Answer 15

Validating data quality: Ensuring data availability and correctness.

Answer 16

Producing a simple model, such as a linear regression, to demonstrate model feasibility.

Answer 17

Defining inputs and outputs formats, metrics that will be check and a measure of success.

Answer 18

The purpose is to remove data risks (availability, correctness, and completeness) before aligning with product and engineering on integration specification.

Answer 19

The Product Proof of Concept.

Answer 20

- Defining the success criteria. - Product and engineering specifications are aligned. - Engineering resources are allocated in sprints. - Additional input features are developed. - Models are refined, and A/B tests are scheduled. - The validated learning in the second phase is assessing capability/market fit, as observed from A/B test results.

Answer 21

To address the “unknown unknown” at the time of planning, an additional one to three build, measure, learn iterations can be planned to learn, align, build, test, and assess a new data-driven capability.

Answer 22

Weekly syncs. This creates a project rhythm and keeps the project top of mind for the coordinating teams. Weekly milestones also allow data scientists to break large projects into approachable pieces and facilitate transparency in communicating DS project progress.

Answer 23

New data source—Data availability, correctness, and completeness may be at risk. Partner re-org—Disruption of prior alignment with partners; re-alignment is required. New data-driven feature—Tracking bugs, data pipeline issues, population drifts from product feature updates, and integration issues must be discovered and resolved. Existing product—Feature upgrades can change the meaning of metrics and signals. Solution architecture dependencies—Technology platform updates can disrupt the operations of DS capabilities.

Answer 24

Project team size—Involves 1–2 data scientists, compared with 3–10 engineers. Project uncertainty—Data-dependent risks exist on top of engineering risks. Project value—Demonstrated through A/B tests, feature completion is not enough.

Answer 25

- A predictable underlying process exists in the domain - Quantifiable signals can be timely processed - Levers exist for fast response

Answer 26

- High-frequency trading (capturing market microstructure and order book dynamics). - Recommendation systems (such as recency, frequency, monetization, or RFM based models).

Answer 27

- Sales forecasting. - Infrastructure load prediction. - Financial account balance forecasting. - Structural modelling in economics. Where various cyclical patterns drive the outcome.

Part 1: The Tech Lead Flashcards

(51 cards)