Data analytics lifecycle Flashcards
What are the main reasons to use frameworks?
efficient use of time
nothing gets forgotten
scale projects
why use frameworks in data science?
acts as a guide
ensure focus is on ds not bi
needs a collaborative approach
what are the 2 key project roles that get a sponsor presentation?
Business user
project sponsor
what are the 2 key project roles that get the code and technical documents?
data engineer
data scientist
what are the 2 key project roles that get an analyst presentation?
BI analyst
Database administrator
what are the 6 key project roles?
business user project sponsor project manager bi analyst data engineer database administrator ds
what is the data lifecycle? (6 phases)
discovery data prep model planning model building communicate results operationalise
In discovery what are the seven main areas?
learn business domain learn from the past resources frame the problem interviewing formulate initial hypothesis identify data sources
In discovery learn the domain - what do you not need to do? A)determine amount of domain knowledge B) determine general analytic problem C) decide what technique to use D)if you have no idea. Conduct research.
C) decide what technique to use
In discovery learn from the past what do you need to do?
have there been any previous attempts
why did they fail?
who is a business user?
someone who benefits from end results
who is the project sponsor?
person responsible for genesis of the project
who is a project manager?
ensure key milestones are met
who is the BI analyst?
business domain expert
who is the data engineer?
deep technical skills