Data Architecture Flashcards

1
Q

Define Data Architecture

A

A framework ๐Ÿ—๏ธ that

defines how data is

collected,๐Ÿ“ฅ
stored, ๐Ÿ—„๏ธ
managed, ๐Ÿ“‹
and accessed ๐Ÿ”‘

across an organisation

Designed with business need in mind

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name 3 tools that you work with as part of you data systems architecture

A

๐ŸŸข Collibra - metadata on JLPโ€™s data assets and RoP

๐Ÿ”ค Alfabet - records the applications used in JLP and how they are connected - the IT landscape

โ„๏ธ Snowflake - cloud based storage and analytics service that handles both structured and unstructured data

๐ŸŸฆ Tableau - JLP tool of choice for data visualisation used for reporting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why did you use regression analysis to investigate your data?

A
  • Pearsonโ€™s Correlation identified a weak + relationship between Lead Time & Stock Holding
  • Regression would show me the linear relationship to understand how stock holding varies with lead time
  • Forecast stock holding based on lead time
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why did you use time series forecasting to investigate you data?

A
  • Regression Model:
  • not accurate;
  • other variables;
  • weak relationship
  • Time Series Forecast to identify patterns in Stock Holding and forecast using Historical data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How did you access data from your organisationโ€™s data system and how did this impact your tool choice

A
  • Collibra/Alfabet Direct Extract
  • Google Sheets
  • Tableau: Integrated with GS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why did you use Tableau?

A
  1. Integrates w/ **GS **๐Ÿ—’๏ธ
  2. Interactive๐ŸŽฎ
  3. Secure sharing - Tableau Online, accessible from any device ๐Ÿ“ฑ
  4. Stakeholders familiar with it - JLP** tool of choice** โœ…
  5. User-friendly join feature ๐Ÿค
  6. Future automation: Snowflake or Collibra Application Programming Interace ๐Ÿค–
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why did you use GS?

A
  1. User friendly interface - formulas โž—
  2. JLP tool of choice โœ…
  3. Collaboration ๐Ÿค
  4. Tableau Integration/File Import ๐Ÿ”—
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why did you use Python? (Hypothesis Testing)

A

For my hypothesis test (stock holding and lead time relationship):
1. built in libraries
2. visualisation like scatter plot & hypothesis test in one tool
3. handles big data sets
4. reuse the same code for different sets of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why did you use Python? (Regression)

A
  1. Greater flexibility
  2. Greater control
  3. Tableau linear regression operates behind scenes
  4. E.g. Tableau wouldnโ€™t have split the data into test/train
How well did you know this?
1
Not at all
2
3
4
5
Perfectly