Foundations - Roles and Concepts Flashcards

1
Q

what are the issues organization will have

A

Data Processing in SILOS
Excessive Data movement
Data Duplication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Engineer role

A

Raw data into valuable insights
Design, Develop and Maintain data architectures ad (ETL)

Getting Data from sources, Making it useful and Serving to stakeholders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the Key functions of DE

A

Build and Maintain Data Infrastructure
Ingest data from Various sources
Prepare ingested data for analytics
Catalog and document curated datasets
Automate regular data flows
Ensure Data Quality, Security and Compliance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Build and Maintain Data Infrastructure

A

Setting up databases, data lakes, and data warehouses on AWS services like Amazon Simple Storage Service (Amazon S3), AWS Glue, Amazon Redshift, among others

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ingest data from Various sources

A

Use tools like AWS Glue jobs or AWS Lambda functions to ingest data from databases, applications, files, and streaming devices into the centralized data platforms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Prepare ingested data for analytics

A

Use technologies like AWS Glue, Apache Spark, or Amazon EMR to prepare data by cleaning, transforming, and enriching it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Catalog and document curated datasets

A

Use AWS Glue crawlers to determine the format and schema, group data into tables, and write metadata to the AWS Glue Data Catalog. Use metadata tagging in Data Catalog for data governance and discoverability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Automate regular data flows and Pipelines

A

Simplify and accelerate data processing using services like AWS Glue workflows, AWS Lambda, or AWS Step Functions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ensure Data Quality, Security and Compliance

A

Create access controls, establish authorization policies, and build monitoring processes. Use Amazon DataZone or AWS Lake Formation to manage and govern access to data using fine-grained controls. These controls help ensure access with the right level of privileges and context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Chief data officer CDO - Responsibility

A

Builds a culture of using data to solve problems and accelerate innovation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Chief Data Office - Area of Interest

A

Data quality, data governance, data and artificial intelligence (AI) strategy, evangelizing the value of data to the business

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Data architect - Responsibility

A

Driven to architect technical solutions to meet business needs, focuses on solving complex data challenges to help the CDO deliver on their vision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data Architect - Area of Interest

A

Data pipeline, data processing, data integration, data governance, and data catalogs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Data engineer - Responsibility

A

Delivers usable, accurate datasets to the organization in a secure and high-performing manner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Data Engineer - Area of Interest

A

The variety of tools used for building data pipelines, ease of use, configuration, and maintenance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Data security officer - Responsibility

A

Ensures that data security, privacy, and governance are strictly defined and adhered to

17
Q

Data Security Officer - Area of Interest

A

Keeping information secure, complying with data privacy regulations, protecting personally identifiable information (PII), and applying fine-grained access controls and data masking

18
Q

Data scientist - Responsibility

A

Constructs the means for quickly extracting business-focused insight from data for the business to make better decisions

19
Q

Data Scientist - Area of Interest

A

Tools that simplify data manipulation and provide deeper insight than visualization tools and tools that help build the machine learning (ML) pipeline

20
Q

Data analyst - Responsibility

A

Reacts to market conditions in real time, must have the ability to find data and perform analytics quickly and easily

21
Q

Data Analyst - Area of Interest

A

Querying data and performing analysis to create new business insights and producing reports and visualizations that explain the business insights