Data Migration Flashcards

1
Q

What is data migration?

A

The process of transferring data between computer storage types or file formats.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the types of data migration?

A

Storage migration
Database migration
Application migration
Business process migration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the stages of data migration?

A

Analysis & Discovery
Sampling & Profiling
Data Cleansing
Business Rules & Process Validation
Data Load
Reconciliation & Business Sign-off

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the key steps in pre-migration?

A

Source Data Exploration: Document systems, map data, and manage gaps.
Data Assessment: Profile data, clean records, and identify the “Golden Record.”

Design & Build: Create a low-level design, build ETL jobs, and validate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the main steps during migration?

A

Preparation: Agree on a cutover plan, code freeze, and resourcing.

Execution: Run migration jobs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the key post-migration activities?

A

Reconcile data and manage errors.

Address remaining records.

Plan for legacy system retirement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some common issues in data migration?

A

Legacy data not fitting into new systems.

Moving target due to continuously updated source systems.

Lack of collaboration.
Insufficient knowledge of source systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the resolutions to common issues?

A

Modify architecture for legacy data or accept fallout.

Agree on a cut-off date and code freeze.

Use collaborative tools like stand-ups and cross-training.

Involve businesses in design and cleansing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an Agile lifecycle for data migration?

A

An iterative approach using sprint sessions to deliver value slices collaboratively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a Waterfall lifecycle for data migration?

A

A sequential approach where progress flows linearly through initiation, design, execution, and maintenance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the advantages of cloud data migration?

A

Cost savings on servers and storage.
Increased collaboration, scalability, and reliability.
Enhanced disaster recovery options.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the disadvantages of cloud data migration?

A

Platform dependency.
Limited storage options by vendor.

Costs increase with large data transfers.

Security concerns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are key parameters of good quality data?

A

Completeness, conformity, consistency, accuracy, duplication, and integrity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What features are essential in a Data Quality (DQ) tool?

A

Parsing & standardization
Cleansing & matching
Profiling & monitoring
Enrichment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are key parameters for choosing an ETL tool?

A

Support for multiple data sources
GUI-based environment
Team development capabilities
Built-in data profiling
Metadata management
Scheduling and error handling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are key parameters for selecting a BI tool?

A

Ease of setup and usability
Intuitive UI
Powerful analytics and real-time insights
Options for charts, graphs, and customizable dashboards
Mobile BI support

17
Q

What was the objective in the UK’s leading gas and electricity company migration?

A

To migrate asset and user management data for 40k engineers with zero downtime.

18
Q

What was the approach in the grocery company migration?

A

Business involvement in identifying disparate sources.
Profiling and cleansing to de-duplicate data.
Oracle Golden Gate for CDC and Microstrategy for reporting