Database Management (ETL) Flashcards
Target dependencies, such as where and on how many machines the repository lives, and the specifics of loading data into that platform.
Refresh volume and frequency, such as whether the data warehouse is to be loaded on an incremental basis, whether data is forwarded to the repository as a result of triggered transaction events, or whether all the data is periodically loaded into the warehouse in the form of a full refresh.
ETL (Loading Stage Issues)
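The difference between a full refresh and an incremental load can be sketched as follows. This is a minimal illustration, not a real loader: the warehouse is assumed to be a plain dict keyed by a hypothetical `id` field, and each source row is assumed to carry a hypothetical `updated_at` value that serves as a watermark.

```python
def full_refresh(warehouse, source_rows):
    """Discard the warehouse contents and reload every source row."""
    warehouse.clear()
    for row in source_rows:
        warehouse[row["id"]] = dict(row)
    return len(source_rows)


def incremental_load(warehouse, source_rows, last_loaded_at):
    """Upsert only the rows changed since the previous load's watermark."""
    changed = [r for r in source_rows if r["updated_at"] > last_loaded_at]
    for row in changed:
        warehouse[row["id"]] = dict(row)
    # Advance the watermark only if something was actually loaded.
    new_watermark = max((r["updated_at"] for r in changed), default=last_loaded_at)
    return len(changed), new_watermark
```

A full refresh is simpler but moves every row each time; the incremental version moves less data at the cost of having to track the watermark between runs.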
During the transformation step, a series of rules or functions is applied to the extracted data. These can involve transformations such as data summations, data encoding, data merging, data splitting, data calculations, and the creation of surrogate keys.
Data aggregation is the process of collecting data and presenting it in a summarized format, both for statistical analysis and to support business objectives.
ETL (Transformation Stage)
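Two of the transformations named on this card, surrogate key creation and data summation, can be sketched as follows. The function names, the `sk` output field, and the sample field names in the usage note are assumptions made for illustration only.

```python
from collections import defaultdict
from itertools import count


def assign_surrogate_keys(rows, natural_key, sk_field="sk"):
    """Give each distinct natural-key value a sequential integer surrogate key."""
    next_key = count(1)
    key_map = {}
    keyed_rows = []
    for row in rows:
        natural_value = row[natural_key]
        if natural_value not in key_map:
            key_map[natural_value] = next(next_key)
        keyed_rows.append({**row, sk_field: key_map[natural_value]})
    return keyed_rows, key_map


def total_by(rows, group_field, value_field):
    """Aggregate: sum value_field for each distinct value of group_field."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[group_field]] += row[value_field]
    return dict(totals)
```

For example, rows with a hypothetical `customer` name and `amount` could first be keyed with `assign_surrogate_keys(rows, "customer")` and then summarized with `total_by(keyed_rows, "sk", "amount")`.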
Evaluating relationships or associations between data elements that demonstrate some kind of affinity between objects.
Affinity Grouping
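One simple way to look for affinity is to count how often pairs of items appear together, as in market-basket analysis. The sketch below assumes the data arrives as a list of "baskets" (lists of item names); that input shape and the function name are illustrative assumptions, and real affinity analysis usually adds measures such as support and confidence.

```python
from collections import Counter
from itertools import combinations


def pair_counts(baskets):
    """Count how many baskets contain each unordered pair of items."""
    counts = Counter()
    for basket in baskets:
        # Deduplicate and sort so each pair has one canonical ordering.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts
```

Pairs with high counts relative to the number of baskets are candidates for an affinity relationship.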
Taking a large collection of objects and dividing them into smaller groups of objects that exhibit some similarity.
Clustering
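As one concrete instance of clustering, the sketch below runs a basic k-means loop on one-dimensional numbers. The choice of k-means, the caller-supplied starting centers, and the fixed iteration count are all assumptions made for illustration; the card itself does not name an algorithm.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group numbers around the nearest center, then move each center to its group's mean."""
    centers = list(centers)
    groups = [[] for _ in centers]
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for value in values:
            nearest = min(range(len(centers)), key=lambda i: abs(value - centers[i]))
            groups[nearest].append(value)
        # An empty group keeps its previous center.
        centers = [
            sum(group) / len(group) if group else center
            for group, center in zip(groups, centers)
        ]
    return centers, groups
```

Here similarity is simply numeric closeness; for richer objects the same idea applies with a different distance measure.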
Trying to characterize what has been discovered or trying to explain the results of the data mining process.
Description