Data warehousing/data mining Flashcards
What is data warehouse?
Data warehouse is a centralized repository that selectively aggregates relevant data from various operational databases to support strategic business analysis and decision-making. It’s not designed to store all data, but specifically, the data that helps in making business decisions.
What is ETL Process and its purpose?
ETL process involves extraction of information from internal and external databases, transformation of information to fit enterprise/corporate definitions, and loading into the data warehouse
What is Data Marts
Data marts are a subset of data warehouses, tailored for specific analytical needs, allowing for focused and efficient analysis
Explain this model of a typical data warehouse
“A typical data warehouse collects and refines different types of data from internal and external sources through ETL. It acts as a central hub for storing vast amounts of processed data. Specific, topic-focused data marts are then created from this central data for detailed analysis. The ultimate goal is to explore and mine this integrated data to gain valuable business insight
What is Data mining and its purpose?
Data mining involves analyzing large datasets to find hidden patterns and insights. Drilling down means looking into more detailed data, while ‘drilling up’ means summarizing data into larger categories. Data mining tools are used to perform these analyses, utilizing techniques like querying, reporting, and statistical analysis to support decision-making