Data warehousing Flashcards
What is a data warehouse ?
a system that aggregates data from one or more sources into a single consistent data store to support data analytics
What does on-premises mean ?
inside a building or on the area of land that it is on.
What does appliances mean ?
a device or piece of equipment designed to perform a specific task.
What do Data warehouse systems support ?
Data mining ,Artificial intelligence ,Machine learning, Front-end reporting , OLAP
Where are Data warehouses hosted ?
in the beginning :on premises
2000s :Appliances
2010 until present :Cloud
Who uses Data warehouses ?
pretty much every industry; E-commerce ,Transportation,Medical,Banking ,Social media ,Government
Name Data mart different possible structures
Relational database
Star or snowflake schema
What are the types of Data marts ?
Dependent ,Hybrid ,Independent
What is a Data pipeline ?
A data pipeline is a method in which raw data is ingested from various data sources and then ported to data store, like a data lake or data warehouse
What are the data marts purposes ?
-Timely relevant data
-Rapid query responses
-Cost efficiency
-Secure access
What’s a data mart ?
It’s an isolated part of the larger EDW,built to serve a business function,purpose or user community .
What is a Data lake ?
Exist as a repository of raw data straight from the source .
What does it mean scalable ?
Scalability is the property of a system to handle a growing amount of work. One definition for software systems specifies that this may be done by adding resources to the system. In an economic context, a scalable business model implies that a company can increase sales given increased resources
What does it mean curated ?
(of online content, merchandise, information, etc.) selected, organized, and presented using professional or expert knowledge.
What does it mean agile ?
able to move quickly and easily