Chp.5 Data Generation in Source systems Flashcards
Draw the data engineering lifecycle
A file is a ..?
sequence of bytes stored on a disk
source systems produce..?
data in several ways
Get familiar with your source system and how
it generates data
files may store…?
local parameters , events, logs, images, and audio.
Elaborate
files are the universal medium of …?
data exchange
What are the major file formats you will come across?
excel, csv, txt, json, xml
What are standard ways for exchanging data between systems.
API’s
typically an application database is an …?
online transaction processing system - OLTP
OLTP are referred to as ….?
transactional databases. Why?
OLTP db’s work well as …?
application backends when thousands or even millions or users might be interacting. Why ?
What does ACID stand for?
atomicity, constancy, isolation, and durability
With respect to ACID what does consistency relate to?
Consistency means that any database read will return the last written version of the retrieved item. Why?
what is a atomic transaction?
it is a set of several changes that are committed as a unit. Why?
in the data engineering : fundamentals data application stands for what?
applications that hybridize transactional and analytics workloads. Why?