Professional Documents
Culture Documents
Data scientists spending time and brainpower on applying data science and
analytic results to critical business issues - helping an organization turn data
into information - information into knowledge and insights - and valuable,
actionable insights into better decision making and game changing
strategies.
Data engineers are the designers, builders and managers of the information
or "big data" infrastructure. They develop the architecture that helps analyse
and process data in the way the organization needs it. And they make sure
those systems are performing smoothly.
Data Engineering Goal : The goal is to use the available data or generate
more data, and to thereby understand the process being investigated.
OLTP vs OLAP
Visual on OLAP
ACID Requirements
1. Atomicity Either all operations of the transaction are properly
reflected in the database or none are.
2. Consistency Execution of a (single) transaction preserves the
consistency of the database.
3. Isolation Although multiple transactions may execute concurrently,
each transaction must be unaware of other concurrently executing
transactions. Intermediate transaction results must be hidden from
other concurrently executed transactions.
Advantages
Management of distributed data with different levels of
transparency
The EMPLOYEE, PROJECT, and WORKS_ON tables may be fragmented
horizontally and stored with possible replication
Users do not have to worry about operational details of the network
Replication transparency:
Fragmentation transparency: