All Flashcards
DER algorithm
Step 1: replication
Step 2: local inner join
Step 3: select ROW ID of left table with no matches
Step 4: redistribute the ROW ID
Step 5: Store ROW ID that appears as many times as the number of processors
Step 6: Join
ROJA Algorithm
Step 1: reshuffle the data based on the join attribute
Step 2: each processor performs the local outer join
DOJA Algorithm
step 1: replication. we duplicate small table
step 2: local inner join
step 3: hash redistribute the inner join result based on attribute X
step 4: local outer join
OJSO
When joining 3 tables:
1. Do redistribution on join attribute (same as ROJA)
2. local join (same as ROJA)
3. redistribute joined table & third table based on the join attribute
(ignore dangling records)
4. local join
local join: scan cost
(after divide & broadcast)
(Optimizing main memory)
((R_i/P)+(S_i/P)-(M/P)) * IO
M = size of memory
Speed up
concerned with processing speed while the same workload
elapsed time on uniprocessor / elapsed time on multiprocessor
Scale up
concerned with increasing workload while maintaining processing speed
time on small system / time on large system
Downside of a shared-nothing architecture
load balancing becomes difficult
Downside of Shared-Memory & Shared-Disk architectures
suffers from memory and bus contention
Process activation or involvement of parallel search algorithms
Process activation or involvement of parallel search algorithms
Key comparison of parallel search algorithms
Divide and Broadcast […] join: Transfer cost
(S_i/P) x (n-1) x(m_p+m_l)
Divide and Broadcast […] join: Receiving cost
(S/P - S_i/P) x (m_p)
Divide and Broadcast […] join: Scan cost
(S_i/P) x IO