Distributioned Systems & Networks Flashcards

Question

What is CRC

Answer 1

CRC, or Cyclic Redundancy Check, is a error detection method which holds a checksum field in the frame, this is calculated on both ends and is compared This can be used on the link layer and other layers - for example, IPv4 packets hold such a checksum

Answer 2

Forward error correction includes error-correcting codes in frames which can be used to detect AND fix errors in the transmitted data

Answer 3

No, because it would be unnecessary overhead when ethernet is already reliable

Answer 4

Framing the process is adding indicators of where the frame starts and ends to the link layer data bits

Answer 5

Using FLAGS + Using a predefined flag, append the FLAG byte value to mark the start and end of the data + If the FLAG occurs in the actual data, use an escape byte + When receiving ignore the first escape byte

Answer 6

Access to and from the physical medium, this is typically specific to the type of physical layer

Answer 7

CSMA/CD Stands for Carrier Sence Multiple Access with Collision Detection It is used when using a single shared media in order to ensure that only one one sender is transmitting at any time

Answer 8

Sender listens to see if the media is busy, if it is wait When the channel is free transmit, while you transmit, listen and stop if a collision occurs Back off before retransmitting if a collision is detected

Answer 9

When a collision is detected for the first time, back off for a random amount of time between MIN and x. When you transmit again, if another collision is detected increase x (i.e. double it) and wait this random amount of time again

Answer 10

WiFi doesn't use CSMA/CD because WiFi devices generally can't send and listen at the same time. There is also the "hidden node" problem in which two devices can see the access point but not each other

Answer 11

CSMA/CA Carrier Sence Multiple Access with Collision Avoidance

Answer 12

When a frame needs to be transmitted wait until the channel is idle. When it's free transmit a Request to Send, if a Clear to Send is received back transmit the data, otherwise wait a random back off time

Answer 13

The Internet layer

Answer 14

The network layer

Answer 15

The network/Internet layer

Answer 16

Unique addressing and next hop routing

Answer 17

The IP Header

Answer 18

Packet switched - Its connectionless Unreliable - Packets are sent on a best effort basis Routed - Routers use a routing table to determine the path

Answer 19

Store and forward packet switching refers to the process of sending a packet to the nearest router, which should then parse it onto a router from its routing table. This should result in packets eventually arriving at the end process.

Answer 20

They prioritise certain traffic

Answer 21

Fragmentation is the process of spliting Protocol Data Units into smaller Protocol Data units for transmission so they are below the MTU value

Answer 22

MTU is the "Maximum Transmission Unit" and is the maximum size packet which the physical connection is able to transmit. Protocol Data Units which are larger than the MTU must be split into smaller packets

Answer 23

Internet protocol: + IPv4 + IPv6 Control and diagnostic protocols: + ICMP + ICMPv6 Encryption and Security + IPSEC Establishing of IPv4 multicast groups + IGMP

Answer 24

Omit the leading 0s Replace a single set of repeated 0 blocks with ::

Answer 25

2001:630:d0:f500::64

Answer 26

IPv4 allows packets to be fragmented at any routing hop, while IPv6 only allows packets to be fragmented by the sending host This means, if needed, IPv4 packets can be re-assembled at intermediate routers It also means that IPv6 must use Path MTU discovery before sending

Answer 27

That the first 48 bits are common to the network, /48 is called the prefix length

Answer 28

The first 16 bits are common to the network, this is called the subnet mask This can also be represented ass 255.255.0.0

Answer 29

Class A: /8 prefix with 16 million addresses Class B: /16 prefix with 65000 addresses Class C: /24 prefix with 256 addresses This was inefficient as, say you needed 258 addresses, you would get a /16 allocation with 65000 addresses consuming large ammouts of address space

Answer 30

CIDR stands for Classless Inter Domain Routing and allows for variable length prefixes to be used (in place of the 3 original allocation classes). This helped reduce IPv4 address consumption

Answer 31

Subnetting allows us to limit the propagation of ethernet broadcast traffic across a network and put hosts into segments This allows for larger IP allocations to be logically divided by, for example, buildings

Answer 32

256 Total addresses 253 are usable as .0 or .255 are reserved, and one (often .1 or .254) is used for the router

Answer 33

200 devices require a /24 subnet mask (max), giving the subnet 254 addresses 100 devices require a /25 subnet mask (max), giving the subnet 126 address So one example is 152.78.70.0 /24 (152.78.70.1 -> 152.78.70.254) 152.78.71.0 /25 (152.78.71.1 -> 152.78.71.127) 152.78.71.128/25 (152.78.71.129 -> 152.78.71.254)

Answer 34

/64 as there is no real need to go bigger or smaller

Answer 35

A router is needed any time there is a change in the address space

Answer 36

RFC 1918 is the name of the memo which defines the private address space. These are IPv4 addresses which are for internal use within networks and are not globally routable.

Answer 37

10.0.0.0/8 with 16 million addresses 172.16.0.0/12 with 1 million addresses 192.168.0.0/16 with 65000 addresses

Answer 38

NAT stands for Network address Translation, although it is commonly also used to refer to Network Address and Port Translation (NAPT) NAT allows one global IPv4 address to be shared between multiple hosts, for example, a home network will have 1 IPv4 from the ISP and then use RFC1918 internally

Answer 39

Carrier Grade NAT allows sharing global addresses between customers who get private addresses from a special range Customers then NATs that address to RFC1918 This should be done with the 100.64.0.0/10 block, but some abuse RFC1918

Answer 40

Routing describes how packets should move between different subnets

Answer 41

The Internet/ Network layer

Answer 42

Directly to a destination if its on the same local subnet A router

Answer 43

DHCP or IPv6 RA

Answer 44

Destination IP prefixes and the interface or next hop to use The local subnet which the host is connected to To default route

Answer 45

Thr route with the longest prefix will always be picked first, if two routes have the same prefix then that with the lowest metric is picked first

Answer 46

Prefix aggregation allows subnet prefixes to be aggregated with those of adjacent subnets

Answer 47

Routing protocols allow for routers to create their own routing tables

Answer 48

An AS is a large network or group of networks with a unified routing policy, these make up the internet

Answer 49

An ASN is an Autonomous System Number and is assigned by a Regional Internet Registry, each AS needs its own ASN

Answer 50

+ Multihomed + Transit + Single-homed/ stub

Answer 51

Interior Gateway Protocols are used within an Autonomous System, such as within a corporate network

Answer 52

Exterior gateway protocols are used between autonomous systems

Answer 53

+ Distance Vector - Talk only to neighbouring routers. + Link state - Talk to all routers on the network.

Answer 54

Each router talks only to directly neighbouring routers They then exchange the best route information for any known prefixes with direct neighbours

Answer 55

Each router talks to all other routers to establish full knowledge of the routers and topology in a site Routers flood information describing their connected neighbours around the entire site network

Answer 56

Metrics are simple hop count values limited to 15 Updates are not acknowledged Updates are only sent every 30 seconds Routers don't have knowledge of the network topology Authentication is MD5 which is broken

Answer 57

1. Discover neighbours and determine the cost metric 2. Flood messages with this information to all routers 3. Use received messages to build topology, computing shortest paths for prefixes served by any router These messages are sent periodically or when a change in connectivity is detected

Answer 58

Link state converges faster, allowing changes of topologies to be detected in seconds Link state is better at avoiding loops as every node knows everything

Answer 59

AS's advertise their network prefixes to neighbouring networks AS's can also offer to transit to other AS's

Answer 60

As the path is sent when routes are advertised we can detect and prevent loops

Answer 61

+ BGP Relies on trues + BGP Is too slow and takes a lot of effort to update + Routers have limited BGP Routing table sizes

Answer 62

A Default Router

Answer 63

A Routing Protocol

Answer 64

User Datagram Protocol

Answer 65

Transmission Control Protocol

Answer 66

The Transport Layer

Answer 67

A three-way handshake is used: + SYN is sent by the client with random sequence numbers + SYN-ACK is sent back by the server + ACK is sent by the client meaning a connection is established

Answer 68

Sliding window protocol: + The receiver has a limited incoming buffer size + The sender should not send data unless the receiver indicates it has space to receive + Otherwise the packet would need to be resent later

Answer 69

TCP - this allows for buffering ahead of the video

Answer 70

Internet Control Message Protocol is used in both IPv4 and IPv6 for information and error messages For IPv6 only it is also used for router advertisement and neighbour discovery

Answer 71

Multicast is One to Many communication, packets are only sent to hosts who are interested in them This is required for IPv6 and an add-on for IPv4

Answer 72

Address Resolution Protocol is used to map an IPv4 address on the local subnet to a MAC address The host looking for a MAC address broadcasts an ARP "who has request" and the target sends a unicast reply to the requestor

Answer 73

Dynamic Host Configuration Protocol automates the process of Address configuration for IPv4

Answer 74

Neighbour Discovery Protocol Maps IPv6 addresses on the local subnet to Mac addresses It uses ICMP and multicast

Answer 75

When a host connects to a network it broadcasts DHCP DISCOVER The DHCP server reserves an address and replies with a DHCP offer The client then needs to DHCP REQUEST the address The server sends a DHCP ACK containing the lease duration and config

Answer 76

StateLess Address AutoConfiguration allows a host to autoconfigure basic network settings without a DHCPv6 Server. The RA specifies whether this should be used or not

Answer 77

A 64 bit prefix determined from a router assignment A 64 bit generated host segment

Answer 78

Originally the hosts MAC address was padded and used This was a privacy nightmare however as hosts could be tracked across subnets

Answer 79

DHCPv6, this uses DHCP Unique Identifier instead of MAC addresses, using Solicity, Advertize and Reply This can work at the same time as SLAAC

Answer 80

To decide whether to forward the frame, and if so which port to forwards it to. The switches lean the ethernet MAC addresses of hosts seen on each switch interface or port

Answer 81

A Link layer broadcast message is sent asking "who has this IP address", this is seen by all hosts on the same ethernet LAN and the host with the target IP responds.

Answer 82

A broadcast domain is a method of restricting the number of global broadcasts which to stopfrom them flooding the network We can do this by splitting the domain up with routers which will not broadcast link-layer broadcasts.

Answer 83

Spanning tree algorithms allow physical loops to be made without breaking the network, determining least cost paths to the root and finding best paths

Answer 84

Virtual LAN's can use identifiers in the Ethernet frame to create multiple VLANs in one trunked uplink This avoids the need to physically re-cable and can be used to control broadcasts to certain areas

Answer 85

Domain Name Systems allow machines to look up the IP address of hostnames. It lives on the application layer

Answer 86

Telnet was simple unencrypted terminal emulation protocol which was replaces by SSH and SCP

Answer 87

Simple Mail Transfer Protocol is a protocol used to send email where a TCP connection is made to a mail server and

Answer 88

Internet Message Access Protocol is another email protocol which keeps a TCP connection open with the server

Answer 89

HTTP is text based protocol which uses text base messages

Answer 90

Quick UDP Internet Connections, this is a UDP protocol which allows TCP connections to be made over UDP

Answer 91

Constrained Application Protocol provides a HTTP like protocol for simpler devices with minimal overhead It uses binary GET/PUT etc commands making messages small and uses simple subscription methods

Answer 92

MQTT is a hierarchical protocol where messages get published to brokers and then get shared with any clients who are subscribed to the data streams All data is raw to avoid overhead

Answer 93

AAAA - IPv6 records A - IPv4 records These are used to store host name to IP conversions

Answer 94

Iterative - This means that the server will respond with a referral to another server Recursive - This means that the server will respond from the local cache or resolve the query before responding

Answer 95

One of the root name servers

Answer 96

Any cast allows clients to reach the nearest instance of a service, this means the same IP can be used at multiple points, and routers will learn of the nearest instance

Answer 97

As quartz resonates at a precise frequency we can count the number of oscillations to measure time

Answer 98

Pysical clocks count the number of seconds which have parsed while logical clocks count the number of events

Answer 99

To synchronize time with a time server

Answer 100

1) The client sends a request to the clock server for the time 2) The clock server responds by returning the clock server time 4) The client process receives the response and uses it to calculate the synchronized client clock time The new client time is the server time plus half the round trip time

Answer 101

1) Use a leader election process to chose a co-ordinator node 2) The co-ordinator requests the time from each node 3) The co-ordinator should use cristians algorithm to fetch the time from each node 4) The coordinator should calculate the average time difference and add it to the current time of the co-ordinator's clock 5) Broadcast the co-ordinators current time over the network

Answer 102

1) Ignore significant outliers when calculating the average time difference 2) I second leader should be pre-chosen incase the coordinator fails or corrupts 3) Broadcast the relative inverse time difference instead of the synchronized time

Answer 103

Stratum 0 is the level assigned to the highest precision clocks

Answer 104

1) Client sends (T_0, _, _, _) 2) Server receives and adds T_1 (T_0, T_1, _, _) 3) Server sends and adds T_2 (T_0, T_1, T_2, _) 4) Server receives and adds T_3 (T_0, T_1, T_2, T_3) The round trip network delay can then be worked out as (T_3 - T_0) - (T_2 - T_1)

Answer 105

Slowly adjust the clock (slewing) Reset the clock (stepping) Panic

Answer 106

event a happens before event b

Answer 107

When ONE of the following is true: + When a and b occured at the same node and a occurred before b in that nodes local execution order + When a is the sending of some message m and b is the receipt of the same message m + When there exists an event c such that a -> c and c -> b

Answer 108

In this case a and b are concurrent, this is written as a || b

Answer 109

The number of events which occur

Answer 110

If a has a causal dependency on b then a might have had some role in causing b (e_1 -> e_2) => (T(e_1) < T(e_2))

Answer 111

When an event occurs or a message is sent, t is incremented by one If a message is received the t value is set to max(message_t, current_t) + 1

Answer 112

L(e) refers to the value of t after an increment caused by event e

Answer 113

if a->b then L(a) < L(b)

Answer 114

With Lamport clocks and L(a) < L(b) we can't tell which happened before or whether the events are concurrent

Answer 115

Node N_i increments T[i] by one when an event occurs at it

Answer 116

For each j in 1 to n T[j] := max(T[j], T'[j]) + 1 T[i] := T[i] + 1

Answer 117

Mutual exclusion is the name given to the process of concurrency control when running multiple processes in parallel

Answer 118

The critical section is the name given to variables or resources which can be accessed by more than one code segment

Answer 119

Because the variables in it need to remain consistent, and the value depends on the sequence of execution of instructions

Answer 120

+ No Deadlocks + No Starvation - Every site which want to execute in the critical section should get the chance to + Fairness - Every site should have a fair chance to execute in the critical section + Fault Tolerance

Answer 121

Safety properties ensure bad things don't happen

Answer 122

Liveliness properties ensure good things do happen

Answer 123

Fairness properties ensure that access to the critical section is done in a fair order (duh)

Answer 124

They: + Minimise the number of messages sent to each entry or exit op + Minimise client delay when entering or exiting the critical section + Minimise synchronization delay between when a process exits the critical section and the next one enters

Answer 125

Token Based Algorithms Non Token Based Algorithms Quorum based approach

Answer 126

+ A Unique token is shared among all sites + A site can only enter the critical section if it has this token + Sequence numbers are used to order requests

Answer 127

With this one node is chosen as the coordinator, when a node wants to enter the critical section must request the token from the coordinator, and only get given the token once its been released by the last process to use it

Answer 128

Each site in the system has a queue of tasks which it needs the critical section for, it also has a pointer to the next site in the ring The token is cycled around the ring, when held by a site the site can complete one of the tasks from its queue before handing the token to the next site in the queue

Answer 129

No, because order is based on the position of the token and the shape of the ring

Answer 130

The server acts as a single point of failure and bottleneck

Answer 131

Each node should be given a parent A child node can only send requests to its parents Each node has a FIFO queue of requests If any node is forwarding privilege to other nodes and has a non-empty queue, it forwards a request message

Answer 132

1. If a node i (not holding the token) wants to use the token, so that it can enter the critical section, it sends a request to its parent, node j. – If node j FIFO queue is empty, node j shifts i into its FIFO queue; j then issues a request to its parent, k, that it desires the token – If node j FIFO queue is not empty, it simply shifts i into the queue 2. When node k has token and receives the request from j it sends token to j and sets j as its parent 3. When node j receives the token from k, it forwards the token to i and i is removed from the queue of j – If the queue of j is not empty after forwarding the token to i, j must issue a request to i in order to get the token back

Answer 133

Whichever node is holding the token is the root

Answer 134

Non Token based algorithms use timestamps using logical clocks

Answer 135

The site ID and the timestamp

Answer 136

A queue and a site ID

Answer 137

Send a broadcast to all sites containing the site ID and timestamp All sites which receive this will add it to the queue and respond if and only if a) the receiving process is not currently interested in the critical section b) The receiving process has a lower priority based on the timestamp The requesting site waits until all sites have replied before it uses the critical section When the requesting site has exited, it can send any deferred response messages

Answer 138

Queue defaulting to [] Boolean voted = TRUE | FALSE defaulting to FLASE state = WANTED | RELEASED | HELD defaulting to RELEASED

Answer 139

The request subset is the set of processes that a process must request permission to enter the critical section from

Answer 140

Set state to WANTED Multicast "request" to all processes in R_i Wait until k reply messages are received Set state to HELD

Answer 141

Set state to RELEASE Multicast release to all processes in R_i

Answer 142

If the state is HELD or voted is TRUE then Queue request Else Send a reply to p_1 and set voted to TRUE

Answer 143

If the Queue is empty then set voted to false else pop the head of the queue and call it p_x Send p_x the reply Set voted to true

Answer 144

Use vector clocks instead of lamport clocks

Answer 145

2 sqrt(N) for entry and sqrt(N) for exit

Answer 146

1) When each process is roughly the same 2) When the cluster is performing a complex task which requires close collaboration 3) When the system executes many distributed writes to a disk and requires good consistency

Answer 147

It marks itself as a participant and sends a election message with its identifier to its left neighbour

Answer 148

It forwards the message

Answer 149

IF it is currently a non-participant THEN It marks itself as a participant It substitutes its own identifier It forwards the election message ELSE It does nothing

Answer 150

It declares itself a leader It marks itself as non-participant It sends an elected message with its identifier to its left neighbour

Answer 151

It marks itself as non-participant an notes the identifier of the leader from the message If the identifier is not its own it parses the message on If the identifier is its own it can act as leader

Answer 152

N election messages followed by N elected messages Total messages: 2N messages This happens when the node which initiated the election has the highest identifier

Answer 153

N-1 election messages (to get to the highest ID), followed by N election messages of ID N, followed by N elected messages Total Messages: 3N-1 This happens when the node to the right of the initiator has the highest ID

Answer 154

The participant flag triggers a process to stop election messages with lower identifiers If every process started an election at the same time, and identifiers in the ring are ordered, N-1 additional elections would take place causing N(N-1)/2 additional messages to be sent

Answer 155

Election message to announce an election Answer Message to respond to election messages Coordinator message sent by the winner of an election to announce victory

Answer 156

It broadcasts a Coordinator message notifying that it is the coordinator now

Answer 157

It sends an Election message to all processes with higher IDs than itself If P receives an answer from a process with higher ID than itself it sends no further messages and waits for the coordinator message If there is no answer after sending the election message within a period of time, then the process broadcasts a Coordinator message and becomes the coordinator

Answer 158

It sends an answer message back, if an election is not already started it will start the election message at the beginning by sending election messages to higher-numbered processes

Answer 159

The best case occurs when the failure of the coordinator was detected by the process with the second highest identifier, this can elect itself as the leader in one message One message

Answer 160

The worst case occurs when the failure of the coordinator is detected by the process with the lowest identifier triggering all other processes to begin elections This sends N(N-1) messages (or O(N^2)), giving a turnaround time of O(N) as two messages need to be sent, (and a timeout)

Answer 161

If a process fails during a bully election, there may end up being two coordinators

Answer 162

Concurrency No Global Clock Independent Failures

Answer 163

Components of they system are located at computers Computers are interconnected by a network Components communicate by passing messages over the network

Answer 164

Processes - which communicate through inter-process channels Nodes - which are used if process abstraction is not available Threads - which are used as lightweight execution means

Answer 165

We can abstract problems for distributed systems to consider Objects which we can share around the system and model the entitys of the problem Components, of the system, which use Objects Web services which provide a means of accessing the system

Answer 166

Message Passing Sockets

Answer 167

+ Request reply protocols such as HTTP + Remote Procedure Call which request the execution of a remote function and receive the response + Remote method invocation - Matched with problem-oriented communication entities these are an object-oriented version of remote procedure call

Answer 168

Replication Partitioning

Answer 169

Pros: Better fault tolerance and load balancing Cons: There a need to ensure data consistency

Answer 170

Pros: No overhead for data consistency Cons: No real fault tolerance

Answer 171

A proxy server is able to provide caching of recently used data objects closer to the client

Answer 172

+ Layering + Tiering + Thin clients + Proxying + Brokerage

Answer 173

To hide details of software application from the higher levels of a system

Answer 174

Tiering organises separate layers of functionality onto separate servers, for example clients could connect to tier 2 application servers which can then connect to teir 3 database servers

Answer 175

Consensus refers to the process of reaching an agreement among piers in a distributed system

Answer 176

All processes must eventually terminate All processes must agree on the same value It must be done with integrity -- If all correct processes propose the same value or action then any correct process that has decided must choose that value or action.

Answer 177

1. Each process can propose its value for consensus and multicast it to all other processes 2. Upon receiving proposals each process can compare the proposed values and attempt to converge towards a single value based on rules or conditions 3. Through repeated rounds of this proposal and communication processes are able to converge towards a consensus value which satisfies the criteria of the consensus problem

Answer 178

Unicast: A message is sent from one sender to one recipient Broadcast: A message is send from one sender to all recipients in the network Multicast: A message is sent from one sender to a specific group of recipients

Answer 179

Application Layer multicast Network-assisted multicast

Answer 180

IPv6 is best for multicast traffic as the specification was designed with it in mind, whereas IPv4 had it as an afterthought

Answer 181

No, multicast messages may not arrive in the order they were sent Messages may also fail to reach some or all of the intended recipients

Answer 182

+ FIFO multicast + Causal multicast + Total order multicast + FIFO-Total order multicast

Answer 183

If m1 and m2 are broadcast by the same node and broadcast(m1)->broadcast(m2) then m1 must be delivered before m2

Answer 184

If broadcast(m1)->broadcast(m2) then m1 must be delivered before m2

Answer 185

Total order multicast means that if m1 is delivered before m2 on one node, then m1 must be delivered before m2 on all nodes

Answer 186

FIFI-Total order multicast is a combination of FIFO multicast and total order multicast

Answer 187

Broadcast algorithms ensure that messages are reliably sent to all nodes, that every node receives the message and that the messages are delivered in a specific order

Answer 188

That each time a node recives a message for the first time it rebroadcasts it to each other node via reliable links This is reliable but can cause up to O(n^2) messages for n nodes

Answer 189

Basic multicast allows a single sender to transmit data to multiple recipients at the same time, this guarantees that if the multi caster does not crash a process will eventually deliver the message This can have open or closed groups, where a closed group means that only group members can multicast

Answer 190

If the sender crashes in the middle of the send loop then only some of the processes will receive the message

Answer 191

B-multicast(g, m) - For each process p in group g send message m using send(p, m) B-receives(m) - deliver the message to the process

Answer 192

1) Use B-Multicast(g, m) to send the message to all processes in the group including itself 2) When a message m is received for the first time each recipient must multicast m to the group 3) After multicasting, deliver m to the process

Answer 193

That each non faulty process delivers message m at most once

Answer 194

That if one correct process delivers m, all others in the group will eventually do so too

Answer 195

That any correct process multicasting m will eventually deliver it, this ensures sender livelyness

Answer 196

Yes, as it has: + Integrity (as it detects duplicates) + Validity (as a correct sender will eventually R-deliver a message) + Agreement (as if a sender crashes before delivering the message either it didn't get sent to anyone or was received and will be re shared by one)

Answer 197

R-Multicast provides no guarantee for the order of delivery

Answer 198

Each process keeps a sequence number for each of the other processes. When a message is received, we check the message number, + if this is as expected we accept and increment + If this is higher than expected we buffer it in a queue + If it is lower than expected we reject

Answer 199

O(N^2) B-multicast messages

Answer 200

S^p_g holding the number of messages p has sent to g R^q_g holding the number of the latest group g message from q A hold back queue of messages

Answer 201

p increments S^p_g by 1 p piggy-backs the value of this onto the message p B-multicasts this message m to g

Answer 202

GIVEN R^q_g is the sequence number of the latest group g message delivered from q IF (S == R^q_g + 1) THEN p FO-delivers m and increments R^q_g ELSE IF ( S > R^Q_g + 1) THEN p places the message in the hold back queue until the messages which should have been received are delivered

Answer 203

Vector timestamps These count the number of multicast messages already delivered from each process

Answer 204

The processes entry in the timestamp is incremented

Answer 205

One dedicated sequencer, which is responsable for giving an auto-incremented sequence number for each total order multicast message This maintains a S_g value which is the current sequence number in group g

Answer 206

SENDER: Add an id `i` to m and B-multicast to both g and the sequencer GROUP RECIPIENT of : Add to the hold back queue SEQUENCE RECIPIENT of : B-multicast(g, <"order", i, S_g>) and then increment S_g GROUP RECIPIENT of <"order", i, S>: Wait until is in the hold back queue and S=R_g, when this is the case deliver the message to the process, delete it from the holdback queue and increment counter R_g

Answer 207

F+2 processes as we need at least 2 to form a consensus F+1 rounds as their may be failures during the rounds

Answer 208

Clients send requests to primary servers which are connected to the backup server, when a primary server wishes to execute an action it also executes it on the backup servers

Answer 209

Consistency - All replicas of the same data object always have the same state Availability - Requests are always served so long as at least one server is available Partition tolerance - The data store keeps working even if servers are partitioned

Answer 210

A CP system ensures that data is always consistent and there is a high tolerance for failures, this can result in actions such as concurrent read write failing

Answer 211

AP Systems only garuntee eventual consistency, asynchronus periods mean that read operations can return non consistent values

Answer 212

CA systems give up partition tolerance. This means they assume that partitions cannot occur and everything is on one partition

Answer 213

The goal of loose coupling is to minimise the dependencies between different web services It can be achieved by focusing on interfaces rather than implementation, designing simple and generic interfaces, and by opting for asynchronous communcation

Answer 214

Asynchronous - good for time consuming operations where replies can be received later on Synchronous - Good for interactive, fast operations

Answer 215

Simple Object Access Protocol defines how synchronous and asynchronous interactions over the internet should take place, how to use XML to represent the content of messages and how messages should be exchanged

Answer 216

One request, as REST should be stateless

Answer 217

A transaction is a set of related, sequential operations which need to be executed atomically

Answer 218

Atomicity - a transaction must be all or nothing Consistency - a transaction takes the system from one consistent state to another Isolation - each transaction must be performed without interference from other transaction Durability - After a transaction has completed successfully all of its effects are saved in permanent storage

Answer 219

Two concurrent operations conflict if their combined effect depends on the order in which they are executed, a read and a read on the same object does not conflict, but a read and a write does

Answer 220

It means that all pairs of conflicting operations in the two transactions must be executed in the same order for all the objects they both access

Answer 221

Locks Optimistic concurrency control Timestamp ordering

Answer 222

Read Phase: Read the version of the data item's which need to be modified Validation Phase: Check the data items accessed by the transaction to see if they have been modified by any other transactions since the transaction began, if the check fails roll back Commit Phase: If the validation phase succeeds write the changes with a new version to reflect the updates

Answer 223

Replace clients AND servers with peers Replace just servers as peers

Answer 224

High level of churn refers to the independent arrival and departure of thousand or millions of peers

Answer 225

Peers are incentivised to upload chunks by giving preference to client peers which act as server peers, and chocking those which don't upload enough

Answer 226

Use its hash to find the node in the DHT that acts as the tracker for the file + Use this tracker to learn which machines store the file which is being looked for

Distributioned Systems & Networks Flashcards

(262 cards)