exam_2017 Flashcards by Deleted Deleted

Using examples from NFS, explain why distribution transparency and failure transparency are impossible or hard to achieve

Distribution transparency, because a user can’t tell whether NFS is down or the network is badly congested, because it’s impossible to distinguish between a dead process or a slow responding process. These communication latencies can’t be hidden.
Failure transparency, because a user can’t tell whether the server performed the operation if the NFS crashed.

How well did you know this?

Not at all

Perfectly

What are scaling techniques?

Bigger Machines
Virtualisation
Asynchronous communication
Replication & Caching
Partitioning
Software Optimisation

How well did you know this?

Not at all

Perfectly

A mathematical proof states that reliable failure detection is impossible. Why then is it possible to do reliable failure detection in practice?

By using flooding consensus, an agreement is reached in which a selected leader gets accept messages from a quorum of servers. Two examples (fail-noisy methods) are Paxos, which is used by Google, or the more understandable and formally proven correct protocol: Raft.
By making agreements on a certain amount of time after which a server is considered down.

How well did you know this?

Not at all

Perfectly

What is consensus?

The process by which we reach agreement over system state between unreliable machines connected by asynchronous networks

How well did you know this?

Not at all

Perfectly

What is Paxos trying to solve?

How do we reach agreement over a single value in a scenario where failures might occur

How well did you know this?

Not at all

Perfectly

What are Paxos stages?

It is essential to have a multi-state process.

Promise and commit
Majority agreement
Monotonically increasing numbers

How well did you know this?

Not at all

Perfectly

Why is reliable failure detection important for consensus in a process group?

To achieve overall system reliability in the presence of a number of faulty processes, or else a process may wait infinite time for a response.

How well did you know this?

Not at all

Perfectly

How does asynchronous communication help to build large systems? Give an example.

Async communication helps because systems don’t have to wait on each other to send bits over the line. A start and stop bit let the client know that the information is complete. Downloading or sending files or emails are examples of async communication.

How well did you know this?

Not at all

Perfectly

What are the 4 types of servers for google search?

Root
Cache
Parent
Leaf

How well did you know this?

Not at all

Perfectly

Scaling techniques for types of servers google search?

root: software optimisation
Cache: replication/caching
Parent: partitioning
Leaf: partitioning

How well did you know this?

Not at all

Perfectly

Functions for root, cache servers google search?

root: handles browser requests, acts as front-end web server
cache: Stores temporary requests

How well did you know this?

Not at all

Perfectly

Functions for parent, leaf servers google search?

parent: distribute queries as in a multi-level tree
leaf: index/doc requests are handled from in-memory data structures

How well did you know this?

Not at all

Perfectly

What are the pros of in-memory indexing systems?

Big increase in throughput.

Big decrease in query latency

How well did you know this?

Not at all

Perfectly

Issues of in-memory indexing systems?

Variance: query touches 1000s of machines, not dozens
Availability: 1 or few replicas of each doc’s index data
Queries of death

How well did you know this?

Not at all

Perfectly

What are canary requests?

Request to check health status of a machine. You send a request to check if it works on one server first, if it fails unexpectedly, try another machine (could be coincidence). If fails K times, reject request

How well did you know this?

Not at all

Perfectly

What does the repository manager do?

Study These Flashcards

Coordinates index switching as new shards become available

What were the problems with traditional google search system?

Study These Flashcards

More collections to search besides web. For example, Google Maps. You need more real-time results

How was creating the index done first?

Study These Flashcards

It was a batch process via MapReduce.

Store all documents in GFS
Run several MapReduce jobs to create index
Upload index to Leaf servers

What was the problem with the MapReduce index method?

Study These Flashcards

New documents would not show up in search results for 2-3 days.

What solutions replaced mapreduce

Study These Flashcards

Data storage system: Colossus / BigTable

Event-driven, incremental processing: Caffeine / Percolator

What is BigTable?

Study These Flashcards

A distributed storage system. A given table is a three-dimensional structure containing cells indexed by a row key, a column key and a timestamp. Each table may consist of many tablets. It’s typically used to replicate data to multiple bigtable clusters in different datacenters.

What makes BigTable scalable?

Study These Flashcards

There is no versioning (timestamp is the version)
Automatic resource management (less manual labor and instant resource availability)
Tablets in table split if getting too big
Different machines can handle different tablets, which results in the workload divided equally over resources

What makes caching one of the most efficient scaling techniques?

Study These Flashcards

A couple machines can do the work of a substantial amount of machines. It reduces network traffic, access latency, workload of server, and the robustness of the service is enhanced and the access time is shorter

What is the disadvantage of caching?

Study These Flashcards

Big latency spike/capacity drop when complete index updated or cache flushed.
- In some cases the data might be outdated, though there are methods to prevent this.

Cache misses increase lookup time because there’s already time spent looking into the cache.

What are the benefits of IaC?

1. Disaster recovery (much quicker because complete config is stored as code) 2. Consistency 3. Speed (tasks can be done in parallel) 4. Version control 5. Risk of bugs can be reduced 6. Cost, one can do the work of many

How is IaC vital to CI/CD?

The idea is that CICD was originally only focused on the applications and they made assumptions about the infrastructure. With IaC, the test and production environment are equal because both environments are built from the same definition files. Because the infrastructure is defined as code, it can be spinned up automatically and therefore every small patch can be pushed to the production environment continuously

What are the powers and names?

``` 2^10=1 Kilobyte 2^20=1 Megabyte 2^30=1 Gigabyte 2^40=1 Terabyte 2^50=1 Petabyte ```

How many MB in 1 petabyte

2^30=1073741824. | 1 with 9 numbers

What are the 5 main stages within the ITIL Framework?

``` Service Strategy. ... Service Design. ... Service Transition. ... Service Operation. ... Continual Service Improvement. ```

In the DevOps philosophy, operations people should be more involved in the earlier stages of the life cycle of a service as defined by ITIL. For the first 3 stages explain where and how operations people can contribute.

Strategy: defining the business model Design: What are the requirements, what does dev operations need? Are we going to use a ticket system, what are the service hours, SLA, do we need replication etc. Transition: Do we agree? Is this what we wanted? Since operations is closer to end-users, they may select users to test the product (pre release testing operations)

Name two assurances that a Change Request Board (CRB) provides.

- Assisting assessment/prioritise/approve requested changes into the live environment - Approved changes are managed in a rational and predictable manner to ensure all changes meet the quality requirements by enforcing change and release policies and procedures.

Give 3 arguments why DevOps can reduce the need for a Change Request Board.

Many aspects are being covered by DevOps: - Change-management processes - Establishing definitions/standards for rating change risks - Rejecting or approving change requests - Coordinating post-change activities

Give 2 examples of how DevOps can make it easier to meet Launch Readiness Criteria.

Make sure the service is monitored, SLA is defined, backups and restores are tested/working, user documentation/training is complete

exam_2017 Flashcards

(33 cards)