exam_2021-retake Flashcards by Deleted Deleted

What is the AFK scale cube?

It is a model for segmenting services, defining micro-services and scaling products.

How well did you know this?

Not at all

Perfectly

What are the axis’ of AFK scale cube?

X-Axis: Horizontal duplication and cloning of services and data
Y-Axis: Functional Decomposition and segmentation
Z-Axis: Service and data partitioning along customer boundaries

How well did you know this?

Not at all

Perfectly

What is horizontal duplication?

One monolithic system/service > many systems, each a clone and load balanced

How well did you know this?

Not at all

Perfectly

What is y-Axis

Split by function, method, service or dissimilar things

How well did you know this?

Not at all

Perfectly

What is Z-Axis?

Split by similar things. For example, customers

How well did you know this?

Not at all

Perfectly

Q1.A Explain the difference between service partitioning and data partitioning inthe AKF scale cube, and give an example of each

Splitting services is on the Y-Axis according to AFK scale cube, data partitioning on the Z-Axis. You can split services by separating the webserver from the database. You can do data partitioning by splitting a database (sharding) based on customer ID’s for example.

How well did you know this?

Not at all

Perfectly

Q1.B At one point, Google search introduced extra layers between the web frontend and the servers holding the partitions of the index. Why did they have to introduce these layers?

They introduced caching servers. These have a hit rate of 30-60% and are capable of handling a whole lot of traffic

How well did you know this?

Not at all

Perfectly

What is a race condition?

When the code is trying to do two or more things at once, and the result changes depending on the order they occur in

How well did you know this?

Not at all

Perfectly

What is one simple solution against race conditions?

To place all the requests in a queue and refuse to answer any requests until the previous one is completed

How well did you know this?

Not at all

Perfectly

What is the problem with placing all requests in a queue and refusing any further requests to prevent race conditions?

It doesn’t scale. That’s how old computers work, single-threaded.

How well did you know this?

Not at all

Perfectly

When is using a queue and refusing further requests a good solution

If you absolutely must count everything accurately, in real time. For example, a large festival with lots of requests where it can’t allow collisions.

How well did you know this?

Not at all

Perfectly

What is alternative to using this queueing system that does scale well?

Eventual consistency

How well did you know this?

Not at all

Perfectly

What is eventual consistency?

Each server holds it’s own count. It will update the central system when there’s time to do so. (it can also be seconds apart, doesn’t have to be hours).

How well did you know this?

Not at all

Perfectly

When is eventual consistency a bad design choice?

When a change needs to be made immediately. For example, the privacy settings of a youtube video (private or public)

How well did you know this?

Not at all

Perfectly

Why are youtube views not accurate?

This is because of caching. Caching holds the data and serves it to the customer quickly. A site like youtube has many, many caching servers and each time you can be routed to a different caching server.
Eventually, eventual consistency will take place and everything will be sorted out at some point

How well did you know this?

Not at all

Perfectly

Q1.C Google Search replaced batch processing to create the index to a more incremental method of keeping the index up-to-date. Why did they want to make this replacement?

They wanted to make this replacement because the batch process via MapReduce resulted in documents not showing up in search results for 2-3 days. They needed a lower “time from crawl-to-search-hit”. Solution was:

New data storage system: Colossus / BigTable
Event-driven, incremental processing: Caffeine / Percolator

How well did you know this?

Not at all

Perfectly

What is batch processing?

It gives you the ability to execute multiple operations in one request, rather than having to submit each operation individually

How well did you know this?

Not at all

Perfectly

Q1.D What problem is avoided by Google by time-stamping the contents of a BigTable cell?

Because of versioning by timestamps there are no write-write conflicts on a cell. As we will see: when replicated, eventual
consistency is used.

How well did you know this?

Not at all

Perfectly

Q1.E Briefly explain why virtualisation is considered a scaling technique?

Study These Flashcards

It lets you run multiple operating systems and applications on a single server, consolidate hardware to get higher productivity from fewer servers and simplify the management, maintenance and the deployment of new applications

Q2.A The parameters of a remote procedure call can be of 3 types: in, out, or inout. What are these?

Study These Flashcards

in: object is transffered from client to service, only used for inputs
out: object is transferred from client to service only used for outputs
inout: object is transferred from client to service used for both inputs and outputs

Q2 TODO

Study These Flashcards

TODO

Q3.A You are the administrator of an application that consists of web serverrs on 3 machines and a database on 1 (seperate) machine. The vendor of the application wants to change the application components (web server, database) from running natively to running inside a container. Discuss how this change would affect the performance of your application

Study These Flashcards

There are some cases in which virtualisation offers performance benefits but these are quite rare, normally. Typically, the overhead of a hypervisor is around 1-5% of CPU and memory overhead.
By using some of VMware’s memory management techniques you can eliminate the memory overhead IF, and only IF, you are using multiple VM’s onto hosts. Using 1 VM will always be slower.
HOWEVER, while you lose some slight performance, virtualisation is about management. With virtualisation you can easily scale to 20 or 40 VM’s on each host

Q3.B TODO

Study These Flashcards

TODO

Q3.C Classify firecracker into the virtualisation taxonomy discussed in class

Study These Flashcards

These are system VMs capable of running a the same ISA (Linux)

What is firecracker

Open source virtualisation technology that enables you to deploy workloads in lightweight virtual machines called microVMs which provide enhanced security and workload isolation over traditional VMs

What is the virtualisation taxonomy?

``` You have Process VMs: - Same ISA - Different ISA System VMS: - Same ISA - Different ISA ```

Q3.D Give one similarity and one difference between firecracker and unikernels

The similarity is that they both offer significant performance improvements by excluding unnecessary devices and guest functionality. The difference is that firecracker runs in user-space and unikernels in kernel-space

What are unikernels?

They optimise a VM for one application. You strip unused parts of OS and libraries

Where does firecracker run in?

It runs in user space

What is gVisor?

Provides a virtualised environment in order to sandbox containers.

gVisor advocates itself as third secure environment for running containerised applications, next to machine-level virtualisation and rule-based execution (such as SELinux). What disadvantages of the two others does it try to solve?

gVisor intercepts application system calls and acts as the guest kernel, without the need for translation through virtualised hardware.

What are the different layers of gVisor?

1. Machine-level virtualisation (such as KVM, XEN), exposes virtualised hardware to a guest kernel via a virtual machine monitor 2. Rule-based execution, such as SELinux. 3. Intercepts application system calls and acts as the guest kernel, without the need for translation through virtualised hardware.

4.A If we did not have dedicated configuration management tools like Ansible and Puppet, could an organisation still use Infrastructure as Code? Briefly argue why or why not?

Yes, IaC is the process of managing and provisioning computer data centers through machine-readable definition files. You don't need a tool for this, it's more about the methodology behind it which can help reduce cost speed and risks

What are the advantages of IaC?

Cost Speed Risk

What is cost advantage IaC?

By removing the manual component, people are able to refocus their efforts towards other enterprise tasks

What is speed IaC?

Infrastructure automation enables speed through faster execution when configuring your infrastructure and aims at providing visibility to help other teams across the enterprise work quickly and more efficiently

What is risk IaC?

Automation removes the risk associated with human error, like manual misconfiguration; removing this can decrease downtime and increase reliability

Q4.B Some daemons reload their configuration files when they sense a change, causing downtime for that daemon. How do configuration management tools avoid this side effect?

I believe tools such as ansible have configurable parameters to specify whether daemons should be related. The default option is not to do so

Q4.C Name 3 important pieces of information that IaC should record about a change

The change itself, the timestamp and the initial version

How can you improve communication?

Chatrooms (force people to be online) | Virtual and physical standups

Q4.D Explain why duvel beer is an important tool in DevOps

You want smaller changes to be happening throughout the week and not one massive change on a friday afternoon when everyone will go for drinks after?

What is idempotence?

No changes when the same state is applied again

Q5.A In your new job you are asked to migrate an "All Eggs in One Basket" server to setup with different virtual machines for each service. Briefly describe how you would perform such a migration?

Before migrating, I would ensure that I first create a backup. Then I would create a clone, so that you have the system still running, and a backup in case it goes down. Inside the clone I would setup the virtual machines which should be a machine each for each service to improve performance and it also gives management + security benefits. This should be done following IaC so that it's reproduce-able. After the set up is fully working and tested, I would create a new VM to test if the whole configuration works. Then, the IaC can be used to configure a production instance (first shadowing the initial system). If all is well, it can replace the initial system. Again, a backup should be made of the new system.

Q5.B The book advises against using desktop hardware to run services. Yet google and other big services have been built around such hardware. Briefly describe how one can build such services from desktop components.

1KW = 3412 BTUs So, 3.52 x 3412 = 12010 of cooling 12010 / 1725 = 6.96 So you can cool about 7 servers (if, and only if, they are not all generating the full heat for a longer period of time, which is unlikely)

How much BTUs is 1KW

1KW = 3412 BTUs

Why do you, as an IT department providing a service, have to be very careful what goes into a SLA between you and a business unit in your company

The SLA is a commitment between a service provider and a client. It defines the level of service being sold in plain language terms with definitions about mean time between failures, mean time to repair or mean time to recover. You also identify which party is responsible for reporting faults or paying fees.

exam_2021-retake Flashcards

(46 cards)