Block 4 part 1 Flashcards

1
Q

What is the key metric availability, and how do we calculate it?

A

Availability is the probability that an application, service, or system is available for use.

A = uptime ÷ (uptime + downtime)
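
As a quick check of the formula, here is a minimal sketch in Python. The function name and the sample figures (one year of 8760 hours with 8.76 hours of downtime) are made up for illustration and do not come from the slides.

```python
def availability(uptime_hours: float, downtime_hours: float) -> float:
    """Return availability as a fraction: uptime / (uptime + downtime)."""
    total = uptime_hours + downtime_hours
    if total <= 0:
        raise ValueError("total observed time must be positive")
    return uptime_hours / total


# Hypothetical figures: 8760 h in a year, of which 8.76 h were downtime.
a = availability(8760 - 8.76, 8.76)
print(f"Availability: {a:.3%}")
```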

2
Q

Give me examples of planned downtime

A

Backup and restoration; hardware, OS, and network upgrades; application and database maintenance

3
Q

Give me examples of unplanned downtime

A

environmental factors, app errors, operator and user errors

4
Q

What is the primary cause of downtime in data centers

A

UPS (uninterruptible power supply) system failure

5
Q

What is the average annual loss per company, according to company size?

A

Small: 221 817
Medium: 450 000
Large: 927 823

6
Q

How do we calculate reliability

A

See slide 13.
Reliability is the ability of a system to perform its function under given conditions for a specified period of time.

7
Q

What is mean time to failure (MTTF)?

A

A measure of reliability for items that cannot be repaired.
MTTF = (test period × number of items under test) ÷ number of items that fail

see example on slide 14
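
The MTTF formula can also be tried out with a small sketch. The numbers below (1000 items tested for 500 hours, 4 failures) are hypothetical and are not the slide 14 example.

```python
def mttf_hours(test_period_hours: float, items_under_test: int, items_failed: int) -> float:
    """MTTF = (test period x number of items under test) / number of items that fail."""
    if items_failed <= 0:
        raise ValueError("at least one failure is needed to estimate MTTF")
    return test_period_hours * items_under_test / items_failed


# Hypothetical test: 1000 items run for 500 hours, 4 of them fail.
print(mttf_hours(500, 1000, 4), "hours")
```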

8
Q

What is the annualised failure rate (AFR)?

A

AFR = (8760 hours per year ÷ MTTF in hours) × 100%
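
A minimal sketch of the AFR calculation, assuming the common form AFR = 8760 ÷ MTTF (in hours), expressed as a percentage. The MTTF of 1 000 000 hours is a made-up figure for illustration.

```python
HOURS_PER_YEAR = 8760


def afr_percent(mttf_hours: float) -> float:
    """Annualised failure rate as a percentage, assuming AFR = 8760 / MTTF."""
    if mttf_hours <= 0:
        raise ValueError("MTTF must be positive")
    return HOURS_PER_YEAR / mttf_hours * 100


# Hypothetical drive with an MTTF of 1 000 000 hours.
print(f"AFR: {afr_percent(1_000_000):.3f}%")
```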

9
Q

How do we calculate how many drives will fail in one year using the AFR?

A

Number of failures = number of drives × AFR
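
The same calculation as a short sketch. The fleet size (10 000 drives) and the AFR (0.876%) are hypothetical values chosen only to show the arithmetic.

```python
def expected_failures(num_drives: int, afr_percent: float) -> float:
    """Expected number of drive failures in one year: drives x AFR (AFR given in percent)."""
    return num_drives * afr_percent / 100


# Hypothetical fleet: 10 000 drives with an AFR of 0.876%.
print(f"Expected failures per year: {expected_failures(10_000, 0.876):.1f}")
```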

10
Q

What is inherent availability, and how do we calculate it?

A

Inherent availability is the predicted availability of a system that has not yet been built.

Ai = MTTF ÷ (MTTF + MTTR)

MTTR is the mean time to repair (or replace), expressed as a time.
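
A small sketch of the Ai formula. The MTTF and MTTR values are hypothetical; both must be in the same unit (hours here).

```python
def inherent_availability(mttf_hours: float, mttr_hours: float) -> float:
    """Ai = MTTF / (MTTF + MTTR), with both times in the same unit."""
    if mttf_hours <= 0 or mttr_hours < 0:
        raise ValueError("MTTF must be positive and MTTR non-negative")
    return mttf_hours / (mttf_hours + mttr_hours)


# Hypothetical component: fails every 10 000 h on average, takes 10 h to replace.
print(f"Ai = {inherent_availability(10_000, 10):.4%}")
```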

11
Q

What is operational availability?

A

Ao = MTBM ÷ (MTBM + MDT)

MTBM is the mean time between maintenance; MDT is the mean downtime.

12
Q

How do we increase availability

A

Load sharing: sharing the workload across a number of computers.

Requests arriving from the internet go to the load share monitor, which distributes them to multiple nodes.
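
A minimal sketch of a load share monitor distributing requests across nodes. The round-robin policy, the class name, and the node names are assumptions for illustration; the card does not say which distribution policy is used.

```python
from itertools import cycle


class LoadShareMonitor:
    """Hypothetical monitor that hands each request to the next node in turn."""

    def __init__(self, nodes):
        if not nodes:
            raise ValueError("at least one node is required")
        self._nodes = cycle(nodes)  # round-robin is an assumed policy

    def dispatch(self, request):
        """Return the (node, request) pair chosen for this request."""
        return next(self._nodes), request


monitor = LoadShareMonitor(["node-a", "node-b", "node-c"])
for i in range(5):
    node, request = monitor.dispatch(f"req-{i}")
    print(request, "->", node)
```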

13
Q

What are the disadvantages of load sharing, and what are the solutions?

A
  • The monitor cannot track responses if a node fails
  • The monitor represents a single point of failure
  • Updates must be applied to each node independently
  • There is no guarantee that multiple requests from a client are directed to the same node

Solutions:
Incorporate cookies
Add a form of shared storage

14
Q

What is a heartbeat in load sharing?

A

A small message sent from each node to the monitor; if the monitor does not receive it, it assumes the node has failed.
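
The heartbeat idea can be sketched as follows. The class, the 3-second timeout, and the explicit timestamps are assumptions made for illustration; a real monitor would read the clock and receive heartbeats over the network.

```python
class HeartbeatMonitor:
    """Hypothetical monitor: a node is assumed failed if its heartbeat is overdue."""

    def __init__(self, timeout_seconds: float):
        self.timeout = timeout_seconds
        self.last_seen = {}  # node name -> time of last heartbeat

    def record_heartbeat(self, node: str, now: float) -> None:
        """Store the time at which a heartbeat from this node arrived."""
        self.last_seen[node] = now

    def failed_nodes(self, now: float) -> list:
        """Return nodes whose last heartbeat is older than the timeout."""
        return sorted(n for n, t in self.last_seen.items() if now - t > self.timeout)


monitor = HeartbeatMonitor(timeout_seconds=3.0)
monitor.record_heartbeat("node-a", now=0.0)
monitor.record_heartbeat("node-b", now=0.0)
monitor.record_heartbeat("node-a", now=2.0)  # node-b stays silent
print(monitor.failed_nodes(now=4.0))
```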

15
Q

What is clustering

A

A collection of independent computer nodes that appears to the user as a single logical server.
Its goal is to increase availability.

There are two forms: active-active (start a copy of the application) and active-passive (take over on failure).

16
Q

What is fault tolerance?

A

The system continues to operate in the event of a single hardware failure.

17
Q

What are the potential benefits of virtualisation?

A
  • Reduced cost of ownership, as two virtual servers can operate on one computer
  • Additional protection by executing untrusted applications on a guest OS (sandboxing)
  • Legacy systems: emulate older peripherals that are no longer manufactured
  • Quick installation
  • Quick recovery
18
Q

What is cloud computing

A

Combines high-availability hardware with virtual servers.

Infrastructure as a service: provides a business with a complete set of computers

Platform as a service: provides a business with a computer platform

Software as a service: provides a business with an entire application

Does not increase availability; it gives better utilisation of processor performance.

19
Q

What is disaster recovery?

A

Putting in place a plan that will enable a company to recover its IT systems from a disaster and to keep functioning during a disaster.

The plan has to describe:
what activities should be done
who should do the activities
when and in what sequence they should be done