Enterprise Computing Flashcards

(264 cards)

1
Q

Lecture 1 - Introduction

A
2
Q

Define the waterfall model

A

The waterfall model cascades the three fundamental activities
of the software development process so that they happen
sequentially:
Exploration -> Development -> Operation

3
Q

Why can the waterfall model be considered suboptimal?

A

Specifications often change, and a wrong interpretation of the problem may be discovered too late to correct once development is well underway. Modifying requirements late in the process causes significant rework.

4
Q

Define the Iterative/incremental Model

A

The iterative/incremental model is:
∙ iterative because the feed-forward between activities is augmented with feed-back between them, i.e. a waterfall with both forward and backward flow;

∙ incremental because the interleaved activities regularly deliver small additional pieces of functionality

5
Q

What advantage does iterative/incremental development have over waterfall?

A

The iterative model allows going back to revise earlier phases, so mistakes discovered later can still be corrected.

6
Q

Lecture 2 - Lean Cycle Evolution

A
7
Q

What is the purpose of the Lean Cycle in software production?

A

The Lean Cycle is used to apply the scientific method to software production. It emphasizes continuous feedback and learning through iterative cycles of Build-Measure-Learn or Learn-Measure-Build to adapt and improve products efficiently.

8
Q

What are the three main phases of the Lean Cycle?

A

The three main phases are:

Exploration: Identifying and testing hypotheses about the market.

Development: Building products or features based on validated hypotheses.

Operation: Delivering the product to customers and refining based on feedback.

9
Q

What happens in the “Learn” phase of the Build-Measure-Learn cycle?

A

In the Learn phase, an enterprise formulates hypotheses about the market and determines the empirical data required to validate these hypotheses.

10
Q

What is the focus of the “Measure” phase in the Build-Measure-Learn cycle?

A

The Measure phase involves testing the hypothesis by collecting empirical data, often through experiments or feedback from prototypes or early versions of the product.

11
Q

Describe the “Build” phase of the Build-Measure-Learn cycle.

A

In the Build phase, an enterprise creates a Minimum Viable Product (MVP) to test hypotheses. This MVP allows for quick iteration and feedback collection.

12
Q

What is an issue with the Build-Measure-Learn cycle?

A

Building first can incur significant costs; building without prior research into demand is risky.

13
Q

What is a better alternative to the Build-Measure-Learn cycle?

A

Reversing the cycle (Learn-Measure-Build) improves it: first learn about customer demand, then measure the market, and only then build the product.

14
Q

What is a “pivot” in the context of the Lean Cycle?

A

A pivot is a significant change in strategy without changing the vision. It occurs when empirical data suggests that the current approach isn’t working, leading to adjustments like technology changes or shifts in product focus.

15
Q

What are the five types of pivots mentioned in the Lean Cycle?

A

Technology Pivot: Switching to a more efficient technology.

Zoom-In Pivot: Turning a product feature into the main product.

Zoom-Out Pivot: Making a product part of a larger product suite.

Customer Segment Pivot: Targeting a different customer group.

Customer Need Pivot: Addressing a different but more critical problem.

16
Q

Provide an example of a technology pivot.

A

Microsoft shifted from selling standalone Office software to a subscription-based cloud service with Microsoft 365, improving value delivery.

17
Q

Explain a Zoom-In Pivot with an example.

A

A Zoom-In Pivot occurs when a product feature becomes the main product. Example: Flickr started as a multiplayer game but pivoted to focus on its photo-sharing feature, which gained popularity.

18
Q

What is a Zoom-Out Pivot? Provide an example

A

A Zoom-Out Pivot happens when a product becomes part of a larger offering. Example: DotCloud transitioned to manage Docker containers, focusing on application mobility across clouds.

19
Q

What is a Customer Segment Pivot?

A

It occurs when a product addresses a different customer group than initially intended. Example: YouTube started as a dating platform but shifted to a general video-sharing platform.

20
Q

Define a Customer Need Pivot with an example.

A

This pivot addresses a more critical customer problem. Example: Twitter evolved from a podcasting platform to a microblogging SMS-based social network after its initial model was rendered obsolete.

21
Q

What was the pivot that led to the success of Instagram?

A

Instagram started as a location-based app with multiple features. It pivoted to focus solely on photo sharing, simplifying user experience and achieving massive success.

22
Q

How did Netflix pivot to achieve its current model?

A

Netflix transitioned from a mail-order DVD rental service to a streaming platform, allowing instant access to films and TV shows, disrupting traditional rental models.

23
Q

What are the four types of MVPs mentioned?

A

Concierge MVP: Personalized service with customer awareness.

Wizard of Oz MVP: Simulated functionality without customer awareness.

Landing Page MVP: Testing interest via a promotional webpage.

Video MVP: Demonstrating a concept through a video.

24
Q

Describe a Concierge MVP and its use case.

A

A Concierge MVP involves hands-on interaction with customers to refine the product concept. It’s used when the solution hypothesis is unclear and customer feedback is critical.

25
What distinguishes a Wizard of Oz MVP?
In a Wizard of Oz MVP, the product’s functionality is manually simulated without the customer’s knowledge. It’s used to validate a clear solution hypothesis while minimizing development effort.
26
How does a Landing Page MVP test product ideas?
A Landing Page MVP uses a webpage to gauge interest in a product idea. Visitors can sign up or pledge support, providing data on demand before product development.
27
What was the MVP for Dropbox?
Dropbox used a Video MVP, creating a simple video to demonstrate its functionality. This validated interest and attracted early adopters without building a full product.
28
How did Airbnb use an MVP approach?
Airbnb’s MVP was a basic website showcasing their apartment space. They manually managed bookings, testing the concept of renting personal spaces to travelers.
29
Why is iterative learning essential in the Lean Cycle?
Iterative learning allows for continuous improvement by validating assumptions, minimizing waste, and adapting to market needs efficiently, ensuring the product aligns with customer demands.
30
Lecture 3
31
According to Eric Ries, what type of experiment is a startup?
A startup is a human experiment designed to create a new product or service under conditions of extreme uncertainty. The goal is to test hypotheses about customer needs and product-market fit.
32
According to Eric Ries, what is the biggest waste that product development faces today?
The biggest waste in product development is building products that nobody wants. This waste occurs due to a lack of understanding of customer needs before development begins.
33
What does Eric Ries describe as the universal constant of all successful startups?
Continuous learning is the universal constant of all successful startups. Startups must iteratively test and refine their ideas based on customer feedback and market demands, pivoting where necessary while staying grounded in what has already been learned.
34
Is agile development suitable for startups, according to Eric Ries?
Agile development is suitable for startups as it emphasizes iterative progress, flexibility, and adapting to changes, which aligns with the startup’s need to rapidly respond to feedback and refine their product.
35
What is a situation in which agile development is not suitable?
Agile is less suitable for safety-critical systems, where the product must be released in a state with no errors; releasing with errors can cause serious harm.
36
What is validated learning in the context of a startup?
Validated learning is the process of demonstrating empirically that a team has discovered valuable truths about a startup’s present and future business prospects. It uses metrics derived from customer behavior rather than opinions or assumptions.
37
What does Eric Ries suggest should be included in the first version of a product?
The first version of a product, or Minimum Viable Product (MVP), should include only the core functionalities required to test key assumptions about customer needs and product viability. The MVP is designed to gather feedback with minimal resources.
38
39
According to Ries, what is the most important thing for startups in terms of cycle time?
Reducing the cycle time, i.e. the time taken to get around the Build-Measure-Learn loop, is the most important thing.
40
According to Eric Ries, what should the heuristic be for any kind of startup advice?
The heuristic for startup advice is that it should be actionable, testable, and tied to specific customer or market contexts. Generic advice should be avoided in favor of insights that can drive specific experiments.
41
What are actionable metrics, and why are they important for startups?
Actionable metrics are specific, clear, and tied to decision-making. They enable startups to evaluate the effectiveness of their strategies and experiments. Unlike vanity metrics, actionable metrics directly inform whether the business is on the right path.
42
How does Lean Startup methodology reduce waste in development?
By focusing on building MVPs, testing hypotheses, and using validated learning, the Lean Startup methodology ensures resources are allocated to features and products that meet actual customer needs, reducing waste.
43
Why is customer feedback crucial in Lean methodology?
Customer feedback is crucial because it provides real-world insights into user needs, helping startups pivot or persevere based on evidence rather than assumptions.
44
What role does experimentation play in Lean Cycle Evolution? What does it enable?
Experimentation allows startups to test hypotheses about products, customers, and markets under real-world conditions, enabling informed decision-making and minimizing risks associated with uncertainty.
45
Lecture 4 - Monoliths
46
What is a monolithic system?
A monolithic system is a single program run by a single process, formed from a collection of modules that communicate via procedure calls.
47
Why is it easy to develop a monolithic system initially?
It is easy because all the code is in one language and one place, allowing the development team to utilize their existing programming skills, tools, and experience effectively.
48
What makes testing a monolithic system straightforward?
Testing is straightforward because the system is built as a single executable, enabling automatic testing with a suite of tests and easy debugging when issues arise.
49
How is scaling achieved in a monolithic system?
Scaling in a monolithic system is done through vertical scaling, which involves upgrading to a more powerful machine to run the system better.
50
What are the benefits of using a centralized database in a monolithic system?
Centralized databases ensure data accuracy, completeness, and consistency, which simplifies maintenance and management.
51
What are the two possible outcomes of a database transaction?
A transaction can either:
Commit - complete successfully, moving the database to a new consistent state.
Abort - complete unsuccessfully, restoring the database to its previous consistent state.
52
What does the acronym ACID stand for in the context of database transactions?
ACID stands for:
Atomicity
Consistency
Isolation
Durability
53
What is the atomicity property of a transaction?
Atomicity ensures that transactions either fully succeed or fully fail, even in the presence of system failures.
54
What is the consistency property of a transaction?
Consistency ensures that a transaction takes the database from one consistent state to another.
55
What does the isolation property guarantee in database transactions?
Isolation ensures that the effects of concurrent transactions are the same as if the transactions were performed sequentially.
56
What does the durability property guarantee in database transactions?
Durability guarantees that once a transaction is successful, its effects persist, even in the event of system failures.
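The commit/abort behaviour described in the cards above can be sketched with SQLite's transaction support. This is a minimal illustration, not course material; the table and values are invented:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT, balance INTEGER)")
conn.execute("INSERT INTO accounts VALUES ('alice', 100), ('bob', 0)")
conn.commit()

# A transfer as one atomic transaction: both updates commit, or neither does.
try:
    with conn:  # opens a transaction; commits on success, rolls back on error
        conn.execute("UPDATE accounts SET balance = balance - 30 WHERE name = 'alice'")
        conn.execute("UPDATE accounts SET balance = balance + 30 WHERE name = 'bob'")
        raise RuntimeError("simulated failure")  # forces an abort
except RuntimeError:
    pass

# The abort restored the previous consistent state (atomicity + consistency).
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(balances)  # {'alice': 100, 'bob': 0}
```

Removing the simulated failure would let the `with` block commit, and durability would then guarantee the new balances survive a crash.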
57
How does Fred Brooks’ observation to "plan to throw one away" apply to monolithic MVP development?
This observation suggests that teams should anticipate that their first version of a monolithic MVP might need to be discarded and rebuilt to incorporate lessons learned during its development.
58
What is Gall’s Law, and how does it relate to monolithic MVPs?
Gall’s Law states that a complex system that works evolves from a simple system that worked. For monolithic MVPs, it implies starting with a simpler design that can be refined over time.
59
What does the "You Aren’t Gonna Need It" (YAGNI) principle advocate in Extreme Programming?
YAGNI advises against implementing features until they are actually needed, emphasizing simplicity in monolithic MVPs to avoid unnecessary complexity.
60
What is Conway’s Law and how does it apply to monolithic MVP development?
Conway’s Law states that a system’s design mirrors the structure of the organization that created it. For monolithic MVPs, this means the design will reflect the communication structure of the team.
61
Why might it be better to reverse Conway's Law?
Structuring the organization so that there is a team for each desired module makes the organization and the software architecture congruent.
62
What does the phrase "eating your own dogfood" mean in the context of monolithic MVPs?
"Eating your own dogfood" means that development teams should use their own product to identify and address issues, ensuring its quality and usability.
63
What are some advantages of monolithic systems?
Advantages include simplicity in development and testing, centralized data management, and easier debugging due to having all code in one place.
64
What are the primary limitations of monolithic systems?
Limitations include difficulty scaling horizontally, potential for large and complex codebases, and challenges in adapting to changes or integrating new technologies.
65
What is vertical scaling, and why is it a common approach in monolithic systems?
Vertical scaling involves upgrading the hardware of a single machine to improve performance. It’s common in monolithic systems because they run as a single process that benefits from more powerful hardware.
66
Lecture 6 - Microservices
67
What is a microservices system?
A microservices system consists of multiple programs running as independent processes that communicate by sending messages over a network.
68
What is Representational State Transfer (REST)?
REST is a conventional way of using the HyperText Transfer Protocol (HTTP); microservices use it to communicate and to expose resources.
69
What are the different types of resources in RESTful microservices?
Document - File-like resources managed using GET (read), PUT (update), and DELETE (delete).
Controller - External resources that execute tasks using POST.
Collection - Directory-like resources where GET lists items and POST creates a new one with an invented name.
Store - Similar to a collection, but PUT creates new resources with a given name.
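The four resource styles can be sketched as a toy in-memory dispatcher. This is only an illustration of the verb semantics; the function and resource names are invented, and a real service would sit behind an HTTP server:

```python
import itertools

docs = {}                 # backing state for documents, stores, collections
ids = itertools.count(1)  # server-side name generator for collections

def handle(method, kind, name=None, body=None):
    if kind == "collection":
        if method == "GET":            # GET lists the items
            return sorted(docs)
        if method == "POST":           # POST creates with an invented name
            new = f"item{next(ids)}"
            docs[new] = body
            return new
    if kind in ("document", "store"):
        if method == "PUT":            # store: client supplies the name
            docs[name] = body
            return name
        if method == "GET":            # document: read
            return docs[name]
        if method == "DELETE":         # document: delete
            return docs.pop(name)
    if kind == "controller" and method == "POST":
        return f"executed {name}"      # controller executes a task

print(handle("POST", "collection", body={"x": 1}))   # item1
print(handle("PUT", "store", "config", {"y": 2}))    # config
print(handle("GET", "collection"))                   # ['config', 'item1']
```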
70
What is a distributed database in microservices?
A distributed database is a system where data is stored across multiple databases, each accessed by individual microservices. Ensuring accuracy, completeness, and consistency in such a setup is challenging.
71
What are the two main approaches to managing distributed transactions?
Two-phase commit
Sagas
72
How does a two-phase commit work?
The coordinator asks each participant to vote on committing a change. If all vote to commit, the change is committed; otherwise, all participants abort. Each participant holds locks on its data until the decision is made. If a participant does not vote within a certain timeframe, the vote times out and is counted as negative.
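The vote-then-decide protocol above can be sketched as follows. This is a minimal single-process illustration; the class and method names are invented, and a real implementation would deal with network messages, persistence, and locking:

```python
# Hedged sketch of a two-phase commit coordinator.
def two_phase_commit(participants):
    # Phase 1: collect votes; an exception (e.g. a timeout) counts as "no".
    votes = []
    for p in participants:
        try:
            votes.append(p.prepare())  # participant locks its data and votes
        except Exception:
            votes.append(False)
    # Phase 2: commit only if every vote was "yes", otherwise abort everywhere.
    decision = all(votes)
    for p in participants:
        p.commit() if decision else p.abort()
    return decision

class Participant:
    def __init__(self, vote=True):
        self.vote, self.state = vote, "idle"
    def prepare(self):
        return self.vote
    def commit(self):
        self.state = "committed"
    def abort(self):
        self.state = "aborted"

a, b = Participant(), Participant(vote=False)    # b votes no
print(two_phase_commit([a, b]), a.state, b.state)  # False aborted aborted
```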
73
How does the saga pattern manage distributed transactions?
A saga executes a sequence of transactions, committing or aborting each individually. If a failure occurs, compensating transactions undo previous commits. This approach sacrifices atomicity and relies on eventual consistency.
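The saga described above can be sketched as a list of (action, compensation) pairs. This is an invented illustration of the pattern, not a production implementation; real sagas would persist progress and run across services:

```python
# Hedged sketch of the saga pattern: run local transactions in order and,
# on failure, run compensating transactions for completed steps in reverse.
log = []

def reserve_stock():  log.append("reserve stock")
def release_stock():  log.append("release stock")
def charge_card():    log.append("charge card")
def refund_card():    log.append("refund card")
def ship_order():     raise RuntimeError("shipping failed")

def run_saga(steps):
    compensations = []
    for action, compensate in steps:
        try:
            action()
            compensations.append(compensate)
        except Exception:
            for undo in reversed(compensations):
                undo()  # undo what already committed (eventual consistency)
            return False
    return True

ok = run_saga([(reserve_stock, release_stock),
               (charge_card, refund_card),
               (ship_order, lambda: None)])
print(ok, log)
# False ['reserve stock', 'charge card', 'refund card', 'release stock']
```

Note that atomicity is sacrificed: the intermediate states were visible before the compensations ran.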
74
Why do we want to use Sagas?
In a two-phase commit, services must hold locks until the coordinator decides; if one service crashes, everyone may be stuck waiting. Sagas avoid this:
Non-blocking - no locks or central coordination; each service finishes its own task independently.
Fault tolerant - easier to recover from failures.
Scales well - ideal for microservices and cloud-native applications.
Better for long-running operations - resources are not held hostage.
75
What is horizontal scaling in microservices?
Horizontal scaling involves adding more machines, each capable of running multiple microservice instances, to improve system scalability. The number of machines allocated to one microservice need not match the number allocated to another, so capacity can follow demand.
76
What is the reliability challenge in microservices?
Since microservices communicate over a network, failures in one service can impact others. Strategies like retries, circuit breakers, and redundancy are used to improve reliability.
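The circuit-breaker strategy mentioned above can be sketched as a small wrapper. This is an invented minimal illustration (threshold and names are not from the lectures); libraries such as resilience4j or Hystrix implement the full pattern:

```python
# Hedged sketch of a circuit breaker: after `threshold` consecutive
# failures the breaker "opens" and fails fast without calling the service.
class CircuitBreaker:
    def __init__(self, threshold=3):
        self.threshold = threshold
        self.failures = 0
    def call(self, fn):
        if self.failures >= self.threshold:
            raise RuntimeError("circuit open: failing fast")
        try:
            result = fn()
        except Exception:
            self.failures += 1
            raise
        self.failures = 0  # any success closes the circuit again
        return result

def flaky():
    raise TimeoutError("downstream service unreachable")

breaker = CircuitBreaker(threshold=2)
outcomes = []
for _ in range(4):
    try:
        breaker.call(flaky)
    except RuntimeError:
        outcomes.append("fast-fail")   # breaker refused to call the service
    except TimeoutError:
        outcomes.append("timeout")     # real call failed, counted by breaker
print(outcomes)  # ['timeout', 'timeout', 'fast-fail', 'fast-fail']
```

After the threshold is hit, callers get an immediate error instead of waiting on a dead dependency, which stops failures cascading.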
77
What is the strangler design pattern?
The strangler pattern is a gradual migration approach where monolithic modules are placed behind a facade and replaced one-by-one with microservices, updating the facade as changes are made.
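The facade in the strangler pattern can be sketched as a routing table that is updated as modules migrate. The route and handler names here are invented for illustration:

```python
# Hedged sketch of a strangler facade: migrated routes go to the new
# microservice; everything else still goes to the monolith.
def monolith(path):
    return f"monolith handled {path}"

def orders_service(path):
    return f"orders microservice handled {path}"

MIGRATED = {"/orders": orders_service}  # grows as modules are strangled

def facade(path):
    handler = MIGRATED.get(path, monolith)
    return handler(path)

print(facade("/orders"))   # orders microservice handled /orders
print(facade("/billing"))  # monolith handled /billing
```

Once every route points at a microservice, the monolith behind the facade can be retired.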
78
What is the second-system effect, and how does it relate to microservices migration?
The second-system effect, identified by Fred Brooks, suggests that engineers tend to overcomplicate their second system. In microservices migration, teams must avoid unnecessary complexity when breaking apart a monolith.
79
What is Jeff Bezos’ two-pizza rule, and how does it apply to microservices?
Jeff Bezos suggests that teams should be small enough to be fed with two pizzas. In microservices, small, independent teams are ideal for maintaining and developing individual services efficiently.
80
What is the "Big Ball of Mud" problem in software architecture?
The "Big Ball of Mud" describes an unstructured, poorly designed software system. Microservices help avoid this by enforcing modular design principles.
81
How does Ward Cunningham’s technical debt concept apply to microservices?
Technical debt refers to the cost of shortcuts in development. Poorly designed microservices architectures can accumulate technical debt, requiring costly future refactoring.
82
What is the CAP theorem, and how does it apply to microservices?
The CAP theorem states that distributed systems can only provide two of three guarantees: Consistency, Availability, and Partition Tolerance. Microservices architectures must choose trade-offs based on system needs.
83
What is the role of API gateways in microservices?
API gateways act as intermediaries between clients and microservices, handling request routing, security, rate limiting, and load balancing.
84
What are some common tools used to manage microservices architectures?
Popular tools include Kubernetes (orchestration), Docker (containerization), Consul (service discovery), and Istio (service mesh).
85
What is event-driven architecture in microservices?
Event-driven architecture uses events to trigger and communicate between microservices asynchronously, improving decoupling and scalability.
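The decoupling idea can be sketched with an in-process event bus. Topic and handler names are invented; a real deployment would use a broker such as Kafka or RabbitMQ, but the publish/subscribe shape is the same:

```python
# Hedged sketch of event-driven communication between services.
from collections import defaultdict

subscribers = defaultdict(list)

def subscribe(topic, handler):
    subscribers[topic].append(handler)

def publish(topic, event):
    # The publisher does not know who consumes the event: decoupling.
    for handler in subscribers[topic]:
        handler(event)

seen = []
subscribe("order.created", lambda e: seen.append(("email", e)))  # email service
subscribe("order.created", lambda e: seen.append(("stock", e)))  # stock service
publish("order.created", {"id": 42})
print(seen)  # [('email', {'id': 42}), ('stock', {'id': 42})]
```

New consumers can be added without touching the publisher, which is what improves decoupling and scalability.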
86
What are some key advantages of microservices?
Scalability
Improved fault isolation
Flexibility in using different technologies
Faster development and deployment cycles
87
What are some disadvantages of microservices?
Increased complexity
Network latency issues
Distributed data management challenges
Higher infrastructure costs
88
Lecture 7
89
What are the nine common characteristics of microservices according to Martin Fowler?
The nine common characteristics of microservices according to Martin Fowler are:
Componentization via Services - Microservices are independently deployable services.
Organized Around Business Capabilities - Teams are structured around business functions.
Products Not Projects - Microservices focus on long-lived products, not temporary projects.
Smart Endpoints and Dumb Pipes - Business logic is in the services, not the communication mechanism.
Decentralized Governance - Different teams can use different technologies.
Decentralized Data Management - Each service manages its own database.
Infrastructure Automation - Deployment and monitoring are automated.
Design for Failure - Systems assume failures and handle them gracefully (e.g. Netflix's Chaos Monkey).
Evolutionary Design - Services can be updated or replaced independently.
90
What is a component according to Martin Fowler?
A component is a unit of software that can be replaced or upgraded independently. In microservices, a component is defined by its behavior and exposed via an API, making it easier to manage and scale.
91
Why should one organize around business capabilities in microservices?
Organizing around business capabilities ensures that teams focus on delivering value rather than being restricted by technology stacks. It enables better ownership, faster delivery, and clearer responsibilities. Each microservice aligns with a distinct business function.
92
Should endpoints be smart or dumb in microservices?
Endpoints should be smart, while the communication between them should be simple (dumb pipes). This means that microservices encapsulate business logic within the service, whereas the network simply routes data without complex logic.
93
What is the rule for microservice data management?
Each microservice should have its own dedicated database and not share it with other services. This ensures loose coupling and independence, allowing services to evolve independently.
94
What do you have to assume in any distributed system?
You must not assume the "Fallacies of Distributed Computing", i.e. that:
The network is reliable.
Latency is zero.
Bandwidth is infinite.
The network is secure.
The topology doesn't change.
There is one administrator.
Transport cost is zero.
The network is homogeneous.
Recognizing that these assumptions are false helps design resilient and fault-tolerant microservices.
95
How big is a microservice?
There is no fixed size, but a microservice should be small enough to be developed and managed by a small team (typically 2-5 developers) and should perform a single business function well.
96
What things must be sorted out before adopting microservices?
Before transitioning to microservices, teams must consider:
Deployment automation - Microservices require CI/CD pipelines.
Monitoring and logging - Observability is critical.
Service discovery - Dynamic service registration is needed.
Fault tolerance - Handling failures must be a priority.
Data consistency - Distributed databases need careful management.
Organizational readiness - Teams must be capable of handling service independence.
97
Lecture 9
98
The two common repository models are:
Monorepo - A single large repository that contains all microservices. Any commit triggers the production of multiple microservices.
Multirepo - A separate repository for each service. Any commit only affects a single service.
99
What are the advantages and disadvantages of using a Monorepo?
Advantages:
Simplifies dependency management.
Centralized codebase for better consistency.
Easier refactoring across services.
Disadvantages:
Can become slow and difficult to manage at scale.
Requires robust tooling to handle changes efficiently.
Can cause bottlenecks if too many teams work on the same repository.
100
What are the advantages and disadvantages of using a Multirepo?
Advantages:
Allows independent development and deployment of services.
Teams have full control over their own repositories.
Reduces risk of large-scale merge conflicts.
Disadvantages:
Harder to coordinate cross-service changes.
Dependency management can be more complex.
May lead to duplication of code across repositories.
101
What are the two common branching models?
Feature-Based Development - Developers create long-lived feature branches that may last for weeks or months before merging into the main branch.
Trunk-Based Development - Developers work primarily on a single main branch, with short-lived feature branches that are merged back within minutes or hours.
102
What are the advantages and disadvantages of Feature-Based Development?
Advantages:
Allows isolated development of new features.
Provides stability by keeping unfinished code out of the main branch.
Disadvantages:
Merging long-lived branches can be complex and lead to conflicts.
Delays in integration may cause unexpected failures when merging.
103
What are the advantages and disadvantages of Trunk-Based Development?
Advantages:
Encourages continuous integration.
Reduces merge conflicts by keeping branches short-lived.
Faster feedback and fewer integration problems.
Disadvantages:
Requires disciplined development practices.
Can be difficult for large teams to coordinate without proper tooling.
104
What are the essential practices of version control?
The essential practices of version control include:
Run commit tests locally.
Wait for commit tests to complete before proceeding.
Avoid committing on a broken build.
Never leave work with a broken build.
Be prepared to revert changes if needed.
Avoid commenting out failing tests.
Take responsibility for fixing breakages.
105
Why is it important to run commit tests locally?
Running commit tests locally ensures that the developer's changes do not introduce failures before pushing them to the repository. This prevents unnecessary build failures and broken tests in shared branches.
106
Why should developers wait for commit tests to complete?
Developers should wait for commit tests to complete because:
It ensures that the build remains stable.
Developers can quickly fix failures instead of delaying corrections.
107
Why should developers avoid committing on a broken build?
Committing on a broken build:
Makes debugging more difficult.
Causes further build failures and wastes time.
Leads to a culture where broken builds become common and unresolved.
108
Why should developers never leave work with a broken build?
Developers should never leave a broken build because:
It delays fixes and impacts the entire team.
Developers may forget details of the change, making debugging harder.
Experienced developers commit changes at least an hour before leaving to ensure stability.
109
Why should developers be prepared to revert changes?
Developers should be prepared to revert changes because:
Quick reverts keep the project in a working state.
If a fix takes too long (e.g., over 10 minutes), reverting prevents prolonged issues.
110
Why should developers avoid commenting out tests?
Commenting out failing tests leads to lower code quality. Instead, developers should:
Fix the code if it fails.
Modify the test if assumptions change.
Delete the test if the functionality no longer exists.
111
Why should developers take responsibility for breakages?
Taking responsibility for breakages ensures that:
The codebase remains stable.
Developers collaborate to resolve issues quickly.
No single person is left fixing issues they did not introduce.
112
Lecture 10
113
What are the three benefits of a version control system according to Farley?
Step back to safety – Enables rolling back to previous versions in case of issues.
Share changes easily – Facilitates collaboration by allowing multiple contributors to work on the same project.
Store changes safely – Ensures that all changes are securely saved and can be retrieved when needed.
114
What are the three models of version control according to Farley?
Mono-repo – Stores everything in a single large repository.
Multi-repo – Each independent component has its own repository.
Multi-repo' – Stores interdependent components in separate repositories.
115
Why does a mono-repo provide the three benefits of a version control system?
A mono-repo supports these benefits because:
Step back to safety – Rolling back changes affects all components together.
Share changes easily – Any component can be updated in a centralized manner.
Store changes safely – Everything, including dependencies, is stored in one place.
116
Why might a multi-repo not provide the three benefits of a version control system?
A multi-repo can struggle with these benefits because:
Step back to safety – No centralized rollback mechanism for all components.
Share changes easily – Difficult to coordinate updates across repositories.
Store changes safely – Versioning relationships between components are not inherently stored.
117
What are two solutions to the multi-repo problem according to Farley?
Fixed, well-understood APIs – Components interact through stable interfaces.
Flexible, backward/forward-compatible APIs – Ensures components work together despite version differences.
118
Why do Farley's solutions to the multi-repo problem restore the three benefits of a version control system?
Step back to safety – Individual components can be rolled back independently.
Share changes easily – Updates can be coordinated through APIs (though this remains complex).
Store changes safely – Components are versioned separately but stored reliably.
119
Why is the "multi-repo'" model (interdependent components in separate repositories) considered the worst of all worlds?
Because in the multi-repo' model:
Components cannot be developed independently.
Components cannot be deployed independently.
In a monorepo, you can make a single commit that changes multiple projects at once. In a multi-repo, if your service depends on changes in another repository, you need to:
Make the change in repo A.
Wait for it to be merged and released.
Update repo B to use the new version.
This causes coordination overhead and risks version mismatches, forcing you to treat changes across services as if you were deploying to Mars.
120
Lecture 11 - Continuous Integration
121
What is Continuous Integration (CI)?
Continuous Integration (CI) is the practice of quickly integrating newly developed code with the rest of the application code. This process is usually automated and results in a build artifact at the end. The goal of CI is to detect errors early and streamline the deployment process.
122
What are the key benefits of Continuous Integration?
Key benefits of CI include: Early bug detection through automated testing. Faster development cycles and releases. Reduced integration problems. Improved collaboration among development teams. Higher code quality through continuous testing.
123
What are the four traditional product delivery releases?
The four traditional product delivery releases are: Alpha Release - An early version for internal testing. Beta Release - A more stable version for external user feedback. Release Candidate - A near-final version tested for last-minute issues. Final Release - The official version available to customers.
124
What are the three modern feature delivery environments?
The three modern feature delivery environments are: Development Environment - Where individual teams integrate their work, updated throughout a sprint. Staging Environment - A near-production environment where multiple teams integrate their work, updated at the end of a sprint. Production Environment - The live system where software is deployed for customer use, updated based on business needs.
125
What is Shift Left Testing, and how does it apply to CI?
Shift Left Testing refers to the practice of moving testing earlier in the software development cycle. It ensures that testing is done frequently and early, reducing defects and improving code quality. In CI, Shift Left Testing is crucial as it enables continuous feedback, helping to identify and fix issues before deployment.
126
What is the Test Pyramid, and what are its levels?
The Test Pyramid is a model that categorizes different levels of testing: Unit Tests - Test individual functions or components, performed in milliseconds. Service Tests - Test interactions between services, performed in minutes. End-to-End Tests - Test the entire application workflow, mimicking user interaction, performed in several minutes.
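As a sketch of the pyramid's base, a unit test exercises a single function in isolation, with no network or database, so it runs in milliseconds (the `add_vat` function here is purely illustrative):

```python
# Hypothetical unit test for the base of the Test Pyramid: one function,
# no external dependencies, millisecond execution.
def add_vat(net_price: float, rate: float = 0.2) -> float:
    """Return the gross price after adding VAT at the given rate."""
    return round(net_price * (1 + rate), 2)

def test_add_vat():
    assert add_vat(100.0) == 120.0           # default 20% rate
    assert add_vat(50.0, rate=0.05) == 52.5  # reduced rate

test_add_vat()
print("unit tests passed")
```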
127
What is the Test Snow Cone, and why is it an anti-pattern?
The Test Snow Cone is an anti-pattern where more end-to-end tests exist than unit or service tests. This leads to slow test execution and longer feedback cycles. CI best practices encourage more unit and service tests over end-to-end tests to ensure efficiency.
128
What are Brittle Tests in Continuous Integration?
Brittle Tests are tests that fail because another dependent service fails, even if the functionality being tested is correct. They can cause false negatives, making debugging difficult.
129
What are Flaky Tests, and why are they problematic?
Flaky Tests sometimes fail due to non-deterministic issues such as timeouts or race conditions. They create unreliable feedback and reduce confidence in test automation.
130
What is the Normalization of Deviance, and how does it affect CI?
Normalization of Deviance is a concept where teams gradually accept small failures as normal, leading to degraded quality over time. In CI, failing tests must be addressed immediately to prevent this mindset and ensure reliable software.
131
What are Build Light Indicators, and how are they used in CI?
Build Light Indicators visually represent the status of CI builds. A green light means the build is successful, while a red light indicates a failure. Some teams use lava lamps or monitor screens to display build statuses.
132
What role does automation play in Continuous Integration?
Automation is central to CI as it enables frequent builds, automated testing, and fast feedback. It ensures that code changes do not introduce new errors and maintains software quality at scale.
133
Lecture 12 - Continuous Integration 2
134
What is integration hell in software development?
Integration hell is an anti-pattern in software development where different parts of a software system are integrated too late, leading to complex and time-consuming conflicts.
135
Why should commit tests be run locally before pushing changes (Rule 1)?
Running commit tests locally ensures that the deployment pipeline remains a valuable shared resource that is not blocked by unnecessary test failures.
136
Why should developers wait for test results before moving on (Rule 2)?
Developers should wait for test results to be available so they can immediately fix any issues, ensuring smooth progress.
137
Why must failures be fixed or reverted within 10 minutes (Rule 3)?
Fixing or reverting failures within 10 minutes prevents blocking progress for others and maintains development velocity.
138
What should happen if a teammate breaks the integration rules (Rule 4)?
If a teammate breaks the rules, their changes should be reverted to prevent them from blocking progress.
139
Why is it considered a "build sin" if someone else notices your failure first (Rule 5)?
If someone else notices a failure before you do, it indicates a lack of attentiveness and encourages developers to monitor their changes more closely.
140
What should a developer do once their commit passes (Rule 6)?
Once a commit passes, a developer should move on to their next task, as automated testing ensures that their changes are stable.
141
Who is responsible for fixing a failing test (Rule 7)?
The committer is responsible for fixing a failing test to ensure accountability in the development process.
142
What is the rule about responsibility when multiple people may be responsible for a failure (Rule 8)?
Everyone who may be responsible should agree on who will fix the failure, ensuring that accountability is maintained.
143
Why should developers monitor the progress of their changes (Rule 9)?
Monitoring changes ensures that any issue is detected early, preventing unfit software from being released.
144
Why should any pipeline failure be addressed immediately (Rule 10)?
Immediate attention to pipeline failures ensures that the pipeline remains clear for other changes, maintaining continuous integration efficiency.
145
Lecture 13 - Continuous Delivery
146
What is Continuous Delivery (CD)?
Continuous Delivery (CD) is a software engineering practice where software is automatically moved from a source code repository to a staging environment. At the press of a "release" button, it can be deployed to the production environment for customer use.
147
How does Continuous Deployment differ from Continuous Delivery?
Continuous Deployment (CD) goes a step beyond Continuous Delivery by automatically deploying software to the production environment without manual intervention, making new features immediately available to customers.
148
What are the key principles of Continuous Delivery?
The key principles of Continuous Delivery include: Create a repeatable process Automate almost everything Version control for everything If it hurts, do it more frequently Build quality in Done means released Everyone is responsible Continuous improvement
149
Why is creating a repeatable process important in Continuous Delivery?
A repeatable process ensures consistency, reduces errors, and allows teams to become more efficient. A well-practiced process becomes routine and reliable.
150
Why is automation emphasized in Continuous Delivery?
Automation ensures accuracy, consistency, and efficiency. Manual processes introduce human error and inefficiencies, whereas automation standardizes execution and reduces risks.
151
What is the significance of version control in Continuous Delivery?
Version control allows any team member to build any version of the application on demand. It ensures traceability, facilitates rollback if needed, and supports collaboration.
152
Why should painful processes be done more frequently in Continuous Delivery?
Performing painful processes frequently helps improve efficiency, identify bottlenecks, and streamline workflows. Regular practice leads to familiarity and process refinement.
153
What does "Build Quality In" mean in Continuous Delivery?
This principle emphasizes fixing defects as soon as they are found. Early detection and resolution of defects are cost-effective and ensure higher software quality.
154
What does "Done Means Released" signify in Continuous Delivery?
A feature is only considered "done" once it has been deployed to a production-like environment. This avoids ambiguity in development progress and ensures accountability.
155
Why is team responsibility emphasized in Continuous Delivery?
Continuous Delivery requires collaboration across teams. When everyone is responsible, issues are resolved collectively rather than leading to blame culture and inefficiencies.
156
What is the role of continuous improvement in Continuous Delivery?
Continuous improvement encourages teams to reflect on successes and failures, leading to process optimizations and better software delivery over time.
157
What are the three types of testing in production?
The three types of testing in production are: A/B Testing, Canary Testing, and Blue/Green Testing.
158
What is A/B Testing in production?
A/B Testing involves directing a small percentage of user traffic to a new interface in production. If users respond negatively, traffic is reverted to the old interface.
159
How does Canary Testing work?
Canary Testing directs a small percentage of traffic to a new version of the software. If issues arise, traffic is rolled back to the previous version.
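The routing decision behind a canary can be sketched as a simple probabilistic split; the 5% fraction here is illustrative, and real systems typically route at the load balancer:

```python
import random

# Minimal sketch of canary routing: a small, configurable fraction of
# requests is sent to the new version; the rest stay on the stable one.
CANARY_FRACTION = 0.05  # 5% of traffic

def route_request(rng: random.Random) -> str:
    """Return which version should serve this request."""
    return "canary" if rng.random() < CANARY_FRACTION else "stable"

# With a fixed seed, roughly 5% of 10,000 requests hit the canary.
rng = random.Random(42)
counts = {"stable": 0, "canary": 0}
for _ in range(10_000):
    counts[route_request(rng)] += 1
print(counts)
```

If error rates on the canary rise, `CANARY_FRACTION` is dropped back to zero, which is the rollback described above.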
160
What is Blue/Green Testing?
Blue/Green Testing swaps the production (blue) and staging (green) environments. If the new version performs well, the switch is made permanent; otherwise, it is reversed.
161
Lecture 14 - Continuous Delivery 2
162
According to Jez Humble, how can we achieve continuous delivery?
Continuous delivery is achieved through fast, automated feedback on the production readiness of applications every time there is a change — to code, infrastructure, or configuration.
163
What condition should software always be in, according to Humble?
Software should always be in a production-ready or releasable state.
164
How does continuous delivery help to avoid the biggest source of waste in the software development process?
Continuous delivery helps avoid waste by making it easier to deploy new, experimental features into production quickly and efficiently, reducing delays and unnecessary rework.
165
When should testing be done in continuous delivery?
Testing should be done continuously throughout the development process, not just at the end.
166
Who is responsible for software quality in continuous delivery?
Everyone involved in the development process is responsible for quality, not just a dedicated QA team.
167
What is considered more important than delivering functionality, according to Humble?
Keeping the system in a working and stable state is more important than delivering new functionality.
168
How does continuous delivery reduce the risk of releases?
Continuous delivery reduces risk by enabling small, extensively tested changes to be released frequently, and by making reversion easy in case of issues.
169
What role does automation play in continuous delivery?
Automation is critical for providing fast feedback, reducing human error, and ensuring that every change can be safely and quickly deployed.
170
Why is continuous integration important in the context of continuous delivery?
Continuous integration ensures that changes are merged and tested frequently, reducing integration problems and allowing for quicker releases.
171
What are some common tools used in continuous delivery pipelines?
Common tools include Jenkins, GitHub Actions, GitLab CI/CD, CircleCI, and Travis CI for automation, along with Docker and Kubernetes for containerization and deployment.
172
What are the benefits of frequent, smaller releases in continuous delivery?
Frequent, smaller releases reduce risk, improve feedback loops, enable faster value delivery, and make it easier to pinpoint issues when they arise.
173
How does continuous delivery improve collaboration between development and operations teams?
Continuous delivery encourages DevOps practices, breaking down silos and promoting shared responsibility for deployment, monitoring, and system reliability.
174
What are some challenges organizations face when adopting continuous delivery?
Challenges include cultural resistance to change, legacy system constraints, lack of automation, and the need for robust testing strategies.
175
How does continuous delivery support business agility?
Continuous delivery enables businesses to respond quickly to market changes, customer feedback, and new opportunities by streamlining the software release process.
176
What is the relationship between continuous delivery and DevOps?
Continuous delivery is a key practice within DevOps, aiming to integrate development and operations for seamless, automated software releases.
177
How does monitoring play a role in continuous delivery?
Monitoring provides real-time feedback on system performance, helping teams detect and resolve issues quickly to maintain system reliability.
178
Why is rollback capability important in continuous delivery?
Rollback capability ensures that if an issue arises in production, teams can quickly revert to a previous stable version, minimizing downtime and impact.
179
How does feature flagging complement continuous delivery?
Feature flagging allows teams to deploy changes without exposing them to all users, enabling controlled testing and gradual rollouts.
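A percentage rollout flag can be sketched like this (the flag name, config shape, and helper are hypothetical, not a real library's API); hashing the user ID makes the rollout deterministic per user:

```python
import hashlib

# Illustrative feature-flag store: a flag gates a code path so it can be
# deployed "dark" and enabled for a growing percentage of users.
FLAGS = {"new_checkout": {"enabled": True, "rollout_percent": 10}}

def is_enabled(flag: str, user_id: str) -> bool:
    """Deterministically bucket a user into a percentage rollout."""
    cfg = FLAGS.get(flag)
    if not cfg or not cfg["enabled"]:
        return False
    # Hash the user id so the same user always gets the same answer.
    bucket = int(hashlib.sha256(user_id.encode()).hexdigest(), 16) % 100
    return bucket < cfg["rollout_percent"]

print(is_enabled("new_checkout", "user-123"))
print(is_enabled("unknown_flag", "user-123"))  # unknown flags are off
```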
180
What is the difference between continuous delivery and continuous deployment?
Continuous delivery ensures software is always ready for release, while continuous deployment automatically releases every successful change into production without manual intervention.
181
How can organizations measure the success of continuous delivery?
Success can be measured using metrics such as deployment frequency, lead time for changes, mean time to recovery (MTTR), and change failure rate.
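Two of these metrics can be computed directly from deployment records; the data below is made up for illustration:

```python
# Sketch: change failure rate and MTTR from (fabricated) deployment records.
deployments = [
    {"failed": False, "recovery_minutes": 0},
    {"failed": True,  "recovery_minutes": 30},
    {"failed": False, "recovery_minutes": 0},
    {"failed": True,  "recovery_minutes": 10},
]

failures = [d for d in deployments if d["failed"]]
change_failure_rate = len(failures) / len(deployments)
mttr = sum(d["recovery_minutes"] for d in failures) / len(failures)

print(f"change failure rate: {change_failure_rate:.0%}")
print(f"MTTR: {mttr:.0f} minutes")
```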
182
Lecture 15 - Cloud Computing
183
What is cloud computing?
Cloud computing is the delivery of computing services—including servers, storage, databases, networking, software, and more—over the internet ("the cloud"). It allows users to rent rather than own IT infrastructure, offering scalability, flexibility, and cost savings.
184
Why is cloud computing compared to electricity utilities?
Just as electricity utilities provide power from centralized plants, cloud computing providers supply computing resources from centralized data centers. This model allows for economies of scale and expertise that individual users cannot achieve on their own.
185
What is a staging environment in cloud computing?
A staging environment is a pre-production environment where software is tested in conditions similar to production. It allows stakeholders to validate the software before deployment. Many enterprises rent their staging environments in the cloud rather than maintaining physical infrastructure.
186
Why has cloud computing become successful?
Cloud computing has become successful because of five key characteristics: Broad network access (availability over standard networks, including VPNs). On-demand, self-service (users can provision resources as needed). Measured service (pay-per-use billing model). Rapid elasticity (scaling resources up or down as needed). Resource pooling (multi-tenant models for efficiency).
187
What are the key characteristics of broad network access in cloud computing?
Broad network access means cloud services are available over standard network technologies, including the internet and VPNs, ensuring accessibility from various devices and locations.
188
What does on-demand, self-service mean in cloud computing?
On-demand, self-service means customers can provision computing resources automatically without human intervention, typically through a web interface or API.
189
What is meant by measured service in cloud computing?
Measured service refers to the provider's ability to track and optimize resource usage, ensuring customers pay only for what they consume.
190
What is rapid elasticity in cloud computing?
Rapid elasticity allows customers to scale computing resources up or down dynamically based on demand, ensuring efficiency and cost-effectiveness.
191
What is resource pooling in cloud computing?
Resource pooling enables cloud providers to serve multiple customers using shared resources, efficiently distributing computing power among users through multi-tenancy.
192
What are the two phases of cloud computing?
The two main phases are: Serverful computing (traditional model with dedicated infrastructure). Serverless computing (execution-based model where infrastructure management is abstracted).
193
What are the different serverful computing models?
Serverful computing includes: Infrastructure-as-a-Service (IaaS): Access to raw computing resources (e.g., virtual machines). Platform-as-a-Service (PaaS): Managed infrastructure with OS and development tools. Software-as-a-Service (SaaS): Fully managed applications delivered over the cloud.
194
What technologies enable serverful computing?
Serverful computing relies on virtualization, which includes: Virtual Machines (VMs): Software-based simulations of physical computers managed by hypervisors. Containers: Lightweight, OS-level virtualization managed by the operating system.
195
How does the serverful cost model work?
The serverful cost model is based on resource rental, where customers pay for allocated resources, regardless of whether they are fully utilized. This is similar to renting a car.
196
What are the serverless computing models?
Serverless computing includes: Backend-as-a-Service (BaaS): Pre-built backend services (e.g., authentication, databases). Function-as-a-Service (FaaS): Execution of code in response to events without managing infrastructure.
197
How is serverless computing implemented?
Serverless computing uses hidden containers to run function code. Though servers are still used, their management is abstracted, and responsibility shifts to the cloud provider.
198
How does the serverless cost model work?
Serverless computing charges customers based on execution time rather than resource allocation. This model is often compared to hailing a taxi—you pay only for the ride, not for keeping a car.
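The difference between the two cost models can be made concrete with a back-of-envelope comparison; all prices below are invented for illustration, not real provider rates:

```python
# Sketch: rental (serverful) vs execution-time (serverless) billing.
hours_in_month = 730
vm_price_per_hour = 0.04               # serverful: pay while allocated
requests_per_month = 100_000
seconds_per_request = 0.2
serverless_price_per_second = 0.00002  # serverless: pay per execution

serverful_cost = hours_in_month * vm_price_per_hour
serverless_cost = (requests_per_month * seconds_per_request
                   * serverless_price_per_second)

print(f"serverful:  ${serverful_cost:.2f}")   # paid even when idle
print(f"serverless: ${serverless_cost:.2f}")  # paid only per request
```

At low, bursty traffic the serverless model is far cheaper; at sustained high load the comparison can flip, which is why the car-rental vs taxi analogy is apt.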
199
How can microservices be implemented in cloud computing?
Microservices can be implemented using: Virtual machines (serverful approach, more overhead). Containers (lightweight, efficient serverful approach). Function instances (serverless approach, potential maintenance/performance challenges).
200
What are potential issues when mapping microservices to multiple function instances?
Mapping a single microservice to multiple function instances may create: Maintenance issues (tracking instances). Performance issues (cold start problems when instances are inactive).
201
Lecture 16 - Cloud Computing 2
202
How long did it take to get a new server ready for code deployment in an FT data centre vs. an AWS data centre, according to Wells?
FT data centre: Several weeks to months. AWS data centre: A few minutes to hours. This highlights the agility and scalability benefits of cloud computing.
203
Should one worry about vendor lock-in, according to Wells?
Vendor lock-in occurs when it becomes costly or difficult to switch cloud providers. Wells suggests it is not always a major concern because cloud providers offer significant advantages. Mitigation strategies include using multi-cloud approaches and open standards.
204
What was the deployment frequency before and after moving to the cloud?
Before: Infrequent, possibly quarterly or monthly releases. After: Continuous deployment, allowing multiple releases per day. The cloud enables faster development cycles and quicker feedback loops.
205
Do you have to choose between speed and stability in cloud computing?
No, modern DevOps practices enable both. Automation, continuous integration/continuous deployment (CI/CD), and robust monitoring improve stability while maintaining rapid delivery.
206
Why should you use a queue in cloud-native architecture?
Queues decouple system components, enhancing scalability and reliability. They help handle asynchronous processing and load balancing. Example: Message queues (e.g., AWS SQS, RabbitMQ) prevent system overload.
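The decoupling can be sketched with Python's standard-library `queue` module: the producer never talks to the consumer directly, so either side can slow down, scale, or fail independently:

```python
import queue
import threading

# Minimal sketch of queue-based decoupling (in-process stand-in for a
# message broker such as SQS or RabbitMQ).
q: "queue.Queue[str]" = queue.Queue()
results = []

def consumer():
    while True:
        msg = q.get()
        if msg == "STOP":   # sentinel to shut the worker down
            break
        results.append(f"processed {msg}")

t = threading.Thread(target=consumer)
t.start()

for i in range(3):
    q.put(f"order-{i}")     # producer enqueues and moves on immediately
q.put("STOP")
t.join()
print(results)
```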
207
What should you focus on when developing a distributed system?
Resilience and fault tolerance. Network latency and eventual consistency. Observability: logging, monitoring, and tracing. Scalability: designing for auto-scaling and load balancing.
208
Why should one adopt business-focused monitoring?
Traditional monitoring focuses on infrastructure metrics (CPU, memory, etc.). Business-focused monitoring tracks key performance indicators (KPIs) like user engagement, conversion rates, and revenue. Helps align IT efforts with business goals.
209
Why should one test infrastructure recovery plans?
Ensures business continuity in case of failures. Identifies weaknesses in disaster recovery strategies. Techniques include chaos engineering (e.g., Netflix's Chaos Monkey) to simulate failures and test resilience.
210
Lecture 17 - DevOps
211
How does Amazon's approach differ from traditional development/operations models?
Amazon promotes the philosophy "You build it, you run it," which gives developers operational responsibilities. This closes the feedback loop with customers and enhances service quality, unlike traditional models where developers hand off code and disengage.
212
What is the CALMS acronym in DevOps?
CALMS stands for Culture, Automation, Lean, Measurement, and Sharing. These are the five key principles that guide DevOps practices.
213
What does the Culture principle in DevOps emphasize?
Culture in DevOps emphasizes collaboration and shared values among teams. It encourages a blameless environment focused on learning from mistakes and continuously improving.
214
How did Toyota and GM demonstrate the impact of culture in manufacturing?
At the NUMMI plant, Toyota retrained GM workers using a high-trust, continuous improvement culture. Within three months, the plant produced the highest-quality cars in America, highlighting the power of DevOps-aligned culture.
215
Why is automation important in DevOps?
Automation reduces the risk of deployment failures, speeds up processes, ensures repeatability, and increases transparency, allowing teams to focus on higher-value tasks.
216
What is Toyota's concept of Jidoka, and how does it relate to DevOps?
Jidoka means "automation with a human touch." Machines or operators can halt production upon detecting issues. In DevOps, this relates to empowering systems or developers to detect and respond to failures early.
217
What does the Lean principle advocate for in DevOps?
Lean focuses on eliminating waste (e.g., unnecessary processes, handoffs, or rework) to improve efficiency and reduce delays without sacrificing product quality.
218
How can waste be minimized in software development according to Lean practices?
By limiting work in progress and minimizing handoffs, teams can stay focused, avoid being interrupt-driven, and reduce coordination overhead.
219
What are examples of waste defined in Lean manufacturing, and what are their software equivalents?
Waste types include waiting (e.g., delayed deployments), transportation (data transfer inefficiencies), overproduction (building unneeded features), and rework (fixing bugs). Kanban boards help visualize and manage these wastes.
220
What role does Measurement play in DevOps?
Measurement involves continuous monitoring of metrics and logs to quickly detect, diagnose, and fix system issues. It supports data-driven decisions and improvements.
221
What are metrics in the context of DevOps and why are they critical?
Metrics are time-series data points that reflect system behavior. They are key to assessing system performance and comparing against KPIs or benchmarks for improvement.
222
How does Toyota use metrics to improve team performance?
Toyota tracks metrics like floor lengths in tenths to identify bottlenecks or underperforming areas, guiding targeted support through gemba walks (visits to the workplace).
223
What is the Sharing principle in DevOps and why is it important?
Sharing promotes open communication and knowledge exchange between development and operations teams. It fosters collaboration, rapid problem detection, and learning from incidents.
224
How can teams implement the Sharing principle practically?
By including operations in dev meetings, lunches, and team events, organizations build relationships and feedback loops that prevent problems and enhance mutual learning.
225
What is genchi genbutsu, and how does it support DevOps?
Genchi genbutsu means "go and see." Managers visit the worksite to understand issues firsthand, mirroring DevOps practices of engaging with systems and teams directly to solve problems.
226
What are the differing concerns of developers and operators according to Vargo?
According to Vargo, developers prioritize agility, meaning the ability to quickly write and deploy code. Operators prioritize stability, which refers to keeping systems reliable and preventing disruptions. This difference in priorities often leads to tension between the two roles.
227
What is DevOps in its purest form, according to Vargo?
DevOps, in its purest form, is about breaking down the metaphorical wall between developers and operators. It emphasizes collaboration, shared responsibilities, and integrated workflows between traditionally siloed teams to streamline development and operations.
228
Why should organizations reduce silos, according to Vargo?
Reducing organizational silos is essential because success in DevOps comes from cooperation between cross-functional teams. When teams work in isolation, it hinders communication, slows down processes, and reduces the overall efficiency of software delivery.
229
Why is it important to accept failure as normal in DevOps?
Failure should be accepted as normal because all human-created systems are inherently unreliable. Recognizing this encourages organizations to prepare for failure, build resilient systems, and respond to issues more effectively rather than trying to eliminate failure entirely.
230
Why should organizations implement gradual change, according to Vargo?
Gradual change is crucial because large, million-line changes are difficult to debug and verify. Smaller, incremental changes reduce risk, make it easier to detect bugs, and allow for faster feedback cycles and rollbacks if necessary.
231
Why should DevOps leverage tooling and automation?
Tooling and automation are essential because they convert manual work into repeatable, automated patterns. This improves consistency, reduces human error, and increases the speed and reliability of development and operational tasks.
232
Why is it important to measure everything in a DevOps environment?
Measurement is critical because it provides data to justify DevOps investments and sets clear metrics for success. Without data, it's difficult to assess progress or identify areas needing improvement.
233
How does Site Reliability Engineering (SRE) reduce organizational silos?
SRE reduces silos by promoting shared ownership with developers, using common tools, and adopting shared availability metrics. This fosters communication and collaboration between development and operations teams.
234
How does SRE approach the idea of accepting failure as normal?
SRE acknowledges system unreliability through the use of Service Level Objectives (SLOs), which define acceptable levels of performance. It also conducts blameless postmortems after failures to learn and improve without assigning personal blame.
235
How does SRE implement gradual change?
SRE encourages small, fast, and iterative deployments to minimize the cost of failure. These smaller changes reduce complexity and make issues easier to diagnose and fix.
236
How does SRE use tooling and automation to eliminate toil?
SRE aims to automate tasks that were manual in the past. The goal is that work done manually this year should be automated by the next, thereby reducing repetitive, manual work known as "toil."
237
What types of metrics does SRE measure?
SRE measures both system metrics, like reliability and performance, and human metrics, such as the amount of toil. This dual approach ensures technical health and team sustainability.
238
Lecture 19
239
What does the acronym CALMS stand for in DevOps?
CALMS stands for Culture, Automation, Lean, Measurement, and Sharing, representing the five pillars of DevOps principles.
240
How does Site Reliability Engineering (SRE) implement the Culture principle of DevOps?
SRE supports Culture by forming separate SRE teams or embedding SREs in development teams. It promotes a "blameless" culture in incident analysis to encourage openness and learning.
241
What is a blameless postmortem, and why is it important in SRE?
A blameless postmortem avoids blaming individuals or teams for failures. This helps foster trust, openness, and continuous improvement without fear of punishment.
242
What is the goal of organizational learning in SRE?
The goal is to share postmortems across engineering teams to learn from incidents and improve system resilience.
243
How does SRE reduce toil through automation?
SRE uses automation to eliminate "toil," which is manual, repetitive, and low-value operational work. The aim is to free engineers to focus on higher-value engineering tasks.
244
What is the ideal time distribution between engineering work and operations for an SRE team, especially at Google?
Ideally, SREs should spend at least 50% of their time on engineering work and the remainder on on-call duties like support and incident response.
245
Why is limiting incident frequency important for on-call SREs?
To prevent "pager fatigue" and ensure quality incident handling and postmortems, each SRE should face no more than two incidents per 8-12 hour shift.
246
What is an error budget and how is it used in SRE?
An error budget is calculated as the difference between observed reliability and agreed reliability. It helps control feature releases by allowing them when within budget and halting them when exceeded.
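The calculation can be sketched in a few lines; the SLO target and request counts below are illustrative:

```python
# Sketch of an error budget check: budget = observed reliability minus
# agreed reliability. Positive budget -> feature releases may proceed.
slo_target = 0.999          # agreed reliability (99.9%)
total_requests = 1_000_000
failed_requests = 400

observed = 1 - failed_requests / total_requests  # observed reliability
budget_remaining = observed - slo_target         # fraction of requests

if budget_remaining > 0:
    print(f"within budget ({budget_remaining:.4f}); releases may proceed")
else:
    print("budget exhausted; halt releases and focus on reliability")
```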
247
How does polarizing time help reduce handoffs in SRE?
By clearly separating development and operations times, engineers can focus solely on one type of work at a time, reducing context switching and miscommunication.
248
How does SRE apply the Measurement principle of DevOps?
SRE obsessively monitors a small set of key metrics chosen based on intuition, experience, and user needs, ensuring they reflect service health accurately.
249
What are SLIs, SLOs, and SLAs in the context of SRE?
SLI (Service Level Indicator): Quantitative measure of service aspect (e.g., latency). SLO (Service Level Objective): Target value or acceptable range for an SLI. SLA (Service Level Agreement): Contract specifying consequences for meeting or missing an SLO.
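The relationship between an SLI and its SLO can be shown with an availability SLI computed from request outcomes (the status codes and target are made up):

```python
# Sketch: availability SLI = fraction of successful requests,
# compared against an SLO target.
statuses = [200, 200, 500, 200, 200, 200, 503, 200, 200, 200]

good = sum(1 for s in statuses if s < 500)
sli = good / len(statuses)   # SLI: the measured quantity
slo = 0.95                   # SLO: the target for that quantity

print(f"SLI = {sli:.0%}, SLO = {slo:.0%}, met: {sli >= slo}")
```

An SLA would then add a contractual consequence (e.g. service credits) for missing the SLO.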
250
What are the two main aspects of the Sharing principle in SRE?
Knowledge sharing and tool/technique sharing.
251
How is knowledge sharing implemented between development and operations in SRE?
Developers inform ops about upcoming functionality; ops inform developers about performance issues, ensuring both sides are informed.
252
Why is sharing tools and techniques important in SRE?
It standardizes environment management and deployment, allowing any engineer to self-service tasks like deployments, improving efficiency and reducing bottlenecks.
253
Lecture 20
254
What makes a good alert in Site Reliability Engineering, and why is it important?
A good alert is actionable and pertains to an issue that cannot be fixed without human intervention. If automated remediation is feasible, it should be attempted first. This is important for SREs because poor alerts lead to alert fatigue and unnecessary stress, particularly during on-call shifts.
255
What is 'reliability theatre' in the context of SRE, and why is it problematic?
Reliability theatre refers to traditional setups like Network Operations Centres (NOCs) or war rooms that exist mainly to impress stakeholders rather than improve reliability. SREs find this problematic because it detracts from genuine incident response effectiveness.
256
In SRE, what is a 'snowflake' server, and why is it discouraged?
A snowflake is a production server that is unique, manually configured (e.g. through ad-hoc command-line tweaks), and therefore hard to reproduce, replace, or debug. SREs discourage snowflakes because they violate the principle of infrastructure as code and hinder scalability and reliability.
257
What are 'pets', 'cattle', and 'poultry' in SRE, and how do they differ?
'Pets' are individually managed servers with unique configurations, akin to snowflakes. 'Cattle' are standardised servers managed in groups. 'Poultry' refers to ephemeral containers, also managed in bulk. The progression from pets to poultry reflects increasing automation and decreasing management overhead.
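The cattle mindset can be illustrated with a tiny declarative-reconciliation sketch: every server is described by one shared spec, and a drifted server is rebuilt from the spec rather than hand-repaired like a pet. The spec fields and function names are invented for illustration.

```python
# Sketch: "cattle" servers are reconciled against one declarative spec;
# a drifted box is rebuilt, not debugged by hand.

DESIRED_SPEC = {"os": "ubuntu-22.04", "nginx": "1.24", "monitoring": True}

def reconcile(server: dict) -> dict:
    """Return the server, rebuilt from the spec if it has drifted."""
    if server != DESIRED_SPEC:
        return dict(DESIRED_SPEC)  # rebuild from the spec, don't repair
    return server

fleet = [
    dict(DESIRED_SPEC),                                            # healthy
    {"os": "ubuntu-22.04", "nginx": "1.18", "monitoring": False},  # drifted "pet"
]
fleet = [reconcile(s) for s in fleet]
print(all(s == DESIRED_SPEC for s in fleet))  # True: the fleet is uniform again
```

'Poultry' pushes this one step further: containers are so cheap and short-lived that there is nothing to reconcile; a drifted instance is simply killed and rescheduled.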
258
Why is autonomous better than automated in SRE, and how does this impact operations?
Autonomous systems make independent decisions and take actions without human input, whereas automated systems merely follow predefined instructions. Autonomous is better than automated because it reduces human intervention, eases the burden on the on-call rotation, and improves system resilience.
259
What are the benefits of embedding an SRE in a development team?
Embedding SREs fosters trust and collaboration between development and operations. It allows SREs to contribute to system design early on, resulting in more reliable and maintainable software.
260
How is the 'right number of nines' determined in SRE? And what is the tradeoff?
The "right number of nines" (e.g., 99.9%, 99.99% uptime) depends on how much downtime the business can tolerate. It's a trade-off between reliability and cost, based on business requirements and customer expectations.
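The trade-off becomes concrete when the nines are converted into a downtime budget. This small sketch (names illustrative) shows how quickly the allowed downtime shrinks as nines are added, which is what drives the cost side of the trade-off.

```python
# Sketch: yearly downtime budget implied by an availability target.

MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600

def allowed_downtime_minutes_per_year(availability: float) -> float:
    """Downtime budget per year for a target such as 0.999 (three nines)."""
    return (1 - availability) * MINUTES_PER_YEAR

for target in (0.99, 0.999, 0.9999):
    minutes = allowed_downtime_minutes_per_year(target)
    print(f"{target:.2%} -> {minutes:.1f} min/year")
```

Roughly: two nines allow about 87 hours of downtime a year, three nines about 8.8 hours, four nines under an hour; each extra nine is a tenfold tightening, usually at far more than tenfold the engineering cost.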
261
Define the property of eventual consistency in microservices/ distributed systems
Eventual consistency is a property of distributed systems, like those often found in microservices and enterprise computing, where the system guarantees that, given enough time without new updates, all parts of the system will eventually reflect the same data (reach consistency).
∙ Not immediate: unlike strong consistency (where updates are visible everywhere instantly), eventual consistency tolerates delays.
∙ Asynchronous replication: changes are copied to other services/databases in the background.
∙ Common in high-availability systems: it allows the system to keep working even if some parts are temporarily unreachable.
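The asynchronous-replication behaviour described above can be sketched in a few lines: a primary accepts writes immediately and replicates them in the background, so replicas lag briefly but converge once the replication queue drains. All class and key names here are illustrative.

```python
# Sketch of eventual consistency: writes land on the primary at once,
# replicas catch up asynchronously and eventually converge.
from collections import deque

class Primary:
    def __init__(self, replicas):
        self.data = {}
        self.replicas = replicas
        self.log = deque()  # pending replication events

    def write(self, key, value):
        self.data[key] = value
        self.log.append((key, value))  # replicated later, not immediately

    def replicate_one(self):
        """Apply one pending event to every replica (the 'background' step)."""
        if self.log:
            key, value = self.log.popleft()
            for replica in self.replicas:
                replica[key] = value

replica_a, replica_b = {}, {}
primary = Primary([replica_a, replica_b])

primary.write("user:1", "alice")
print(replica_a.get("user:1"))  # None: the replicas have not caught up yet

primary.replicate_one()         # background replication runs
print(replica_a == replica_b == {"user:1": "alice"})  # True: converged
```
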
263
Define Key Performance Indicators
Key Performance Indicators (KPIs) in software development are specific, measurable metrics used to evaluate the performance and progress of software teams, projects, or products. They help organizations track success, identify areas for improvement, and align development efforts with business goals.
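As a concrete illustration, one widely used delivery KPI is the change failure rate: the fraction of deployments that cause a failure in production. The function name and numbers below are illustrative, not from any standard tooling.

```python
# Sketch: computing one common software-delivery KPI, change failure rate.

def change_failure_rate(failed_deployments: int, total_deployments: int) -> float:
    """KPI: fraction of deployments that caused a production failure."""
    if total_deployments == 0:
        return 0.0  # no deployments, so no failures to attribute
    return failed_deployments / total_deployments

print(f"{change_failure_rate(3, 40):.1%}")  # 7.5%
```
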
264