KUBERNETES ARCHITECTURE Flashcards
Components of Kubernetes
In its simplest form, Kubernetes is made of a central manager (aka master) and some worker nodes (we will see in a follow-on chapter how you can actually run everything on a single node for testing purposes). The manager runs an API server, a scheduler, various controllers and a storage system to keep the state of the cluster, container settings, and the networking configuration.
Kubernetes exposes an API (via the API server): you can communicate with the API using a local client called kubectl or you can write your own client. The kube-scheduler sees the requests for running containers coming to the API and finds a suitable node to run that container in. Each node in the cluster runs two processes: a kubelet and a kube-proxy. The kubelet receives requests to run the containers, manages any necessary resources and watches over them on the local node. The kube-proxy creates and manages networking rules to expose the container on the network.
Pod
We have learned that Kubernetes is an orchestration system to deploy and manage containers. Containers are not managed individually; instead, they are part of a larger object called a Pod. A Pod consists of one or more containers which share an IP address, access to storage, and namespaces. Typically, one container in a Pod runs an application, while other containers support the primary application.
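As an illustration (names and images here are hypothetical), a minimal Pod manifest with a primary application container and a supporting sidecar might look like:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: example-pod          # hypothetical name
spec:
  containers:
  - name: app                # primary application container
    image: nginx:1.25
  - name: helper             # supporting container; shares the Pod's IP and volumes
    image: busybox:1.36
    command: ["sh", "-c", "sleep infinity"]
```

Both containers can reach each other over localhost, since they share the Pod's network namespace.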
Orchestration
Orchestration is managed through a series of watch-loops, also known as operators or controllers. Each controller interrogates the kube-apiserver for a particular object state, modifying the object until the current state matches the declared state. The default, newest, and most feature-filled controller for containers is a Deployment. A Deployment ensures that resources are available, such as an IP address and storage, and then deploys a ReplicaSet. The ReplicaSet is a controller which deploys and restarts Pods until the requested number of replicas is running. Previously, this function was handled by the ReplicationController, which has been superseded by Deployments. There are also Jobs and CronJobs to handle single or recurring tasks.
Deployment
The default, newest, and feature-filled controller for containers is a Deployment. A Deployment ensures that resources are available, such as IP address and storage, and then deploys a ReplicaSet.
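A sketch of a Deployment manifest (names are hypothetical); the ReplicaSet it creates keeps the requested number of Pods running:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: example-deploy       # hypothetical name
spec:
  replicas: 3                # the generated ReplicaSet maintains 3 Pods
  selector:
    matchLabels:
      app: example
  template:                  # Pod template stamped out by the ReplicaSet
    metadata:
      labels:
        app: example
    spec:
      containers:
      - name: web
        image: nginx:1.25
```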
ReplicaSet
The ReplicaSet is a controller which deploys and restarts Pods until the requested number of replicas is running. Previously, this function was handled by the ReplicationController, which has been superseded by Deployments.
Jobs and CronJobs
There are also Jobs and CronJobs to handle single or recurring tasks.
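For example, a CronJob for a recurring task might be sketched as follows (the name is hypothetical, and the API version varies with the cluster release: `batch/v1` on current clusters, `batch/v1beta1` on older ones):

```yaml
apiVersion: batch/v1
kind: CronJob
metadata:
  name: example-cron         # hypothetical name
spec:
  schedule: "*/5 * * * *"    # standard cron syntax: run every five minutes
  jobTemplate:               # each run creates a Job from this template
    spec:
      template:
        spec:
          restartPolicy: OnFailure
          containers:
          - name: task
            image: busybox:1.36
            command: ["date"]
```

A one-off task would use a `Job` object instead, with the same inner Pod template.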
DaemonSet
A DaemonSet will ensure that a single pod is deployed on every node. These are often used for logging and metrics pods.
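A minimal DaemonSet sketch (hypothetical names; a real logging agent image would replace the stand-in):

```yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: node-logger          # hypothetical name
spec:
  selector:
    matchLabels:
      app: node-logger
  template:                  # one Pod from this template runs on every node
    metadata:
      labels:
        app: node-logger
    spec:
      containers:
      - name: agent
        image: busybox:1.36  # stand-in; a logging or metrics agent would go here
        command: ["sh", "-c", "sleep infinity"]
```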
StatefulSet
A StatefulSet can be used to deploy Pods with stable identities and in a particular order, such that subsequent Pods are only deployed if the previous Pods report a Ready status.
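A StatefulSet sketch (hypothetical names): Pods are created in order as example-db-0, example-db-1, example-db-2, each with a stable DNS name via the headless Service:

```yaml
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: example-db           # hypothetical name
spec:
  serviceName: example-db    # headless Service providing stable per-Pod DNS names
  replicas: 3                # Pods start in order; each waits for the previous to be Ready
  selector:
    matchLabels:
      app: db
  template:
    metadata:
      labels:
        app: db
    spec:
      containers:
      - name: db
        image: postgres:15
```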
Labels
Managing thousands of Pods across hundreds of nodes can be a difficult task. To make management easier, we can use labels, arbitrary strings which become part of the object metadata. These can then be used when checking or changing the state of objects without having to know individual names or UIDs. Nodes can have taints to discourage Pod assignment, unless the Pod has a matching toleration in its metadata.
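For instance, labels attached in metadata (hypothetical names and values) let you act on groups of objects by selector rather than by name:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: web-1
  labels:                    # arbitrary key/value strings used for selection
    app: web
    tier: frontend
spec:
  containers:
  - name: web
    image: nginx:1.25
# Select by label instead of name, e.g.:
#   kubectl get pods -l app=web
```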
metadata for annotations
There is also space in metadata for annotations, which remain with the object but cannot be used to select objects with Kubernetes commands. This information could be used by third-party agents or other tools.
multi-tenancy
While using lots of smaller, commodity hardware could allow every user to have their very own cluster, often multiple users and teams share access to one or more clusters. This is referred to as multi-tenancy.
namespace
A segregation of resources, upon which resource quotas and permissions can be applied. Kubernetes objects may be created in a namespace or be cluster-scoped. Users can be limited by the object verbs allowed per namespace. Also, the LimitRange admission controller constrains resource usage in that namespace. Two objects of the same kind cannot have the same name in the same namespace.
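A sketch of a namespace with a quota applied to it (names and values are hypothetical):

```yaml
apiVersion: v1
kind: Namespace
metadata:
  name: dev                  # hypothetical namespace
---
apiVersion: v1
kind: ResourceQuota
metadata:
  name: dev-quota
  namespace: dev             # the quota only applies inside this namespace
spec:
  hard:
    pods: "10"               # at most 10 Pods in the namespace
    requests.cpu: "4"        # total CPU requests capped at 4 cores
    requests.memory: 8Gi     # total memory requests capped at 8 GiB
```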
context
A combination of user, cluster name, and namespace. A convenient way to switch between combinations of permissions and restrictions. For example, you may have a development cluster and a production cluster, or may be part of both the operations and architecture namespaces. This information is referenced from ~/.kube/config.
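A context entry inside ~/.kube/config ties the three together; a hypothetical fragment (cluster, user, and namespace names are illustrative):

```yaml
# Fragment of ~/.kube/config
contexts:
- name: dev-context          # hypothetical context name
  context:
    cluster: development     # must match an entry under clusters:
    namespace: architecture  # default namespace for this context
    user: alice              # must match an entry under users:
current-context: dev-context # the context kubectl uses by default
```

Switching is then a single command, e.g. `kubectl config use-context dev-context`.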
Resource Limits
A way to limit the amount of resources consumed by a Pod, or to request a minimum amount of resources reserved, but not necessarily consumed, by a Pod. Limits can also be set per namespace, in which case they have priority over those in the PodSpec.
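In a PodSpec, requests and limits are set per container (values here are hypothetical):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: limited-pod          # hypothetical name
spec:
  containers:
  - name: app
    image: nginx:1.25
    resources:
      requests:              # reserved for scheduling, not necessarily consumed
        cpu: 250m            # a quarter of a CPU core
        memory: 64Mi
      limits:                # hard ceiling enforced on the running container
        cpu: 500m
        memory: 128Mi
```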
Pod Security Policies
A policy to limit the ability of pods to elevate permissions or modify the node upon which they are scheduled. This wide-ranging limitation may prevent a pod from operating properly. The use of PSPs may be replaced by Open Policy Agent in the future.
Network Policies
The ability to have an inside-the-cluster firewall. Ingress and Egress traffic can be limited according to namespaces and labels as well as typical network traffic characteristics.
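A sketch of a NetworkPolicy restricting ingress by label (names, labels, and the port are hypothetical):

```yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-frontend       # hypothetical name
  namespace: dev
spec:
  podSelector:               # the Pods this policy protects
    matchLabels:
      app: db
  policyTypes:
  - Ingress
  ingress:
  - from:
    - podSelector:
        matchLabels:
          app: web           # only Pods labeled app=web may reach the db Pods
    ports:
    - protocol: TCP
      port: 5432
```

Note that enforcement depends on the cluster's network plugin supporting NetworkPolicy.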
Master Node
The Kubernetes master runs various server and manager processes for the cluster. Among the components of the master node are the kube-apiserver, the kube-scheduler, and the etcd database. As the software has matured, new components have been created to handle dedicated needs, such as the cloud-controller-manager, which handles tasks once handled by the kube-controller-manager, interacting with third-party tools such as Rancher or DigitalOcean for cluster management and reporting.
There are several add-ons which have become essential to a typical production cluster, such as DNS services. Others are third-party solutions where Kubernetes has not yet developed a local component, such as cluster-level logging and resource monitoring.
Master Node Components
Master node specific components:
- kube-apiserver
- kube-scheduler
- etcd Database
- Other Agents

Common node components:
- kubelet
- kube-proxy
- Container Runtime
kube-apiserver
The kube-apiserver is central to the operation of the Kubernetes cluster.
All calls, both internal and external, are handled via this agent. All actions are accepted and validated by this agent, and it is the only agent which connects to the etcd database. As a result, it acts as a master process for the entire cluster, and as the frontend of the cluster's shared state. Each API call goes through three steps: authentication, authorization, and several admission controllers.
kube-scheduler
The kube-scheduler uses an algorithm to determine which node will host a Pod of containers. The scheduler will try to view available resources (such as volumes) to bind, and then try and retry to deploy the Pod based on availability and success.
There are several ways you can affect the algorithm, or a custom scheduler could be used instead. You can also bind a Pod to a particular node, though the Pod may remain in a pending state due to other settings.
One of the first settings checked is whether the Pod can be deployed within the current quota restrictions. If so, the taints, tolerations, and labels of the Pod are used along with those of the nodes to determine the proper placement.
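For example, a node tainted with `kubectl taint nodes node1 dedicated=infra:NoSchedule` (a hypothetical key and value) repels Pods unless they carry a matching toleration:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: tolerant-pod         # hypothetical name
spec:
  tolerations:               # matches the taint dedicated=infra:NoSchedule
  - key: dedicated
    operator: Equal
    value: infra
    effect: NoSchedule
  containers:
  - name: app
    image: nginx:1.25
```

A toleration only permits scheduling onto the tainted node; to require it, the Pod would also need a nodeSelector or node affinity.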
etcd Database
The state of the cluster, networking, and other persistent information is kept in an etcd database, or, more accurately, a b+tree key-value store. Rather than finding and changing an entry, values are always appended to the end. Previous copies of the data are then marked for future removal by a compaction process. It works with curl and other HTTP libraries, and provides reliable watch queries.
Simultaneous requests to update a value all travel via the kube-apiserver, which passes the requests along to etcd in series. The first request updates the database. The second request no longer has the same version number, so the kube-apiserver replies with a 409 error to the requester. There is no logic past that response on the server side, meaning the client needs to expect this and act upon the denial to update.
There is a leader database instance along with possible followers. They communicate with each other on an ongoing basis to elect a leader, and to choose a new one in the event of failure. While very fast and potentially durable, there have been some hiccups with features like whole cluster upgrades. Starting with v1.15.1, kubeadm allows easy deployment of a multi-master cluster with stacked etcd or an external database cluster.
Other Agents
The kube-controller-manager is a core control loop daemon which interacts with the kube-apiserver to determine the state of the cluster. If the state does not match, the manager will contact the necessary controller to match the desired state. There are several controllers in use, such as endpoints, namespace, and replication. The full list has expanded as Kubernetes has matured.
Remaining in beta as of v1.16, the cloud-controller-manager interacts with agents outside the cluster, such as the underlying cloud provider. It handles tasks once handled by the kube-controller-manager. This allows faster changes without altering the core Kubernetes control process. Each kubelet must be started with the --cloud-provider=external flag.
Worker node
All worker nodes run the kubelet and kube-proxy, as well as the container engine, such as Docker or CRI-O. Other management daemons are deployed to watch these agents or provide services not yet included with Kubernetes.
The kubelet interacts with the underlying Docker Engine also installed on all the nodes, and makes sure that the containers that need to run are actually running. The kube-proxy is in charge of managing the network connectivity to the containers. It does so through the use of iptables entries. It also has the userspace mode, in which it monitors Services and Endpoints using a random high-number port to proxy traffic. Use of ipvs can be enabled, with the expectation it will become the default, replacing iptables.
Kubernetes does not have cluster-wide logging yet. Instead, another CNCF project is used, called Fluentd. When implemented, it provides a unified logging layer for the cluster, which filters, buffers, and routes messages.
Cluster-wide metrics is not quite fully mature, so Prometheus is also often deployed to gather metrics from nodes and perhaps some applications.
kubelet
The kubelet interacts with the underlying Docker Engine also installed on all the nodes, and makes sure that the containers that need to run are actually running.
It is the heavy lifter for changes and configuration on worker nodes. It accepts the API calls for Pod specifications (a PodSpec is a JSON or YAML file that describes a Pod). It will work to configure the local node until the specification has been met.
Should a Pod require access to storage, Secrets or ConfigMaps, the kubelet will ensure access or creation. It also sends back status to the kube-apiserver for eventual persistence.