Kubernetes Flashcards

Question

[k8s] What is sidecar container?

Answer 1

A sidecar container is a container that augments the operation of the main container of the pod. You add a sidecar to a pod so you can use an existing container image instead of cramming additional logic into the main app's code, which would make it overly complex and less reusable

Answer 2

The persistent volumes types explored so far required the developer of the pod to have knowledge of the actual network storage infrastructure available in cluster. For example, to create a NFS-backed volume, the developer has to know the actual server the NFS export is located on. This is against the basic idea of Kubernetes, which aims to hide the actual infrastructure from both the application and its developer, leaving them free from worrying about the specifics of the infrastructure and making apps portable across a wide array of cloud providers and on-premises data centers. To enable apps to request storage in a Kubernetes cluster without having to deal with infrastructure specifics, two new resources were introduced. They are Persistent-Volumes and PersistentVolumeClaim.

Answer 3

The cluster administrator sets up the underlying storage and then registers it in Kubernetes by creating a PersistentVolume resource through the Kubernetes API server. When creating the PersistentVolume, the admin specifies its size and the access modes it supports. Note the PV need be created (static) by the administrator. When a cluster user needs to use persistent storage in one of their pods, they first create a PersistentVolumeClaim manifest, specifying the minimum size and the access mode they require. The user then submits the PersistentVolumeClaim manifest to the Kubernetes API server, and Kubernetes finds the appropriate PersistentVolume and binds the volume to the claim. The PersistentVolumeClaim can then be used as one of the volumes inside a pod. Other users cannot use the same PersistentVolume until it has been released by deleting the bound PersistentVolumeClaim. PersistentVolume dont belong to any namespace. They are cluster-level resource like nodes.

Answer 4

When a PVC is deleted, PV goes into state 'released' if 'PersistentVolumeClaimPolicy' is set to 'retain'. So in this case new pod requesting PVC will not get the PV. For that the PV need to be cleaned up manually. 'recycle' policy automatically deletes the volume's contents and makes the volume available to be claimed again. 'Delete' policy deletes the underlying storage

Answer 5

PersistentVolumes and PersistentVolumeClaims makes it easy to obtain persistent storage without the developer having to deal with the actual storage technology used underneath. But this still requires a cluster administrator to provision the actual storage up front. ``` Kubernetes can also perform this job automatically through dynamic provisioning of PersistentVolumes. The cluster admin, instead of creating PersistentVolumes, can deploy a PersistentVolume provisioner and define one or more StorageClass objects to let users choose what type of PersistentVolume they want. The users can refer to the StorageClass in their PersistentVolumeClaims and the provisioner will take that into account when provisioning the persistent storage. An user can also make a Storageclass default whcich would be used to dynamically provision a PersistentVolume if the PersistentVolumeClaim does not say which storage class to use. ```

Answer 6

Yes. While specifying the container information in pod spec, use 'command' attribute to specify entry point and 'args' attribute to specify arguments to be passed to the command (i.e. overriding the CMD)

Answer 7

Configuration information can be passed on to the containers of the pod in different ways 1. By overriding ENTRYPOINT and CMD attributes. 2. By specifying a custom list of environment variables for each container of a pod. 3. Above two methods are kind of hard-coding configuration information in the pod spec. This problem can be addressed by using configMap resource.

Answer 8

Kubernetes allows separating configuration options into a separate object called a ConfigMap, which is a map containing key/value pairs with the values ranging from short literals to full config files. An application doesn’t need to read the ConfigMap directly or even know that it exists. The contents of the map are instead passed to containers as either environment variables or as files in a volume. if the referenced ConfigMap doesn’t exist when you create the pod. Kubernetes schedules the pod normally and tries to run its containers. The container referencing the non-existing ConfigMap will fail to start, but the other container will start normally. If you then create the missing ConfigMap, the failed container is started without requiring you to recreate the pod.

Answer 9

Using a ConfigMap and exposing it through a volume brings the ability to update the configuration without having to recreate the pod or even restart the container. When you update a ConfigMap, the files in all the volumes referencing it are updated. It’s then up to the process to detect that they’ve been changed and reload them.

Answer 10

To store and distribute sensitive information, Kubernetes provides a separate object called a Secret. Secrets are much like ConfigMaps—they’re also maps that hold key-value pairs. They can be used the same way as a ConfigMap. You can  Pass Secret entries to the container as environment variables  Expose Secret entries as files in a volume. Kubernetes helps keep your Secrets safe by making sure each Secret is only distributed to the nodes that run the pods that need access to the Secret. Also, on the nodes themselves, Secrets are always stored in memory and never written to physical storage, which would require wiping the disks after deleting the Secrets from them.

Answer 11

Every pod gets a default secret mounted in it which contains information that can be used by the pod to talk to the kubernets api-server.

Answer 12

conceptually secret and configMap are same. They allow run time configurations for the pod. But secrets and configMaps has few differences - - The contents of secret's entries are shown as Base-64 encoded strings, whereas those of a configMap are shown in cleartext - Maximum size of secret is limited to 1 MB

Answer 13

The Downward API enables you to expose the pod’s own metadata to the processes running inside that pod. It allows you to pass metadata about the pod and its environment through environment variables or files (in a downwardAPI volume). Don’t be confused by the name. The Downward API isn’t like a REST endpoint that your app needs to hit so it can get the data. It’s a way of having environment variables or files populated with values from the pod’s specification or status Currently it allows to pass following info to the processes running inside the pod  The pod’s name  The pod’s IP address  The namespace the pod belongs to  The name of the node the pod is running on  The name of the service account the pod is running under  The CPU and memory requests for each container  The CPU and memory limits for each container  The pod’s labels  The pod’s annotations Downward API allows you to keep the application Kubernetes-agnostic. This is especially useful when you’re dealing with an existing application that expects certain data in environment variables. The Downward API allows you to expose the data to the application without having to rewrite the application or wrap it in a shell script, which collects the data and then exposes it through environment variables.

Answer 14

kubernetes master hosts the kubernetes API server component which uses HTTPS and requires authentication. The URL to the API server can be obtained by running command $kubectl cluster-info Rather than dealing with authentication yourself, you can talk to the server through a proxy by running the kubectl proxy command. The 'kubectl proxy' command runs a proxy server that accepts HTTP connection on your local machine and proxies them to the API server while taking care of authentication, so you don't need to pass the authentication token in every request. It also makes sure you are talking to actual API server and not a man in the middle by verifying the server's certificate on each request $kubectl proxy

Answer 15

To talk to the API server from inside a pod, you need to take care of three things:  Find the location of the API server.  Make sure you’re talking to the API server and not something impersonating it.  Authenticate with the server; otherwise it won’t let you see or do anything. Finding the location of API server is easy because a service called 'kubernetes' is automatically exposed in the default namespace and configured to point to the API server. $kubectl get svc Environment variables are configured for each service so you can get both the IP and port of API server by lookingup the KUBERNETES_SERVICE_HOST and KUBERNETES_SERVICE_PORT variables The default secret is mounted inside each container at /var/run/secrets/kubernetes.io/serviceaccount/. $ls /var/run/secrets/kubernetes.io/serviceaccount/ ca.crt namespace token ca.crt file: Holds the certificate of the certificate authority (CA) used to sign the Kubernetes API server’s certificate. This file can be used in curl command to ensure that you are talking to API server only $ curl --cacert /var/run/secrets/kubernetes.io/serviceaccount /ca.crt https://kubernetes An authentication token on the secrets volume can be used to authenticate to the API server TOKEN=$(cat /var/run/secrets/kubernetes.io/serviceaccount/token) Using these details you can talk to the API server $curl -H "Authorization: Bearer $TOKEN" https://kubernetes

Answer 16

setting up certificate option and using tokens to access API server is cumbersome. So instead of talking to API server directly, you can run 'kubectl proxy' in an ambassador container alongside the main container and communicate with the API server through it. Because all containers in a pod share the same loopback network interface, your app can access the proxy through a port on a localhost.

Answer 17

A deployment is a higher-level resource meant for deploying applications and updating them declaratively, instead of doing it through a ReplicationController or a ReplicaSet, which are both considered lower level concepts. When you create a Deployment, a ReplicaSet resource is created underneath. So when using a Deployment, the actual pods are created and managed by Deployment's ReplicaSets and not by the Deployment directly. A hash value of pod's template is calculated and it is used while naming the underneath ReplicaSet and the pods. While updating the application, one just need to update a pod template and Deployment resource will take care of rolling out the update. Updates are done using different strategies, in case of rollingUpdate strategy, the pods are replaced one by one without any downtime of the application. There are other strategies like ReCreate which removes all the pods and then creates new ones. Note the Deployment resource creates multiple ReplicaSets, one for each version of the pod template. The Deployment can be rolled back or aborted mid-way. One can pause a deployment to inspect how a single instance of the new version behaves in production before allowing additional pod instances to replace old ones. You can control the rate of the rolling update through maxSurge and maxUnavailable properties

Answer 18

A canary release is a technique for minimizing the risk of rolling out a bad version of an application and it affecting all your users. Instead of rolling out the new version to everyone, you replace only one or a small number of old pods with the new once. This way only a small number of users wil initially hit the new version. You can then verify whether the new version is working fine or not and then either continue the roll-out across the remaining pods or roll back the previous version.

Answer 19

Pods are usually fronted by a Service. It’s possible to have the Service front only the initial version of the pods while you bring up the pods running the new version. Then, once all the new pods are up, you can change the Service’s label selector and have the Service switch over to the new pods. This is called a blue-green deployment. After switching over, and once you’re sure the new version functions correctly, you’re free to delete the old pods by deleting the old ReplicationController.

Answer 20

ReplicaSets create exact copies of a pod. These replicas dont differ from each other, apart from their name and IP address. If a pod template includes a volume, all replicas of the ReplicaSet will use the exact same volume. Statefulsets can be used here, where each pod created can have its own storage volume.

Answer 21

When a stateful pod instance dies (or the node it is running on fails), the pod instance need to be resurrected on another node, but the new instance needs to get the same name, network identity and state as the one it is replacing. ReplicaSets are stateless so they can be replaced with a completely new pod replica at any time.

Answer 22

A Kubernetes cluster is split into two parts:  The Kubernetes Control Plane  The (worker) nodes The Control Plane is what controls and makes the whole cluster function. The components that make up the Control Plane are -  The etcd distributed persistent storage  The API server  The Scheduler  The Controller Manager These components store and manage the state of the cluster, but they aren’t what runs the application containers. The task of running your containers is up to the components running on each worker node:  The Kubelet  The Kubernetes Service Proxy (kube-proxy)  The Container Runtime (Docker, rkt, or others) Kubernetes system components communicate only with the API server. They don’t talk to each other directly. The API server is the only component that communicates with etcd. None of the other components communicate with etcd directly, but instead modify the cluster state by talking to the API server. ``` #Get status of control plane components $kubectl get componentstatuses ```

Answer 23

A single master k8s cluster is not a good idea as the master becomes a single point of failure in the system. Generally a multi-master k8s deployment is done to ensure highly available system. For high availability of the control plane multiple instances of etc and API server can be active at the same time and do perform their jobs in parallel. Only one instance of scheduler and controller manager can be active at given time, with other in standby mode.

Answer 24

API server provides - CRUD operations - Validation of objects stored in etcd - Call various registered plugins like authorization and authentication. The API server doesn’t do anything else. For example, it doesn’t create pods when you create a ReplicaSet resource and it doesn’t manage the endpoints of a service. That’s what controllers in the Controller Manager do. But the API server doesn’t even tell these controllers what to do. All it does is enable those controllers and other components to observe changes to deployed resources. A Control Plane component can request to be notified when a resource is created, modified, or deleted. This enables the component to perform whatever task it needs in response to a change of the cluster metadata. ``` # what the pods i.e. creation, deletion etc. $kubectl get pods --watch ```

Answer 25

The operation of the Scheduler is simple. All it does is wait for newly created pods through the API server’s watch mechanism and assign a node to each new pod that doesn’t already have the node set. The Scheduler doesn’t instruct the selected node (or the Kubelet running on that node) to run the pod. All the Scheduler does is update the pod definition through the API server. The API server then notifies the Kubelet (again, through the watch mechanism described previously) that the pod has been scheduled. As soon as the Kubelet on the target node sees the pod has been scheduled to its node, it creates and runs the pod’s containers.

Answer 26

the Kubelet is the component responsible for everything running on a worker node. Its initial job is to register the node it’s running on by creating a Node resource in the API server. Then it needs to continuously monitor the API server for Pods that have been scheduled to the node, and start the pod’s containers. It does this by telling the configured container runtime (which is Docker, CoreOS’ rkt, or something else) to run a container from a specific container image. The Kubelet then constantly monitors running containers and reports their status, events, and resource consumption to the API server. The Kubelet is also the component that runs the container liveness probes, restarting containers when the probes fail. Lastly, it terminates containers when their Pod is deleted from the API server and notifies the server that the pod has terminated.

Answer 27

kube-proxy makes sure clients can connect to the services you define using service object. it makes sure connections to the service IP and port end up at one of the pods backing that service (or other, non-pod service endpoints). When a service is backed by more than one pod, the proxy performs load balancing across those pods.

Answer 28

Both the Control Plane components and the Kubelet emit events to the API server as they perform actions. They do this by creating Event resources, which are like any other Kubernetes resource. you can also retrieve events directly with "kubectl get events". Also watching events with the --watch option is much easier on the eyes and useful for seeing what is happening in the cluster. $kubectl get events --watch

Answer 29

Th pause container is the container that holds all the containers of a pod together. The pause container is an infrastructure container whose sole purpose is to hold all namespaces i.e.e network and other Linux namespaces. All other user-defined containers of the pod then use the namespaces of the pod infrastructure container. Actual application containers may die and get restarted. When such a container starts up again, it needs to become part of the same Linux namespaces as before. The infrastructure container makes this possible since its lifecycle is tied to that of the pod—the container runs from the time the pod is scheduled until the pod is deleted. If the infrastructure pod is killed in the meantime, the Kubelet recreates it and all the pod’s containers.

Answer 30

``` Each pod gets its own unique IP address and can communicate with all other pods through a flat, NAT-less network. The network is set up by the system administrator or by a Container Network Interface (CNI) plugin, not by Kubernetes itself. Number of CNI plugins are available * Calico * Flannel * romana * Weave Net ```

Answer 31

Each Service gets its own stable IP address and port. Clients (usually pods) use the service by connecting to this IP address and port. The IP address is virtual—it’s not assigned to any network interfaces and is never listed as either the source or the destination IP address in a network packet when the packet leaves the node. A key detail of Services is that they consist of an IP and port pair (or multiple IP and port pairs in the case of multi-port Services), so the service IP by itself doesn’t represent anything. That’s why you can’t ping them.

Kubernetes Flashcards

study kubernetes (55 cards)