Kubernetes Interview Questions and Answers
Kubernetes is the most widely used container orchestration tool, and most companies rely on it in their day-to-day activities.
1. What is Kubernetes?
Kubernetes is an open-source container management tool that handles container deployment, the scaling and descaling of containers, and load balancing. Being Google’s brainchild, it offers an excellent community and works brilliantly with all the cloud providers. So, we can say that Kubernetes is not a containerization platform but a multi-container management solution.
2. How is Kubernetes different from Docker Swarm?
| Features | Kubernetes | Docker Swarm |
| --- | --- | --- |
| Installation & Cluster Config | The setup is complicated, but once installed, the cluster is robust. | Installation is very simple, but the cluster is not as robust. |
| GUI | The Kubernetes Dashboard provides a GUI. | There is no built-in GUI. |
| Scalability | Highly scalable. | Highly scalable, and deploys containers faster than Kubernetes. |
| Auto-scaling | Kubernetes supports auto-scaling. | Docker Swarm does not support auto-scaling. |
| Load Balancing | Manual intervention is needed to load-balance traffic between different containers and pods. | Docker Swarm automatically load-balances traffic between containers in the cluster. |
| Rolling Updates & Rollbacks | Can deploy rolling updates and automatic rollbacks. | Can deploy rolling updates, but not automatic rollbacks. |
| Data Volumes | Storage volumes can be shared only with the other containers in the same pod. | Storage volumes can be shared with any other container. |
| Logging & Monitoring | In-built tools for logging and monitoring. | Third-party tools such as the ELK stack are needed for logging and monitoring. |
3. How is Kubernetes related to Docker?
It’s a known fact that Docker provides the lifecycle management of containers and that a Docker image builds the runtime containers. But since these individual containers have to communicate, Kubernetes is used. So, Docker builds the containers, and these containers communicate with each other via Kubernetes. In this way, containers running on multiple hosts can be linked and orchestrated using Kubernetes.
4. What is the difference between deploying applications on hosts and containers?
Fig: Deploying Applications On Hosts Vs Containers
The left-side architecture represents deploying applications on hosts. This kind of architecture has an operating system, and the operating system has a kernel along with the various libraries, installed on the operating system, that the applications need. In this kind of framework you can have any number of applications, and all the applications share the libraries present in that operating system. Deploying applications in containers, by contrast, works a little differently.
In the container architecture, the kernel is the only thing that is common among all the applications. If a particular application needs Java, then only that application gets access to Java, and if another application needs Python, then only that particular application has access to Python.
The individual blocks that you can see on the right side of the diagram are containerized, and they are isolated from the other applications. Each application thus has the necessary libraries and binaries isolated from the rest of the system, where they cannot be encroached on by any other application.
5. What is Container Orchestration?
Consider a scenario where you have 5-6 microservices for an application. These microservices are put in individual containers, but they won’t be able to communicate without container orchestration. Just as orchestration in music means all the instruments playing together in harmony, container orchestration means all the services in individual containers working together to fulfill the needs of a single application.
6. What is the need for Container Orchestration?
Consider you have 5-6 microservices for a single application performing various tasks, and all these microservices are put inside containers. Now, to make sure that these containers communicate with each other we need container orchestration.
Fig: Challenges Without Container Orchestration
Many challenges arose without the use of container orchestration, and container orchestration came into place to overcome them.
7. How does Kubernetes simplify containerized Deployment?
A typical application has a cluster of containers running across multiple hosts, and all these containers need to talk to each other. To achieve this, you need something that can load-balance, scale, and monitor the containers. Since Kubernetes is cloud-agnostic and can run on any public or private provider, it is a natural choice for simplifying containerized deployment.
8. What do you know about clusters in Kubernetes?
The fundamental idea behind Kubernetes is that we can enforce desired state management, by which I mean that we can feed the cluster services a specific configuration, and it will be up to the cluster services to go out and run that configuration in the infrastructure.
Fig 5: Representation Of Kubernetes Cluster
So, as you can see in the above diagram, the deployment file contains all the configurations to be fed into the cluster services. The deployment file is fed to the API server, and it is then up to the cluster services to figure out how to schedule the pods in the environment and make sure that the right number of pods are running.
So, the API server that sits in front of the cluster services, the worker nodes, and the kubelet process that the nodes run all together make up the Kubernetes cluster.
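To make this concrete, here is a minimal sketch of such a deployment file (the name and image are placeholders); feeding it to the API server tells the cluster services to keep three replicas running:

```yaml
# deployment.yaml -- apply with: kubectl apply -f deployment.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web
spec:
  replicas: 3                # desired state: keep 3 pods running at all times
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
        - name: web
          image: nginx:1.25  # placeholder image
```

If a pod dies, the cluster services notice the gap between the desired and actual state and schedule a replacement automatically.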
9. What is Google Container Engine?
Google Kubernetes Engine (GKE), formerly known as Google Container Engine, is a managed environment for Docker containers and clusters, built on Kubernetes. This engine supports only those clusters that run within Google’s public cloud services.
10. What is Heapster?
Heapster is a cluster-wide aggregator of the data provided by the kubelet running on each node. This container management tool is supported natively on the Kubernetes cluster and runs as a pod, just like any other pod in the cluster. It discovers all the nodes in the cluster and queries usage information from them via the on-machine Kubernetes agent, the kubelet. (Note that Heapster has since been deprecated in favor of tools such as metrics-server.)
11. What is Minikube?
Minikube is a tool that makes it easy to run Kubernetes locally. This runs a single-node Kubernetes cluster inside a virtual machine.
12. What is Kubectl?
Kubectl is the command-line tool with which you can pass commands to the cluster. It provides the CLI to run commands against the Kubernetes cluster, with various ways to create and manage Kubernetes components.
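For illustration, here is a minimal sketch of the kubectl workflow against a hypothetical pod manifest (the pod name and image are placeholders):

```yaml
# pod.yaml -- a minimal Pod, managed with the kubectl commands below:
#   kubectl apply -f pod.yaml        # create (or update) the pod
#   kubectl get pods                 # list pods and their status
#   kubectl describe pod nginx-pod   # inspect events, state, and IP
#   kubectl delete -f pod.yaml       # tear the pod down
apiVersion: v1
kind: Pod
metadata:
  name: nginx-pod
spec:
  containers:
    - name: nginx
      image: nginx:1.25
      ports:
        - containerPort: 80
```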
13. What is Kubelet?
Kubelet is an agent service that runs on each node and enables the worker node to communicate with the master. Kubelet works from the description of containers provided to it in the PodSpec and makes sure that the containers described in the PodSpec are healthy and running.
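As a sketch, the PodSpec below includes a liveness probe; the kubelet runs the probe and restarts the container if it fails (the name, image, and probe path are placeholders):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: liveness-demo
spec:
  containers:
    - name: app
      image: nginx:1.25
      livenessProbe:            # the kubelet restarts the container on failure
        httpGet:
          path: /
          port: 80
        initialDelaySeconds: 5  # wait before the first probe
        periodSeconds: 10       # probe every 10 seconds
```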
14. What do you understand by a node in Kubernetes?
A node is a worker machine in Kubernetes; it may be a virtual or a physical machine, depending on the cluster. Each node is managed by the master and runs the services necessary to host pods, such as the kubelet, kube-proxy, and a container runtime.
Fig 6: Node In Kubernetes
Architecture-Based Kubernetes Interview Questions
15. What are the different components of Kubernetes Architecture?
The Kubernetes architecture mainly has 2 components – the master node and the worker node. As you can see in the below diagram, the master and the worker nodes have many inbuilt components. The master node has the kube-controller-manager, kube-apiserver, kube-scheduler, and etcd, whereas each worker node runs the kubelet and kube-proxy.
Fig 7: Architecture Of Kubernetes
16. What do you understand by Kube-proxy?
Kube-proxy runs on each and every node and performs simple TCP/UDP packet forwarding to the backend network services. It is essentially a network proxy on each node that reflects the services configured in the Kubernetes API. The Docker-links-compatible environment variables provide the cluster IPs and ports opened by the proxy.
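As an illustration, the Service below gets a cluster IP, and kube-proxy on every node programs the forwarding rules so that traffic to that IP and port reaches the matching pods (names, labels, and ports are placeholders):

```yaml
apiVersion: v1
kind: Service
metadata:
  name: web
spec:
  selector:
    app: web            # pods carrying this label become the backends
  ports:
    - port: 80          # the service (cluster IP) port
      targetPort: 8080  # the container port traffic is forwarded to
```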
17. Can you brief me on the working of the master node in Kubernetes?
The Kubernetes master controls the nodes, and the containers are present inside the nodes. These individual containers are contained inside pods, and each pod can hold a varying number of containers based on the configuration and requirements. If the pods have to be deployed, they can be deployed using either a user interface or a command-line interface. The pods are then scheduled on the nodes, and, based on the resource requirements, the pods are allocated to those nodes. The kube-apiserver makes sure that communication is established between the Kubernetes nodes and the master components.
Fig 8: Representation Of Kubernetes Master Node
18. What is the role of kube-apiserver and kube-scheduler?
The kube-apiserver follows a scale-out architecture and is the front end of the master node’s control plane. It exposes all the APIs of the Kubernetes master node components and is responsible for establishing communication between the Kubernetes nodes and the Kubernetes master components.
The kube-scheduler is responsible for distributing and managing the workload on the worker nodes. It selects the most suitable node to run an unscheduled pod based on the pod’s resource requirements, and it keeps track of resource utilization to ensure that workloads are not scheduled on nodes that are already full.
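To sketch how the scheduler decides, a pod can declare resource requests; the scheduler only places it on a node with at least that much unreserved capacity (the values here are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: scheduling-demo
spec:
  containers:
    - name: app
      image: nginx:1.25
      resources:
        requests:          # the scheduler filters nodes by these requests
          cpu: "250m"      # a quarter of a CPU core
          memory: "128Mi"
        limits:            # enforced at runtime, not used for placement
          cpu: "500m"
          memory: "256Mi"
```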
19. Can you brief me about the Kubernetes controller manager?
Multiple controller processes run on the master node, but they are compiled together to run as a single process: the Kubernetes Controller Manager. So, the Controller Manager is a daemon that embeds the controllers and handles tasks such as namespace creation and garbage collection. It owns the responsibility for the endpoints and communicates with the API server to manage them.
The different types of controller managers running on the master node are:
Fig 9: Types Of Controllers
20. What is ETCD?
Etcd is written in the Go programming language and is a distributed key-value store used for coordinating distributed work. Etcd stores the configuration data of the Kubernetes cluster, representing the state of the cluster at any given point in time.
21. What are the different types of services in Kubernetes?
The following are the different types of services used:
- ClusterIP: Exposes the service on an internal cluster IP, making it reachable only from within the cluster (this is the default type).
- NodePort: Exposes the service on a static port on each node’s IP.
- LoadBalancer: Exposes the service externally using a cloud provider’s load balancer.
- ExternalName: Maps the service to an external DNS name by returning a CNAME record.
Fig 10: Types Of Services
22. What do you understand by load balancer in Kubernetes?
A load balancer is one of the most common and standard ways of exposing a service. Two types of load balancer are used depending on the working environment: the internal load balancer and the external load balancer. The internal load balancer automatically balances the load and allocates it to the pods with the required configuration, whereas the external load balancer directs traffic from external sources to the backend pods.
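For instance, a Service of type LoadBalancer asks the cloud provider for an external load balancer in front of the backend pods; a minimal sketch (names and ports are placeholders):

```yaml
apiVersion: v1
kind: Service
metadata:
  name: web-lb
spec:
  type: LoadBalancer    # the cloud provider provisions an external LB
  selector:
    app: web
  ports:
    - port: 80
      targetPort: 8080
```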
23. What is the Ingress network, and how does it work?
An ingress network is a collection of rules that act as an entry point to the Kubernetes cluster. It allows inbound connections, which can be configured to expose services externally through reachable URLs, to load-balance traffic, or to offer name-based virtual hosting. So, Ingress is an API object that manages external access to the services in a cluster, usually over HTTP, and it is one of the most powerful ways of exposing services.
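A minimal Ingress manifest might look like the sketch below, routing requests for a hypothetical host to a backing Service (the host, names, and ports are placeholders, and an ingress controller must be running in the cluster):

```yaml
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: web-ingress
spec:
  rules:
    - host: example.com          # name-based virtual hosting
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: web        # hypothetical backing Service
                port:
                  number: 80
```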
Now, let me explain to you the working of the Ingress network with an example.
There are 2 nodes, each having pod and root network namespaces connected by a Linux bridge. In addition to this, a new virtual ethernet device called flannel0 (created by the flannel network plugin) is added to the root network.
Now, suppose we want the packet to flow from pod 1 to pod 4. Refer to the below diagram.
Fig 11: Working Of Ingress Network
- The packet leaves pod1’s network namespace at eth0 and enters the root network namespace at veth0.
- It is then passed on to cbr0, which makes an ARP request to find the destination and discovers that nobody on this node has the destination IP address.
- So, the bridge sends the packet to flannel0, as the node’s route table is configured with flannel0.
- The flannel daemon talks to the Kubernetes API server to learn all the pod IPs and their respective nodes, and creates mappings from pod IPs to node IPs.
- The network plugin wraps this packet in a UDP packet with extra headers, changing the source and destination IPs to the respective nodes, and sends the packet out via eth0.
- Since the route table already knows how to route traffic between nodes, the packet is sent on to the destination node, node2.
- The packet arrives at eth0 of node2 and is handed to flannel0, which decapsulates it and emits it back into the root network namespace.
- The packet is then forwarded to the Linux bridge, which makes an ARP request to find out the IP that belongs to veth1.
- The packet finally crosses the root network and reaches the destination, pod4.
24. What do you understand by Cloud controller manager?
The Cloud Controller Manager is responsible for persistent storage, network routing, abstracting the cloud-specific code from the core Kubernetes code, and managing communication with the underlying cloud services. It may be split into several different containers depending on which cloud platform you are running on, and this enables the cloud vendors’ code and the Kubernetes code to be developed without any interdependency. So, the cloud vendor develops their code and connects it with the Kubernetes Cloud Controller Manager while running Kubernetes.
The various types of cloud controller managers are as follows:
Fig 12: Types Of Cloud Controller Manager
25. What is Container resource monitoring?
For users, it is really important to understand application performance and resource utilization at all the different abstraction layers. Kubernetes factored the management of the cluster by creating abstractions at different levels, such as the container, the pod, the service, and the whole cluster. Each of these levels can now be monitored, and this is what we call container resource monitoring.
The various container resource monitoring tools are as follows:
Fig 13: Container Resource Monitoring Tools
26. What is the difference between a replica set and a replication controller?
Replica Set and Replication Controller do almost the same thing: both ensure that a specified number of pod replicas are running at any given time. The difference lies in the selectors used to match pods. Replica Sets use set-based selectors, while Replication Controllers use equality-based selectors (see the manifest sketch after this list).
- Equality-based selectors: This type of selector allows filtering by label key and value. In layman’s terms, an equality-based selector only matches pods whose label is exactly equal to the given value.
- Example: Suppose your selector says app=nginx; then it will only match pods whose app label equals nginx.
- Set-based selectors: This type of selector allows filtering a key against a set of values. In other words, a set-based selector matches pods whose label value is mentioned in the set.
- Example: Say your selector says app in (nginx, nps, apache); then any pod whose app label equals nginx, nps, or apache will be matched.
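Here is a sketch contrasting the two selector styles, with hypothetical labels (a Replication Controller would instead use a plain equality-based selector such as app: nginx):

```yaml
apiVersion: apps/v1
kind: ReplicaSet
metadata:
  name: web-rs
spec:
  replicas: 3
  selector:
    matchExpressions:                 # set-based selector
      - key: app
        operator: In
        values: [nginx, nps, apache]  # matches any of these values
  template:
    metadata:
      labels:
        app: nginx                    # in the set, so these pods are matched
    spec:
      containers:
        - name: web
          image: nginx:1.25
```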
27. What is a Headless Service?
A headless service is similar to a ‘normal’ service but does not have a cluster IP. This service enables you to reach the pods directly, without the need to access them through a proxy.
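A minimal sketch of a headless Service (names and labels are placeholders); setting clusterIP to None makes cluster DNS return the individual pod IPs instead of a single virtual IP:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: web-headless
spec:
  clusterIP: None     # headless: no virtual IP is allocated
  selector:
    app: web          # DNS resolves to the IPs of these pods
  ports:
    - port: 80
```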
28. What are the best security measures that you can take while using Kubernetes?
The following are the best security measures that you can follow while using Kubernetes:
Fig 14: Best Security Measures
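As one concrete example of such a measure, role-based access control (RBAC) can restrict what a user may do; the sketch below grants a hypothetical user read-only access to pods in one namespace (all names are placeholders):

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  namespace: dev
  name: pod-reader
rules:
  - apiGroups: [""]            # "" refers to the core API group
    resources: ["pods"]
    verbs: ["get", "list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: read-pods
  namespace: dev
subjects:
  - kind: User
    name: jane                 # hypothetical user
    apiGroup: rbac.authorization.k8s.io
roleRef:
  kind: Role
  name: pod-reader
  apiGroup: rbac.authorization.k8s.io
```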
29. What are federated clusters?
Multiple Kubernetes clusters can be managed as a single cluster with the help of federated clusters. So, you can create multiple Kubernetes clusters within a data center/cloud and use federation to control/manage them all in one place.
The federated clusters can achieve this by doing the following two things. Refer to the below diagram.
Fig 15: Federated Clusters
Scenario-Based Kubernetes Interview Questions
Scenario 1: Suppose a company built on a monolithic architecture handles numerous products. As the company expands in today’s scaling industry, its monolithic architecture has started causing problems.
30. How do you think the company can shift from its monolithic architecture to microservices and deploy its services in containers?
As the company’s goal is to shift from its monolithic application to microservices, it can build the pieces one by one, in parallel, and just switch configurations in the background. It can then put each of these newly built microservices on the Kubernetes platform. The company can start by migrating one or two services at a time and monitoring them to make sure everything runs stably. Once it feels everything is going well, it can migrate the rest of the application into the Kubernetes cluster.
Scenario 2: Consider a multinational company with a very distributed system, with a large number of data centers, virtual machines, and many employees working on various tasks.
31. How do you think such a company can manage all its tasks in a consistent way with Kubernetes?
As we all know, I.T. departments launch thousands of containers, with tasks running across numerous nodes around the world in a distributed system.
In such a situation, the company can use something that offers agility, scale-out capability, and DevOps practices for its cloud-based applications.
The company can therefore use Kubernetes to customize its scheduling architecture and support multiple container formats. This makes it possible to define affinity between container tasks, which gives greater efficiency, along with extensive support for various container networking solutions and container storage.
Scenario 3: Consider a situation where a company wants to increase its efficiency and the speed of its technical operations while maintaining minimal costs.
32. How do you think the company will try to achieve this?
The company can implement the DevOps methodology by building a CI/CD pipeline, but one problem that may occur here is that the configurations may take time to get up and running. So, after implementing the CI/CD pipeline, the company’s next step should be to move to a cloud environment. Once it starts working in the cloud, it can schedule containers on a cluster and orchestrate them with the help of Kubernetes. This kind of approach will help the company reduce its deployment time and also move faster across its various environments.
Scenario 4: Suppose a company wants to revise its deployment methods and wants to build a platform that is much more scalable and responsive.
33. How do you think this company can achieve this to satisfy its customers?
In order to give millions of clients the digital experience they expect, the company needs a platform that is scalable and responsive, so that data reaches the clients’ websites quickly. To do this, the company should move from its private data centers (if it is using any) to a cloud environment such as AWS. Not only that, it should also implement a microservice architecture so that it can start using Docker containers. Once the base framework is ready, it can start using the best orchestration platform available, i.e., Kubernetes. This would enable the teams to be autonomous in building applications and delivering them very quickly.
Scenario 5: Consider a multinational company with a very distributed system, looking forward to solving the monolithic code base problem.
34. How do you think the company can solve its problem?
Well, to solve the problem, the company can shift its monolithic code base to a microservice design, and each microservice can then be treated as a container. All these containers can be deployed and orchestrated with the help of Kubernetes.
Scenario 6: All of us know that the shift from monolithic to microservices solves the problem from the development side, but increases the problem on the deployment side.
35. How can the company solve the problem on the deployment side?
The team can experiment with container orchestration platforms such as Kubernetes and run them in its data centers. With this, the company can generate a templated application and deploy it within five minutes, with actual instances containerized in the staging environment at that point. Such a Kubernetes project will have dozens of microservices running in parallel to improve the production rate; even if a node goes down, the pods on it can be rescheduled immediately without performance impact.
Scenario 7: Suppose a company wants to optimize the distribution of its workloads, by adopting new technologies.
36. How can the company achieve this distribution of resources efficiently?
The solution to this problem is none other than Kubernetes. Kubernetes makes sure that resources are used efficiently, and that only the resources needed by a particular application are consumed. So, by using the best container orchestration tool, the company can distribute its resources efficiently.
Scenario 8: Consider a carpooling company that wants to increase its number of servers by simultaneously scaling its platform.
37. How do you think the company will deal with the servers and their installation?
The company can adopt the concept of containerization. Once it deploys all of its applications into containers, it can use Kubernetes for orchestration and use container monitoring tools like Prometheus to monitor what happens in the containers. Such usage of containers gives the company better capacity planning in the data center, because it will now have fewer constraints due to this abstraction between the services and the hardware they run on.
Scenario 9: Consider a scenario where a company wants to provide all the required hand-outs to its customers having various environments.
38. How do you think they can achieve this critical target in a dynamic manner?
The company can use Docker environments and put together a cross-sectional team to build a web application using Kubernetes. This kind of framework helps the company get the required things into production within the shortest time frame. With such a machine running, the company can provide hand-outs to all its customers across their various environments.
Scenario 10: Suppose a company wants to run various workloads on different cloud infrastructures from bare metal to a public cloud.
39. How will the company achieve this in the presence of different interfaces?
The company can decompose its infrastructure into microservices and then adopt Kubernetes. This will let the company run various workloads on different cloud infrastructures.
Multiple-Choice Interview Questions
40. What are minions in the Kubernetes cluster?
- They are components of the master node.
- They are the work-horse / worker node of the cluster.[Ans]
- They are monitoring engines used widely in Kubernetes.
- They are docker container services.
41. Kubernetes cluster data is stored in which of the following?
- Kube-apiserver
- Kubelet
- Etcd[Ans]
- None of the above
42. Which of them is a Kubernetes Controller?
- ReplicaSet
- Deployment
- Rolling Updates
- Both ReplicaSet and Deployment[Ans]
43. Which of the following are core Kubernetes objects?
- Pods
- Services
- Volumes
- All of the above[Ans]
44. The Kubernetes Network proxy runs on which node?
- Master Node
- Worker Node
- All the nodes[Ans]
- None of the above
45. What are the responsibilities of a node controller?
- To assign a CIDR block to the nodes
- To maintain the list of nodes
- To monitor the health of the nodes
- All of the above[Ans]
46. What are the responsibilities of the Replication Controller?
- Update or delete multiple pods with a single command
- Helps to achieve the desired state
- Creates a new pod, if the existing pod crashes
- All of the above[Ans]
47. How to define a service without a selector?
- Specify the external name[Ans]
- Specify an endpoint with IP Address and port
- Just by specifying the IP address
- Specifying the label and api-version
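For reference, besides the ExternalName approach, another common pattern is to create the Service without a selector and back it with a manually created Endpoints object; a sketch with hypothetical names and an example external address:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: external-db    # no selector: Kubernetes creates no Endpoints itself
spec:
  ports:
    - port: 5432
---
apiVersion: v1
kind: Endpoints
metadata:
  name: external-db    # must match the Service name
subsets:
  - addresses:
      - ip: 192.0.2.10 # hypothetical external IP
    ports:
      - port: 5432
```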
48. What did the 1.8 version of Kubernetes introduce?
- Taints and Tolerations[Ans]
- Cluster level Logging
- Secrets
- Federated Clusters
49. The handler invoked by the kubelet to check whether a given port on the container’s IP address is open is?
- HTTPGetAction
- ExecAction
- TCPSocketAction[Ans]
- None of the above
50. What are the features of Kubernetes?
The features of Kubernetes are as follows:
Fig 4: Features Of Kubernetes