Kubesimplify

Perform CRUD Operations on Kubernetes Using Golang

Kunal Verma — Wed, 17 Apr 2024 11:30:47 GMT

In a previous article, we learned that Kubernetes essentially is an API under-the-hood and every action you take within a Kubernetes cluster, be it the creation of pods or the monitoring of services, boils down to interactions with its API.

We learned that there are three different ways of interacting with the API:

via kubectl
via simple HTTP requests with curl
using Client Libraries

In our previous discussion, we particularly focused on accessing the Kubernetes API by making simple HTTP requests using the curl command - which is a practical and a very beginner-friendly way of understanding the API mechanics.

Now, in this article well be taking that concept to a next level and focus on accessing the Kubernetes API programmatically i.e via client libraries.

The aim here is to provide you a step-by-step guide on performing basic CRUD operations (create, read, update and delete) on a Kubernetes resource, using Go as the programming language.

Getting Started - Understanding the Basics

Before we dive into the demo itself, lets ensure we have the basics firmly in place!

Familiarity with Kubernetes API Concepts

Throughout the guide, we'll use various terminologies associated with the core Kubernetes API concepts, which were, very conveniently covered in the previous blog(and that too, in-depth). If you haven't already, I highly recommend checking that out before moving on with this one.

Importance of Using Client Libraries

One may ask this question - Why make things more complicated? Why not just stick to the basic HTTP requests instead of using a client library?

Thats a valid question to consider and and there are several reasons to why learning and using a client library is essential:

Abstraction of Complexity - Client libraries abstract away the complexities of direct HTTP communication, offering a simpler way to interact with API.
Error Handling - These libraries typically come with built-in error handling mechanisms. Thus, simplifying the process of dealing with errors and failures when interacting with API (which is certainly important, right?).
Efficiency - These libraries often provide optimized methods for common tasks, reducing the amount of code needed and improving overall efficiency. (well get a better idea of this in the upcoming sections)
Community Support - Popular client libraries have a strong community of users who contribute to documentation, provide support, and share best practices - which in turn makes it easier to learn and troubleshoot any issues that may arise during application development.

Exploring `Client-go`

As mentioned previously, well be using the Go programming language to perform basic CRUD operations on a Kubernetes resource via the client library.

Now, the official Go client library used for interacting with Kubernetes clusters is called client-go. It provides a set of functions and structures needed to interact with Kubernetes API programmatically, allowing developers to manage resources such as pods, services, deployments, and much more.

The purpose is simple - to simplify the development of Kubernetes-related applications. It does this by abstracting away the complexities of working directly with Kubernetes API, providing a more user-friendly interface for Go developers.

One may ask this question here - So, we dont interact with the API when usingclient-go?

Thats partially correct! When using client-go, we don't interact directly with the Kubernetes API endpoints. Instead, client-go provides a layer of abstraction between the us (the developers) and the low-level details of making HTTP requests to those endpoints.

It provides all the necessary set of functions and data structures that developers can use to perform actions on Kubernetes resources (like pods, services, deployments, etc.) without needing to handle the HTTP communication themselves.

Demo - CRUD Operations on Pod

To keep things simple, we'll be performing the basic CRUD operations on a Pod:

Creating a Pod.
Retrieving all the current Pods in the cluster.
Updating an existing Pod.
Deleting an existing Pod.

Prerequisites

Before we begin with the development, here are a few things youll need:

kubectl installed
Go installed (latest version)
A Kubernetes cluster (well be using minikube for this tutorial, but feel free to choose any other tool)

Step 1 - Creating a Kubernetes Cluster

Here, well use minikube to bootstrap a single node Kubernetes cluster using the following command:

$ minikube start😄  minikube v1.32.0 on Darwin 14.4 (arm64)  Using the docker driver based on existing profile👍  Starting control plane node minikube in cluster minikube🚜  Pulling base image ...🔥  Creating docker container (CPUs=2, Memory=7792MB) ...🐳  Preparing Kubernetes v1.28.3 on Docker 24.0.7 ...🔗  Configuring bridge CNI (Container Networking Interface) ...🔎  Verifying Kubernetes components...     Using image gcr.io/k8s-minikube/storage-provisioner:v5🌟  Enabled addons: storage-provisioner, default-storageclass...

As the cluster creation process finishes, use the following command to check the cluster information:

$ kubectl cluster-infoKubernetes control plane is running at https://127.0.0.1:52016CoreDNS is running at https://127.0.0.1:52016/api/v1/namespaces/kube-system/services/kube-dns:dns/proxy

Step 2 - Initial Project Setup

In this step, well be doing the following things:

Set up the Go project environment
Install the k8s.io/client-go module

Let us start by creating a new directory for our project and initializing a new Go module (go.mod):

$ mkdir k8s-crud-demo$ cd k8s-crud-demo$ go mod init github.com/USERNAME/k8s-crud

Now, we can install the k8s.io/client-go module using the following command:

go get k8s.io/client-go@latest

This will install the latest version of the k8s.io/client-go module, which includes all the necessary packages needed to interact with the Kubernetes API.

Step 3 - Create a new Kubernetes Client

Before we perform any operations on an existing Kubernetes cluster, we first need to create a new client.

Even if we are using a client library in this scenario, it all boils down to the basic client-server communication and we understood in the previous blog post that, only an authenticated client can make requests to the Kubernetes API.

Therefore, regardless of whether were using a client library like client-go or not, we need to establish an authenticated connection with the Kubernetes API server.

Thankfully, the process here is much simpler than what we did while making HTTP requests, as well directly be using the existing kubeconfig file to get the cluster info and create a new client from that.

Get the location of the kubeconfig file from the system and store that in a variable:

 home, _ := os.UserHomeDir() kubeConfigPath := filepath.Join(home, ".kube/config")

Next, well use the BuildConfigFromFlags() method from the k8s.io/client-go/tools/clientcmd package to create a new client configuration based on the provided kubeconfig file:
```
 config, err := clientcmd.BuildConfigFromFlags("", kubeConfigPath) if err != nil {     panic(err.Error()) }
```
At last, well use this configuration to create a new client, using the k8s.io/client-go/kubernetes package:
```
 client := kubernetes.NewForConfigOrDie(config)
```

Step 4 - Retrieving All the Current Pods

Let us start with the most basic operation - reading and listing down all the current running pods in our Kubernetes cluster.

📍 Note
Before building the logic for this operation, make sure you already have a few pods running in your newly created cluster to see some output in the end.
You can use kubectl in this case to do so:
$ kubectl run demo --image=nginxpod/demo created
In this scenario, I have the following pods running in my cluster:
$ kubectl get podsNAME                           READY   STATUS    RESTARTS     AGEdemo-crud55wwk                 1/1     Running   1 (6d ago)   6d1hdemo-nginx                     1/1     Running   1 (6d ago)   6d1hgo-api-2mwpl                   1/1     Running   2 (6d ago)   6d1htest-deploy-859f95ffcc-8p8t8   1/1     Running   8 (6d ago)   18dtest-deploy-859f95ffcc-fcdld   1/1     Running   8 (6d ago)   18d

For the logic of retrieving all the running pods in a cluster, paste the following code snippet in your main.go file:

// define the namespacenamespace := "default"// get the Pod interface (easy for later use)podsClient := client.CoreV1().Pods(namespace)// read all podspods, err := podsClient.List(context.TODO(), metav1.ListOptions{})if err != nil {    panic(err.Error())}fmt.Printf("There are %d pods in the cluster\n", len(pods.Items))// loop through pod list to get namesfor i, pod := range pods.Items {    fmt.Printf("Name of %dth pod: %s\n", i, pod.Name)}

A breakdown of the core logic being used is as follows:

client.CoreV1().Pods(namespace) - In the previous article, we covered that the Kubernetes API resources are divided into different API Groups and versions. Now, Pod (a K8s resource) is found under the core group, having v1 as the version.

So, in this line we are first calling the CoreV1() function from the k8s.io/client-go/kubernetes package which returns a CoreV1Interface interface, which is a collection of some embedded interfaces of all the Kubernetes resources that fall under the core v1 API group:

  type CoreV1Interface interface {      RESTClient() rest.Interface      ComponentStatusesGetter      ConfigMapsGetter      EndpointsGetter      EventsGetter      LimitRangesGetter      NamespacesGetter      NodesGetter      PersistentVolumesGetter      PersistentVolumeClaimsGetter      PodsGetter      ...  }

From here, we are then using the Pods() method included in the PodsGetter interface, which in turn returns the PodInterface interface, which is a collection of methods to work with the Pod resource:

  type PodInterface interface {      Create(ctx context.Context, pod *v1.Pod, opts metav1.CreateOptions) (*v1.Pod, error)      Update(ctx context.Context, pod *v1.Pod, opts metav1.UpdateOptions) (*v1.Pod, error)      UpdateStatus(ctx context.Context, pod *v1.Pod, opts metav1.UpdateOptions) (*v1.Pod, error)      Delete(ctx context.Context, name string, opts metav1.DeleteOptions) error      DeleteCollection(ctx context.Context, opts metav1.DeleteOptions, listOpts metav1.ListOptions) error      Get(ctx context.Context, name string, opts metav1.GetOptions) (*v1.Pod, error)      ...  }

📍 Note
Going forward, well be using the methods listed in the PodInterface interface to perform all the CRUD operations.

podsClient.List() - This ones simple to understand! Here, we use the List() method defined under the PodInterface interface, which returns a PodList struct. The PodList struct represents a list of Kubernetes pods.

  type PodList struct {      metav1.TypeMeta `json:",inline"`      // Standard list metadata.      // More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#types-kinds      // +optional      metav1.ListMeta `json:"metadata,omitempty" protobuf:"bytes,1,opt,name=metadata"`      // List of pods.      // More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md      Items []Pod `json:"items" protobuf:"bytes,2,rep,name=items"`  }

The Items field represents a slice of Pod objects and that is what we accessed using pods.Items in the next set of lines.

Heres the complete code for the read operation, along with the output after execution:

package mainimport (    "context"    "fmt"    "os"    "path/filepath"    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"    "k8s.io/client-go/kubernetes"    "k8s.io/client-go/tools/clientcmd")func main() {    // get kubeconfig    home, _ := os.UserHomeDir()    kubeConfigPath := filepath.Join(home, ".kube/config")    // use the current context in kubeconfig    config, err := clientcmd.BuildConfigFromFlags("", kubeConfigPath)    if err != nil {        panic(err.Error())    }    // create a new client    client := kubernetes.NewForConfigOrDie(config)    // define the namespace    namespace := "default"    // define the pods client (easy for later use)    podsClient := client.CoreV1().Pods(namespace)    // read all pods    pods, err := podsClient.List(context.TODO(), metav1.ListOptions{})    if err != nil {        panic(err.Error())    }    fmt.Printf("There are %d pods in the cluster\n", len(pods.Items))    for i, pod := range pods.Items {        fmt.Printf("Name of %dth pod: %s\n", i, pod.Name)    }}

💡 If youre a beginner in the Go programming language, one thing you certainly might have noticed is, a lot of the parts of the client-go library that well be using, are interconnected in some way.
Interfaces contains some other interfaces, which may contain some different types or structs, which may contain some methods and so on.
A nice hack to see the interface/struct/method signatures in order to understand their connections, is to use cmd + click or win + click feature in editors like VSCode, which leads you to that specific interface/struct/method and then, you can understand how its all connected.

Step 5 - Create a Pod

Now, when it comes to creating a new Pod, there are two main things we need to define:

A pod definition - giving details such as Pod name, container name, container image etc.
Creating the Pod using the specified Pod definition.

Below is the code snippet to define a new pod definition:

    podDefintion := &v1.Pod{        ObjectMeta: metav1.ObjectMeta{            GenerateName: "demo-k8s-",            Namespace:    "default",        },        Spec: v1.PodSpec{            Containers: []v1.Container{                {                    Name:  "nginx-container",                    Image: "nginx:latest",                },            },        },    }

A breakdown of the core logic being used is as follows:

&v1.Pod{} - here, we are accessing the Pod struct which is from the k8s.io/api/core/v1 package. The signature of the struct looks like this:

  type Pod struct {      metav1.TypeMeta `json:",inline"`      // Standard object's metadata.      // More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#metadata      // +optional      metav1.ObjectMeta `json:"metadata,omitempty" protobuf:"bytes,1,opt,name=metadata"`      // Specification of the desired behavior of the pod.      // More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status      // +optional      Spec PodSpec `json:"spec,omitempty" protobuf:"bytes,2,opt,name=spec"`      // Most recently observed status of the pod.      // This data may not be up to date.      // Populated by the system.      // Read-only.      // More info: https://git.k8s.io/community/contributors/devel/sig-architecture/api-conventions.md#spec-and-status      // +optional      Status PodStatus `json:"status,omitempty" protobuf:"bytes,3,opt,name=status"`  }

Its important to note that, the fields mentioned in this struct are themselves structs, which have their own fields and those are the ones which we are actually using.

For instance, the ObjectMeta struct from the metav1 package contains the following fields, which we have used in our implementation:

  type ObjectMeta struct {      GenerateName string `json:"generateName,omitempty" protobuf:"bytes,2,opt,name=generateName"`      Namespace string `json:"namespace,omitempty" protobuf:"bytes,3,opt,name=namespace"`      ...  }

After defining the pod definition, well use the Create() method (from the PodInterface - learned above) to create a new Pod, based on the pod definition:

newPod, err := podsClient.Create(context.TODO(), podDefintion, metav1.CreateOptions{})if err != nil {    panic(err.Error())}fmt.Printf("Pod '%s' is created!", newPod.Name)

Heres the complete code for the create operation, along with the output after execution:

package mainimport (    "context"    "fmt"    "os"    "path/filepath"    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"    "k8s.io/client-go/kubernetes"    "k8s.io/client-go/tools/clientcmd")func main() {    // get kubeconfig    home, _ := os.UserHomeDir()    kubeConfigPath := filepath.Join(home, ".kube/config")    // use the current context in kubeconfig    config, err := clientcmd.BuildConfigFromFlags("", kubeConfigPath)    if err != nil {        panic(err.Error())    }    // create a new client    client := kubernetes.NewForConfigOrDie(config)    // define the namespace    namespace := "default"    // define the pods client (easy for later use)    podsClient := client.CoreV1().Pods(namespace)    // create a pod defintion    podDefintion := &v1.Pod{        ObjectMeta: metav1.ObjectMeta{            GenerateName: "demo-k8s-",            Namespace:    "default",        },        Spec: v1.PodSpec{            Containers: []v1.Container{                {                    Name:  "nginx-container",                    Image: "nginx:latest",                },            },        },    }    // create a new pod    newPod, err := podsClient.Create(context.TODO(), podDefintion, metav1.CreateOptions{})    if err != nil {        panic(err.Error())    }    fmt.Printf("Pod '%s' is created!", newPod.Name)}

Step 6 - Update an Existing Pod

Alright, let us say - we wish to change the container image version of the new pod we create in the above step i.e. demo-k8s-7p7w9 (in my case).

The current container image being used by the pod can be found using the following command:

$ kubectl describe pod demo-k8s-7p7w9...Containers:  nginx-container:    Container ID:   docker://f6a20de83befe78916136b425b7354fcc09bc6436de06efb7abb9fa25b260998    Image:          nginx:latest    Image ID:       docker-pullable://nginx@sha256:6db391d1c0cfb30588ba0bf72ea999404f2764febf0f1f196acd5867ac7efa7e    Port:               Host Port:          State:          Running...

Below is the code snippet to update the image version of this specific pod:

    fmt.Println("Updating pod...")    retryErr := retry.RetryOnConflict(retry.DefaultRetry, func() error {        // retrive the latest pod        currentPod, updateErr := podsClient.Get(context.TODO(), "demo-k8s-7p7w9", metav1.GetOptions{})        if updateErr != nil {            panic(updateErr.Error())        }        // change container image        currentPod.Spec.Containers[0].Image = "nginx:1.25.4"        // update pod        updatedPod, updateErr := podsClient.Update(context.TODO(), currentPod, metav1.UpdateOptions{})        fmt.Printf("Updated pod: %s", updatedPod.Name)        return updateErr    })    if retryErr != nil {        panic(retryErr.Error())    }

A breakdown of the core logic being used is as follows:

Here, we are mainly using the Get() and the Update() method from the PodInterface interface to first get the information about that specific pod, and then update the container image field with a new image version - in this case, from nginx:latest to nginx:1.25.4.
retry.RetryOnConflict() - This ones interesting, because we are enclosing the entire update operation inside this.
Here, we are using the RetryOnConflict() method from the k8s.io/client-go/util/retry package, which is designed to handle conflicts that may occur when attempting to update a Kubernetes resource.
In a distributed system like Kubernetes, conflicts can arise when multiple clients attempt to modify the same resource simultaneously. In that case, the RetryOnConflict() function implements a retry mechanism that retries the provided operation (in this case, the pod update operation) if a conflict error occurs.
You can find more details about the retry package in the documentation.

Heres the complete code for the update operation, along with the output after execution:

package mainimport (    "context"    "fmt"    "os"    "path/filepath"    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"    "k8s.io/client-go/kubernetes"    "k8s.io/client-go/tools/clientcmd"    "k8s.io/client-go/util/retry")func main() {    // get kubeconfig    home, _ := os.UserHomeDir()    kubeConfigPath := filepath.Join(home, ".kube/config")    // use the current context in kubeconfig    config, err := clientcmd.BuildConfigFromFlags("", kubeConfigPath)    if err != nil {        panic(err.Error())    }    // create a new client    client := kubernetes.NewForConfigOrDie(config)    // define the namespace    namespace := "default"    // define the pods client (easy for later use)    podsClient := client.CoreV1().Pods(namespace)    // update a pod    fmt.Println("Updating pod...")    retryErr := retry.RetryOnConflict(retry.DefaultRetry, func() error {        // retrive the latest pod        currentPod, updateErr := podsClient.Get(context.TODO(), "demo-k8s-7p7w9", metav1.GetOptions{})        if updateErr != nil {            panic(updateErr.Error())        }        // change container image        currentPod.Spec.Containers[0].Image = "nginx:1.25.4"        // update pod        updatedPod, updateErr := podsClient.Update(context.TODO(), currentPod, metav1.UpdateOptions{})        fmt.Printf("Updated pod: %s", updatedPod.Name)        return updateErr    })    if retryErr != nil {        panic(retryErr.Error())    }}

Step 7 - Delete an Existing Pod

Its time for a cleanup and now we wish to delete the pod we created above i.e. demo-k8s-7p7w9.

Below is the code snippet to delete the specified pod:

    deleteErr := podsClient.Delete(context.TODO(), "demo-k8s-7p7w9", metav1.DeleteOptions{})    if deleteErr != nil {        panic(deleteErr.Error())    }

This ones simple, as we are just using the Delete() method from the PodInterface interface to delete a specific pod.

Heres the complete code for the delete operation, along with the output after execution:

package mainimport (    "context"    "os"    "path/filepath"    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"    "k8s.io/client-go/kubernetes"    "k8s.io/client-go/tools/clientcmd")func main() {    // get kubeconfig    home, _ := os.UserHomeDir()    kubeConfigPath := filepath.Join(home, ".kube/config")    // use the current context in kubeconfig    config, err := clientcmd.BuildConfigFromFlags("", kubeConfigPath)    if err != nil {        panic(err.Error())    }    // create a new client    client := kubernetes.NewForConfigOrDie(config)    // define the namespace    namespace := "default"    // define the pods client (easy for later use)    podsClient := client.CoreV1().Pods(namespace)    // delete a pod    deleteErr := podsClient.Delete(context.TODO(), "demo-k8s-7p7w9", metav1.DeleteOptions{})    if deleteErr != nil {        panic(deleteErr.Error())    }}

Additional Configurations Options In `Client-go`

Here are a few additional and useful configurations, which are good to know!

Alternate Way to Kubeconfig Setup

In the initial step, we set the kubeconfig file location be the default location which is in - ${HOME}/.kube/config.

Theres an alternate way of configuring this step, wherein we can use the flag -kubeconfig, to set a custom location for the kubeconfig file to be used.

Below is the code snippet to set the -kubeconfig flag:

var kubeconfig *stringif home := homedir.HomeDir(); home != "" {    kubeconfig = flag.String("kubeconfig", filepath.Join(home, ".kube", "config"), "(optional) absolute path to the kubeconfig file")} else {    kubeconfig = flag.String("kubeconfig", "", "absolute path to the kubeconfig file")}flag.Parse()config, err := clientcmd.BuildConfigFromFlags("", *kubeconfig)if err != nil {    panic(err)}

A breakdown of the important concepts is as follows:

homedir.HomeDir() - Here, we are using the HomeDir() method from the k8s.io/client-go/util/homedir package to fetch the users home location.
We are using the flag package to define a new flag kubeconfig, that takes in a string input.

Now, if you wish to give a custom location of the kubeconfig file to use, it can be set as follows:

$ go run read.go --kubeconfig="/Users/kunalverma/Desktop/config"There are 5 pods in the clusterName of 0th pod: demo-crud55wwkName of 1th pod: demo-nginxName of 2th pod: go-api-2mwplName of 3th pod: test-deploy-859f95ffcc-8p8t8Name of 4th pod: test-deploy-859f95ffcc-fcdld

Alternate Way to Create a New Client

When it comes to the creating a new client using the config, the client-go module offers two ways to do so:

Using kubernetes.NewForConfigOrDie(config) - This is what we have used in the demo above.
Using kubernetes.NewForConfig(config)

The major difference between these two approaches is the way these handle errors.

NewForConfigOrDie() automatically takes care of any errors by panicking if there is an error in the config. Whereas, in NewForConfig() we need to handle the error explicitly, as shown below:

client, err := kubernetes.NewForConfig(config)if err != nil{    errors.New("Error in Config")}

Conclusion

In this practical guide, we covered the essentials of Kubernetes development using client-go in Go. From setting up our environment to performing CRUD operations on Pod, we've certainly gained some valuable insights.

These fundamentals set the right stage for you to navigate Kubernetes development with confidence and build robust applications with ease.

Well certainly be building some cool projects using the client libraries in the near future, so be sure to follow Kubesimplify for more such content.

Here's a detailed video on this topic

https://youtu.be/liwZF_0I8Ks

Happy Learning!

Resources

Follow Kubesimplify on Hashnode, Twitter/X and LinkedIn. Join our Discord server to learn with us!

Optimizing Scalability: A Deep Dive into Load Testing with Locust on EKS

Anshu Kumar — Mon, 15 Apr 2024 11:30:23 GMT

Introduction

This article explores strategies for optimizing scalability using Locust for load testing on Amazon EKS. We'll delve into scaling a Node.js app using Kubernetes' HPA and Cluster Autoscaler based on the load generated by Locust workers. The aim is to provide practical insights into ensuring applications can efficiently handle increasing user loads.

Prerequisites

Ensure you have AWS CLI configured with appropriate permissions and Terraform installed locally.
A basic understanding of AWS services, Terraform, Kubernetes concepts, Horizontal Pod Autoscaling (HPA) principles, and familiarity with the Kubernetes Cluster Autoscaler is required.

Understanding Horizontal Pod Autoscaler

Horizontal Pod Autoscaler(HPA) automatically adjusts the number of replica pods in a deployment or replication controller based on observed CPU utilization or other custom metrics. This ensures the application has sufficient resources to handle varying loads, thus improving performance and scalability.

Introduction to Locust

Locust is an open-source load-testing tool that allows you to define user behavior with Python code. It simulates thousands of concurrent users hitting your application, making it an excellent choice for load testing in Kubernetes environments.

Now that we've covered the prerequisites, let's proceed with setting up two EKS clusters: one for our App and another for Locust. We'll use Terraform's official modules, configure worker node autoscaling policies, enable Horizontal Pod Autoscaling (HPA) through the Metric Server, and integrate the Cluster Autoscaler for dynamic scaling of cluster nodes based on resource utilization.

Create VPC and EKS using the Terraform module

GitHub

Apply the terraform command.

Add a policy to the worker node role.

  {    "Version": "2012-10-17",    "Statement": [      {        "Effect": "Allow",        "Action": [          "autoscaling:DescribeAutoScalingGroups",          "autoscaling:DescribeAutoScalingInstances",          "autoscaling:DescribeLaunchConfigurations",          "autoscaling:DescribeScalingActivities",          "autoscaling:DescribeTags",          "ec2:DescribeImages",          "ec2:DescribeInstanceTypes",          "ec2:DescribeLaunchTemplateVersions",          "ec2:GetInstanceTypesFromInstanceRequirements",          "eks:DescribeNodegroup"        ],        "Resource": ["*"]      },      {        "Effect": "Allow",        "Action": [          "autoscaling:SetDesiredCapacity",          "autoscaling:TerminateInstanceInAutoScalingGroup"        ],        "Resource": ["*"]      }    ]  }

Apply cluster autoscaler yaml after updating the cluster-name and image version in the deployment section to match your Kubernetes version.
Github

Add the metric server for HPA to gather data.

  kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml

We need to create HPA for both locust and app after deploying them.
kubectl autoscale deployment --cpu-percent=50 --min=1 --max=10

Install monitoring components on the cluster

Run the following commands:

helm repo add prometheus-community https://prometheus-community.github.io/helm-chartshelm repo updatekubectl create ns monitoringhelm install monitoring prometheus-community/kube-prometheus-stack -n monitoring

Edit the service of Prometheus and Grafana from ClusterIP to LoadBalancer to access the UI.

Prometheus doesn't require an initial password, for Grafana we can run

  kubectl get secret --namespace monitoring monitoring-grafana -o jsonpath="{.data.admin-password}" | base64 --decode ; echo

Deploy sample app

GitHub

After applying the manifests, access the app using the LoadBalancer IP.

Configuration File for Application Monitoring

  apiVersion: monitoring.coreos.com/v1  kind: ServiceMonitor  metadata:    name: monitoring-node-app    labels:      release: monitoring      app: nodeapp  spec:    endpoints:    - path: /metrics      port: service      targetPort: 3000    namespaceSelector:      matchNames:      - nodeapp # namespace in which app is deployed    selector:      matchLabels:        app: nodeapp

After applying the above YAML, add metrics in Grafana to view the dashboard.
In Prometheus UI search for http_request_operations_total (for total request generated) and sum(rate(http_request_operations_total[15m])) (for request/second).
In Grafana UI create a new dashboard by adding the above two metrics.

Deploy Locust

GitHub

Update the Configmap provided in the GitHub repo before applying it. Add the actual URL of the app in the task section so that Locust can send traffic to the right place. Locust's master service is configured as LoadBalancer type to access the UI.
Locust's master service is configured as LoadBalancer type to access the UI.

Demo

Access the UI of locust and Run the Test

Access the Locust UI to configure and initiate the load test. Define the behavior of simulated users, such as the number of users, the rate of requests, and specific endpoints to target.

Grafana Dashboard

This metric indicates the rate at which Locust generates requests to your Node.js app, showcasing the simulated load.

This metric presents the total number of requests sent by Locust to your application, offering a comprehensive view of the applied load.

Observe the cluster's resource utilization metrics, including CPU and memory usage, and witness the dynamic scaling of the cluster in response to increased load.

Allow it to run for a few minutes, during which time you can switch between the "Statistics" tab and the "Charts" tab to observe the progress of the test.

We can observe that 69 Locust workers have been created using HPA, to generate load on our app.

Observation: Cluster Scaling

HPA

When the workload on your Node.js application surpasses the defined threshold, which is determined by CPU utilization, the Horizontal Pod Autoscaler (HPA) takes action. It dynamically adjusts the number of pods running your application to match the current demand. This means that if your application experiences higher traffic or processes more requests, HPA will trigger the creation of a new pod to handle the additional load.

Pending State

After HPA initiates the creation of a new pod, the Kubernetes scheduler tries to assign the pod to an available node within your cluster. However, if there aren't enough resources (such as CPU, memory, or disk space) on the existing nodes to accommodate the new pod, it enters a "pending" state. This indicates that the pod is waiting for resources to become available before it can start running.

Automatic Node Creation by Cluster Autoscaler

Recognizing that the pending pod requires additional resources that cannot be met by the existing nodes, the Cluster Autoscaler takes action. It monitors the resource utilization across your cluster and identifies the need for more computing capacity. In response to this demand, the Cluster Autoscaler automatically provisions a new worker node (virtual machine) within your Kubernetes cluster.

Transition to Running State for Pods

Once the new worker node is provisioned and ready to accept pods, the pending pod transitions from the "pending" state to the "running" state, this signifies that the pod is now actively serving requests and contributing to handling the increased load on your application. With the new pod running and workload distributed across multiple pods, your application can effectively manage the surge in traffic without compromising performance or availability.

Conclusion

In summary, this detailed exploration of load testing with Locust on Amazon EKS focused on optimizing scalability for a Node.js application using Kubernetes' HPA and Cluster Autoscaler. Key steps included setting up EKS clusters, implementing worker node autoscaling policies, enabling HPA, integrating the Cluster Autoscaler, and deploying monitoring components like Prometheus and Grafana. The process showcased how the system dynamically scaled resources in response to increased load, ensuring efficient traffic handling. Overall, this guide offers practical insights for developers and DevOps teams to improve scalability and performance in Kubernetes environments.

Remember to delete all resources after the demo.

Introducing Unikraft - Lightweight Virtualization Using Unikernels

Kunal Verma — Mon, 08 Apr 2024 11:30:40 GMT

Unikraft is a fast, secure and open-source Unikernel Development Kit which enables you to easily build minimal, ultra-lightweight virtual machines.

In practice, it is an alternative to running your application in the cloud. Now, in production environments, it feels no different to managing traditional containers that we all are familiar with, but whats fundamentally different from containers and traditional virtual machines, is the way your application is packaged and executed by Unikraft.

The high level goal of Unikraft is to build customizable and specialized OS images, known as unikernels lightweight, single-purpose operating systems optimized for specific applications. Unlike traditional virtualization solutions such as containers and virtual machines, that come with a wide range of features and functionalities, unikernels are tailored to serve a single task or application, resulting in minimal resource usage and enhanced performance.

Unikraft aims to streamline the process of creating and managing custom unikernels by providing developers with a modular and flexible approach by offering a comprehensive set of tools and libraries, which enables developers to optimize resource utilization, enhance security, and improve scalability across a wide range of use cases.

Understanding Unikernels

According to the official documentation:

Unikernels are specialized, single-address-space machine images constructed by using library operating systems (libOS).

Simply put, Unikernels are lightweight, single-purpose operating systems that are tailored to serve a specific application or task. Unlike traditional operating systems, which include a wide range of features and functionalities, unikernels contain only the necessary components required to support a particular application. This focused approach results in highly efficient and optimized systems, with reduced resource overhead and attack surface.

Evolution from VMs and Containers

Till now we have seen that, in most production systems, often the standard unit of isolation is the virtual machine (VM) since this provides the greatest degree of security for the application(s) enclosed within the isolated environment.

However, it was observed that a fully virtualized traditional VM is too heavy for most applications, which eventually led to the container-based model. Containers become a popular choice for packaging, deploying, and managing applications in cloud-native and microservices architectures due to their efficiency, flexibility, and portability.

Eventually, the evolution of running applications in the cloud led to the practice of running containers within virtual machines (VMs). This approach combines the strengths of both containers and VMs, providing robust isolation and security from VMs, while also benefiting from the flexibility and efficiency of containers.

💡 Kubernetes is an apt example of such deployment model, where the node pools are generally deployed as VMs.
Thus, it has been the de facto orchestration framework for container applications!

In the traditional concept of virtualization, the isolation between different VMs is typically achieved through software-based mechanisms implemented by the hypervisor (a specialized operating system) or the virtualization layer. This means that the hypervisor manages and enforces the isolation boundaries between VMs, using software-based techniques like memory protection, CPU scheduling, and device emulation.

💡 What is Device Emulation?
Simply put, Device emulation is like creating virtual versions of physical devices, such as network cards, storage drives, or graphics cards, within a virtual machine (VM). These virtual devices behave just like the real ones but exist only within the VM's environment.
For example, if the virtual computer wants to send data over the internet, the virtual network card (of the VM) sends the data out of the virtual machine, just like a real network card would. Similarly, if you want to save a file, the virtual storage drive takes care of it within the VM.
Thus, device emulation is essential for virtual machines because it allows them to interact with the outside world, access resources, and perform tasks as if they were physical machines, all while running within another computer's software environment i.e the Host OS.

With unikernels, this isolation is achieved at a lower level, directly by the hardware itself, rather than relying solely on software-based techniques. These hardware extensions provide the necessary support for creating and managing isolated execution environments, allowing unikernels to run directly on the underlying hardware with minimal overhead.

Therefore, by leveraging hardware primitives for isolation, unikernels are able to achieve better performance and efficiency compared to traditional virtualization approaches that rely purely on software-based isolation mechanisms.

This direct hardware-level isolation also contributes to the enhanced security and reliability of unikernels, as it reduces the attack surface and minimizes the impact of potential vulnerabilities in the software stack.

Unikernels v/s Traditional OSes

Apart from it being a modern approach to virtualization, unikernels have several unique characteristics and benefits over traditional operating systems. A few of them are discussed below:

Minimalist Architecture: Unikernels are designed to be extremely lightweight, containing only the essential components needed to run a specific application. This minimalist architecture results in reduced memory footprint, faster boot times, and improved performance.
Enhanced Security: By stripping away unnecessary components, unikernels have a smaller attack surface, making them more secure. Additionally, because they are built to serve the purpose for a single application, unikernels reduce the risk of security vulnerabilities associated with multi-purpose systems such as Linux, Windows, or macOS.
Efficient Resource Utilization: Unikernels focus resources solely on the application's requirements, making them ideal for resource-constrained environments like the cloud or edge devices.

Why Unikraft?

Unikraft is designed to solve the problems with using monolithic OSes and enable developers to create a specialized OS for each application, ensuring optimal performance, security guarantees, and meeting desired Key Performance Indicators (KPIs).

Unikraft adopts several unique design principles to achieve high modularity, enable great performance and security guarantees for your application. Some of them are discussed below:

Library Components - Unikraft offers a modular approach to building unikernels, with library components serving as the core building blocks for applications. These components handle crucial functions like memory management, scheduling, file access, and networking. Developers can easily select and configure these components using Unikraft's intuitive menu-driven interface, inspired by Linux's Kconfig system.
Configurability - Unikraft prioritizes configurability, allowing developers to fine-tune and customize every aspect of the unikernel to meet specific application needs. Drawing inspiration from Linux's Kconfig system, developers can easily select and configure libraries during the build process. This flexibility ensures adaptability to various use cases and environments.
Tooling and Integrations - Unikraft provides a comprehensive suite of tools designed to simplify unikernel creation and management. Leveraging technologies like Go, GNU Make, C, and Kconfig, these tools handle compilation, linking, and image generation tasks effortlessly. This tooling ecosystem empowers developers to build and deploy unikernels across different platforms with ease.

Key Features

Let us have a look at the key features offered by Unikraft across different dimensions:

Performance

Unikraft excels in performance testing, having lightning-fast boot times in milliseconds and minimal memory usage, typically requiring only a few megabytes.

Moreover, Unikraft's modular approach significantly reduces image sizes, with all applications staying under 2MBs. Boot times for the unikernels created with Unikraft range from microseconds to milliseconds, showcasing its efficiency. Memory consumption is minimal, with Unikraft guests needing only 2-6MBs.

Due to all the above factors, application performance is outstanding, with speeds 30%-80% faster than containers and 70%-170% faster than Linux VMs.

This amazing performance, results in an overall reduction in system call costs and optimized memory allocation, making Unikraft an excellent choice for modern computing.

Security

Unikraft ensures top-notch security with its minimal attack surface and robust isolation between applications. By focusing on single-application execution and removing unnecessary components, Unikraft significantly reduces potential vulnerabilities. By supporting safe languages like Rust to implement critical components, it adds an extra layer of enhanced protection.

Plus, Unikraft actively integrates core security features such as ASLR and stack protection. Thus, aligning with industry standards to ensure comprehensive security measures.

Efficiency

In terms of efficiency, Unikraft outperforms traditional monolithic operating systems like Linux. Through practical tests on devices like the Raspberry Pi 3 B+ and the Xilinx Ultra96-V2, Unikraft demonstrates lower power consumption than Alpine Linux and Raspbian OS.

These tests include idle states as well as CPU-intensive tasks like calculating .

Thus, Unikraft's ability to reduce power usage, especially in single-core scenarios (with networking disabled), highlights its efficiency advantage over Linux.

Compatibility

Unikraft prioritizes compatibility by ensuring POSIX and Linux compatibility, allowing seamless migration of existing applications to its deployment model. It incorporates a binary-compatibility layer, which enables the execution of Linux binaries (ELFs) on top of Unikraft.

To achieve this, Unikraft complies with Linux's ABI, providing a broad range of its system call interface currently on x86_64 with plans for extension to AArch64.

Unikraft's application catalog repository includes binary-compatible apps, enabling users to access and develop applications easily. By leveraging an application's native build system with musl C standard library, Unikraft eliminates the need for extensive application porting efforts.

Unikraft's commitment to compatibility extends to support for a wide range of applications and languages, enhancing its deployment potential. Ongoing efforts to increase syscall support aim to further expand Unikraft's ability to seamlessly run mainstream applications!

Potential Use Cases and Applications

With its unique features, the unikernel-based model of Unikraft offers a wide range of potential use cases across various industries.

It excels in scenarios where lightweight, specialized operating systems are needed to optimize performance, security, and efficiency. Some notable examples include cloud computing, edge computing, Internet of Things (IoT) devices, containerized applications, and real-time processing systems.

In cloud computing, Unikraft enables rapid deployment of highly efficient and secure microservices, reducing overhead and resource consumption.

Talking about edge computing, Unikraft's lightweight footprint and fast boot times make it ideal for deploying applications closer to users, improving latency and reliability.

In the IoT sector, Unikraft's small size and tailored configurations enhance device performance while ensuring robust security.

Overall, Unikraft's adaptability and efficiency makes it a valuable tool across a wide range of industries and use cases, empowering developers to build high-performance, secure applications tailored to specific requirements.

Get Started Using Unikraft

Below is a quick start guide for you all, to get started using Unikraft:

Step 1 - Install the `kraft` CLI

To begin, first install the kraft CLI tool, which allows you to easily leverage Unikraft unikernels at every stage of their lifecycle, from construction to production:

curl --proto '=https' --tlsv1.2 -sSf https://get.kraftkit.sh | sh

Step 2 - Using the Application Catalog

The Unikraft application catalog is a collection of applications and examples that are built and packaged to run with Unikraft. The application packages are stored in the Unikraft Application Registry, typically identified by a name similar to those used by DockerHub - unikraft.org/node:18, unikraft.org/python:3.10 , etc.

To list down all the available applications in the registry, use the command below:

$ kraft pkg ls --apps --all --updateTYPE  NAME                     VERSION  FORMAT  PULLED       MANIFEST  INDEX    PLAT           SIZEapp   unikraft.org/base        latest   oci     never        6cef805   3d4c008  qemu/x86_64    1.6 MBapp   unikraft.org/base        latest   oci     never        fbb21c5   3d4c008  fc/x86_64      1.6 MBapp   unikraft.org/caddy       2.7      oci     never        1bcd45f   85d8bba  qemu/x86_64    63 MBapp   unikraft.org/caddy       2.7      oci     never        7804074   85d8bba  fc/x86_64      63 MBapp   unikraft.org/helloworld  latest   oci     never        281e174   addacb0  xen/x86_64     143 kB...

If you wish to know more about the application catalog and how it works, check out the documentation.

Step 3 - Starting an Nginx Server

For this quick demo, let us pull and run the Unikraft nginx image - unikraft.org/nginx:1.15 from the application catalog, to start a new nginx server:

$ kraft run -W -dp 8080:80 unikraft.org/nginx:1.15[+] pulling unikraft.org/nginx:1.15  100% [8.5s]                                                                                                                              i  using arch=arm64 plat=qemuPowered byo.   .o       _ _               __ _Oo   Oo  ___ (_) | __ __  __ _ ' _) :_oO   oO ' _ `| | |/ /  _)' _` | |_|  _)oOo oOO| | | | |   (| | | (_) |  _) :_ OoOoO ._, ._:_:_,\_._,  .__,_:_, \___)                 Telesto 0.16.3~21bf34c

In the above command, we are using the -p flag to map the unikernels port 80 and the host port 8080 (similar to how we do it using docker commands).

Step 4 - Verify the Nginx Unikernel

Use the following command to list all the running unikernels:

$ kraft psNAME               KERNEL                         ARGS                       CREATED         STATUS   MEM  PORTS                 PLATrelaxed_snowflake  oci://unikraft.org/nginx:1.15  -c /nginx/conf/nginx.conf  17 seconds ago  running  64M  0.0.0.0:8080->80/tcp  qemu/arm64

Youll now be able to access the Nginx page at localhost:8080 in your machine!

Conclusion

So what do you think of Unikraft? Be sure to join their discord community if you wish to get involved or have any questions. Feel free to share your feedback about the tool or any features you wish to see in the coming future.

We are definitely looking forward to seeing the full potential of the unikernel model with Unikraft, in the coming future!

Resources

Here are a couple of resources to get you started:

Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us!

Kubernetes on Apple MacBooks (M Series)

Aditya Samant — Mon, 01 Apr 2024 11:30:31 GMT

There are many options to provision a local Kubernetes cluster on your laptop. The most popular ones are minikube, kind, K3s and MicroK8s. These options provide a simple and fast way to get Kubernetes running on your laptop by abstracting the complexities within the Kubernetes control plane.

Kubeadm is a tool that facilitates provisioning Kubernetes clusters on virtual machines. It can provision a multi-node Kubernetes cluster for development or production purposes. It can provision clusters on your local laptop, on-premise cloud or public cloud. A cluster provisioned by kubeadm is a great way for Kubernetes administrators to have a playground to work with. It is also useful for people pursuing the CKA and CKS certifications to practice tasks like cluster upgrade and troubleshooting.

VirtualBox is by far the most popular tool to spin up virtual machines (VMs) on a personal laptop. VirtualBox supports virtualization for x86 and AMD64 CPU architectures.

In 2020, Apple introduced the M series of MacBooks which use the Apple Silicon chip, based on ARM64 CPU architecture. VirtualBox does not have good support for machines based on ARM64 (a developer preview version exists, which cannot be relied on). As the M series MacBooks have gained popularity, it is important to look for an alternative virtualization tool that is tested and certified for ARM64. Enter Multipass by Canonical, a simple virtualization tool that is fully compatible with ARM64 based machines.

This article is a step-by-step walkthrough on how to install a Kubernetes cluster on a MacBook (M series) laptop using the kubeadm tool. It is a simplification of the steps in the official Kubernetes documentation.

Pre-requisites

A MacBook laptop (M series) with minimum 16 GB RAM (recommended).
Multipass by Canonical should be installed as per the instructions for macOS. After installation, verify that you are able to launch a sample Ubuntu instance. Cleanup the instance after verification.
Your account on your MacBook must have admin privileges and be able to use sudo.

Provision the VMs

We will create 3 VMs for our setup as follows:

kubemaster: The controlplane node
kubeworker01: The first worker node
kubeworker02: The second worker node

Each VM will have the following configuration (you can choose to edit it as per your host machine capacity)

Disk space: 10G
Memory 3G
CPUs 2

💡

In Multipass, by default, the IP address allocated to a VM is subject to change after a reboot of the VM. If IP addresses change over reboots, it breaks the Kubernetes cluster. As such, it is imperative that the VMs are provisioned with a static IP address as documented here.

Provisioning the controlplane instance (`kubemaster`)

Launch thekubemaster instance with a manual network

🗒

The values to the --network option need to be passed carefully.

🗒

name=en0: This is the name of the Wi-Fi network on your host machine. To get a list of possible values, use the command multipass networks. mac="52:54:00:4b:ab:cd": A unique and random MAC address that will be allocated to the instance.

multipass launch --disk 10G --memory 3G --cpus 2 --name kubemaster --network name=en0,mode=manual,mac="52:54:00:4b:ab:cd" jammy

You should see the following output:

Launched: kubemaster

Configure the extra interface

The macaddress field should contain the exact MAC address chosen in the multipass launch command.
The addresses field should contain the static IP address that will be allocated to this VM. The static IP address should be in the same subnet as the original IP address of the instance.
The original IP address allocated to the VM can be found by the multipass info kubemaster command as shown below:
multipass info kubemaster | grep IPv4
You should see an output similar to:
IPv4: 192.168.73.7
In this example, the original IP address of the instance is 192.168.73.7. So the static IP address can be chosen as 192.168.73.101

Execute the command shown below

multipass exec -n kubemaster -- sudo bash -c 'cat << EOF > /etc/netplan/10-custom.yamlnetwork:  version: 2  ethernets:    extra0:      dhcp4: no      match:        macaddress: "52:54:00:4b:ab:cd"      addresses: [192.168.73.101/24]EOF'

Apply the new configuration

multipass exec -n kubemaster -- sudo netplan apply

🗒

In case you receive a warning stating that the permissions are too open, please ignore it.

Confirm that it works

multipass info kubemaster | grep IPv4 -A1

You should see an output displaying both the original IP address and the static IP address:

IPv4:           192.168.73.7                192.168.73.101

Let's test the network connectivity using the ping command:
Example:
Original IP of the instance: 192.168.73.7
Static IP of the instance: 192.168.73.101
IP of the host laptop: 192.168.0.2

All the commands below should return a successful output:

# Ping from local to the original IP address of kubemasterping 192.168.73.7# Ping from local to the static IP address of kubemasterping 192.168.73.101# Ping from kubemaster to localmultipass exec -n kubemaster -- ping 192.168.0.2

Provisioning the first worker node (`kubeworker01`)

The MAC address and static IP address chosen must be different from the ones allocated to the kubemaster instance.

Launch thekubeworker01 instance with a manual network

multipass launch --disk 10G --memory 3G --cpus 2 --name kubeworker01 --network name=en0,mode=manual,mac="52:54:00:4b:ba:dc" jammy

Configure the extra interface, similar to the steps performed forkubemaster

multipass exec -n kubeworker01 -- sudo bash -c 'cat << EOF > /etc/netplan/10-custom.yamlnetwork:  version: 2  ethernets:    extra0:      dhcp4: no      match:        macaddress: "52:54:00:4b:ba:dc"      addresses: [192.168.73.102/24]EOF'

Apply the new configuration

multipass exec -n kubeworker01 -- sudo netplan apply

Test using ping similar to the steps followed for kubemaster.
Additionally, test that ping from kubemaster to kubeworker01 and vice versa is working.

# Ping from local to the original IP address of kubeworker01ping 192.168.73.8# Ping from local to the static IP address of kubeworker01ping 192.168.73.102# Ping from kubeworker01 to localmultipass exec -n kubeworker01 -- ping 192.168.0.2# Ping from kubeworker01 to kubemastermultipass exec -n kubeworker01 -- ping 192.168.73.101# Ping from kubemaster to kubeworker01multipass exec -n kubemaster -- ping 192.168.73.102

Provisioning the second worker node (`kubeworker02`)

The MAC address and static IP address chosen must be different from the ones allocated to the kubemaster and kubeworker01 instances.

Launch thekubeworker02 instance with a manual network

multipass launch --disk 10G --memory 3G --cpus 2 --name kubeworker02 --network name=en0,mode=manual,mac="52:54:00:4b:cd:ab" jammy

Configure the extra interface, similar to the steps performed forkubemaster

multipass exec -n kubeworker02 -- sudo bash -c 'cat << EOF > /etc/netplan/10-custom.yamlnetwork:  version: 2  ethernets:    extra0:      dhcp4: no      match:        macaddress: "52:54:00:4b:cd:ab"      addresses: [192.168.73.103/24]EOF'

Apply the new configuration

multipass exec -n kubeworker02 -- sudo netplan apply

Test using ping similar to the steps followed for kubemaster.

Additionally, test that all 3 VMs are able to ping each other successfully through their static IPs.

# Ping from local to the original IP address of kubeworker02ping 192.168.73.9# Ping from local to the static IP address of kubeworker02ping 192.168.73.103# Ping from kubeworker02 to localmultipass exec -n kubeworker02 -- ping 192.168.0.2# Ping from kubeworker02 to kubemastermultipass exec -n kubeworker02 -- ping 192.168.73.101# Ping from kubeworker02 to kubeworker01multipass exec -n kubeworker02 -- ping 192.168.73.102# Ping from kubemaster to kubeworker02multipass exec -n kubemaster -- ping 192.168.73.103# Ping from kubeworker01 to kubeworker02multipass exec -n kubeworker01 -- ping 192.168.73.103

Configure the local DNS

SSH into the machines through three separate terminal tabs by using themultipass shell command

multipass shell kubemastermultipass shell kubeworker01multipass shell kubeworker02

Edit the/etc/hosts file for all 3 VMs

Enter the following configuration in the /etc/hosts file of each VM:

🗒

Use the static IP addresses chosen for each VM instance.

sudo vi /etc/hosts

# 192.168.73.101 kubemaster192.168.73.102 kubeworker01192.168.73.103 kubeworker02

Install Kubernetes

Now that we have a perfect set of VMs up and running, it is time to proceed toward the Kubernetes installation.

Versions

The below versions are used in this lab.

Software / Package	Version	Location
`containerd`	1.7.14	releases
`runc`	1.1.12	releases
CNI plugin	1.4.1	releases
kubeadm	1.29.3	apt-get
kubelet	1.29.3	apt-get
kubectl	1.29.3	apt-get

🗒

All commands mentioned below need to be executed from within the terminal of the VMs.

Install and configure prerequisites

Forwarding IPv4 and letting iptables see bridged traffic

Execute the below set of commands onkubemaster, kubeworker01 and kubeworker02

cat <

Verify that the net.bridge.bridge-nf-call-iptables, net.bridge.bridge-nf-call-ip6tables, and net.ipv4.ip_forward system variables are set to 1 in your sysctl config.

net.bridge.bridge-nf-call-iptables = 1net.bridge.bridge-nf-call-ip6tables = 1net.ipv4.ip_forward = 1

🗒

For all the packages to be installed in this tutorial, ensure to use the arm64 variant only.

`Install a Container Runtime`

You need to install a container runtime into each node in the cluster so that Pods can run there.

`Step 1: Install containerd`

Execute the below commands on all 3 nodes

curl -LO https://github.com/containerd/containerd/releases/download/v1.7.14/containerd-1.7.14-linux-arm64.tar.gzsudo tar Cxzvf /usr/local containerd-1.7.14-linux-arm64.tar.gzcurl -LO https://raw.githubusercontent.com/containerd/containerd/main/containerd.servicesudo mkdir -p /usr/local/lib/systemd/system/sudo mv containerd.service /usr/local/lib/systemd/system/sudo mkdir -p /etc/containerd/sudo containerd config default | sudo tee /etc/containerd/config.toml > /dev/nullsudo sed -i 's/SystemdCgroup \= false/SystemdCgroup \= true/g' /etc/containerd/config.tomlsudo systemctl daemon-reloadsudo systemctl enable --now containerd#Check that containerd service is up and runningsystemctl status containerd

Verify that the output shows the containerd service up and running:

 containerd.service - containerd container runtime     Loaded: loaded (/usr/local/lib/systemd/system/containerd.service; enabled; vendor preset: enabled)     Active: active (running) since Tue 2024-03-26 11:15:20 IST; 5ms ago

`Step 2: Install runc`

Execute the below commands on all 3 nodes

curl -LO https://github.com/opencontainers/runc/releases/download/v1.1.12/runc.arm64sudo install -m 755 runc.arm64 /usr/local/sbin/runc

`Step 3: Install CNI plugins`

Execute the below commands on all 3 nodes

curl -LO https://github.com/containernetworking/plugins/releases/download/v1.4.1/cni-plugins-linux-arm64-v1.4.1.tgzsudo mkdir -p /opt/cni/binsudo tar Cxzvf /opt/cni/bin cni-plugins-linux-arm64-v1.4.1.tgz

`Install kubeadm, kubelet and kubectl`

Execute the below commands on all 3 nodes

sudo apt-get updatesudo apt-get install -y apt-transport-https ca-certificates curl gpgcurl -fsSL https://pkgs.k8s.io/core:/stable:/v1.29/deb/Release.key | sudo gpg --dearmor -o /etc/apt/keyrings/kubernetes-apt-keyring.gpgecho 'deb [signed-by=/etc/apt/keyrings/kubernetes-apt-keyring.gpg] https://pkgs.k8s.io/core:/stable:/v1.29/deb/ /' | sudo tee /etc/apt/sources.list.d/kubernetes.listsudo apt-get updatesudo apt-get install -y kubelet kubeadm kubectlsudo apt-mark hold kubelet kubeadm kubectl

Verify the installation using the below commands:

kubeadm version

kubeadm version: &version.Info{Major:"1", Minor:"29", GitVersion:"v1.29.3", GitCommit:"6813625b7cd706db5bc7388921be03071e1a492d", GitTreeState:"clean", BuildDate:"2024-03-15T00:06:16Z", GoVersion:"go1.21.8", Compiler:"gc", Platform:"linux/arm64"}

kubelet --version

Kubernetes v1.29.3

kubectl version --client

Client Version: v1.29.3Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3

`Configure crictl to work with containerd`

Execute the below commands on all 3 nodes

sudo crictl config runtime-endpoint unix:///var/run/containerd/containerd.sock

`Initializing the controlplane node`

Commands for initializing the controlplane node should be executed on kubemaster only.

Execute the below command onkubemaster

apiserver-advertise-address must be the exact value of the static IP allocated to kubemaster.

sudo kubeadm init --pod-network-cidr=10.244.0.0/16 --apiserver-advertise-address=192.168.73.101

If the command runs successfully, you should see the message 'Your Kubernetes control-plane has initialized successfully!'

💡

Save the entire kubeadm join command, which is printed on the output. This will be used when the worker nodes are ready to be connected to the cluster.

To make kubectl work for your non-root user, execute the below command on kubemaster:

mkdir -p $HOME/.kubesudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/configsudo chown $(id -u):$(id -g) $HOME/.kube/config

Verify that you are able to reach the cluster through kubectl:

Execute the below command onkubemaster

kubectl -n kube-system get pods

🗒

The coredns pods will not be Ready at this stage. This is as expected, as we have not deployed the Pod network add-on yet.

NAME                                 READY   STATUS    RESTARTS      AGEcoredns-76f75df574-269qf             1/1     Pending                coredns-76f75df574-6mcvd             1/1     Pending                etcd-kubemaster                      1/1     Running   0             1m1skube-apiserver-kubemaster            1/1     Running   0             1m1skube-controller-manager-kubemaster   1/1     Running   0             1m1skube-proxy-7qfgq                     1/1     Running   0             1m1skube-scheduler-kubemaster            1/1     Running   0             1m1s

`Install a Pod network add-on`

You must deploy a Container Network Interface (CNI) based Pod network add-on so that your Pods can communicate with each other. Cluster DNS (CoreDNS) will not start up before a network is installed.

A list of all compatible Pod network add-ons can be found here.

In this lab, we will use Weave Net

Execute the below command onkubemaster

kubectl apply -f https://reweave.azurewebsites.net/k8s/v1.28/net.yaml

It will take up to a minute for the weave pod to be ready.

At this point, the controlplane node should be ready with all pods in the kube-system namespace up and running. Please validate this to confirm the sanity of the controlplane.

kubectl -n kube-system get pods

NAME                                 READY   STATUS    RESTARTS      AGEcoredns-76f75df574-269qf             1/1     Running   0             3m16scoredns-76f75df574-6mcvd             1/1     Running   0             3m16setcd-kubemaster                      1/1     Running   0             3m32skube-apiserver-kubemaster            1/1     Running   0             3m32skube-controller-manager-kubemaster   1/1     Running   0             3m32skube-proxy-7qfgq                     1/1     Running   0             3m16skube-scheduler-kubemaster            1/1     Running   0             3m33sweave-net-mvld4                      2/2     Running   1 (23s ago)   40s

`Join the worker nodes to the cluster`

Connect to each worker node and run the entire kubeadm join command that was copied earlier from the output of the kubeadm init command.

Sample command to be executed onkubeworker01 and kubeworker02

sudo kubeadm join 192.168.73.101:6443 --token tn082a..... \--discovery-token-ca-cert-hash sha256:c1b0143a.....

💡

If you missed making a note of the kubeadm join command earlier, you can generate a new token by using the below command on the controlplane and use it instead.

kubeadm token create --print-join-command

After a few seconds, check that all nodes have joined the cluster and are in a Ready state.

Execute the below command onkubemaster

kubectl get nodes

`Validation`

Validate that the Kubernetes setup is working correctly by deploying a nginx pod on the cluster.

Execute the below command onkubemaster

kubectl run test-nginx --image=nginx

kubectl get pod test-nginx

NAME         READY   STATUS    RESTARTS   AGEtest-nginx   1/1     Running   0          47s

Once the pod is in a Ready state, then it's time to say Congratulations! You've just built a fully functioning 3 node Kubernetes cluster on a M series MacBook.

`Backup and Restore`

Multipass offers an easy and effective way to take a backup of the controlplane and worker nodes. Using this backup, a corrupt Kubernetes cluster can be restored to a previous working state.

`Backup`

In order to perform a backup, use the snapshot feature offered by multipass.

Execute the below commands on a local terminal

Stop the VMs

multipass stop kubeworker02multipass stop kubeworker01multipass stop kubemaster

Verify that the VMs are stopped

multipass list

Name                    State             IPv4             Imagekubemaster              Stopped           --               Ubuntu 22.04 LTSkubeworker01            Stopped           --               Ubuntu 22.04 LTSkubeworker02            Stopped           --               Ubuntu 22.04 LTS

Capture a snapshot

multipass snapshot kubemastermultipass snapshot kubeworker01multipass snapshot kubeworker02

Verify that the snapshots are present

multipass list --snapshots

Instance       Snapshot    Parent   Commentkubemaster     snapshot1   --       --kubeworker01   snapshot1   --       --kubeworker02   snapshot1   --       --

`Restore`

In order to restore from a backup, use the restore command

💡

Substitute x with the number of the snapshot.

multipass restore kubemaster.snapshotxmultipass restore kubeworker01.snapshotxmultipass restore kubeworker02.snapshotx

`Cleanup`

In order to clean up the cluster, delete the multipass VMs using the below commands:

The delete command performs a soft deletion of the VMs. In other words, it moved the VMs to the recycle bin.

multipass delete kubeworker02multipass delete kubeworker01multipass delete kubemaster

Verify the deletion using the following command:

multipass list

Name                    State             IPv4             Imagekubemaster              Deleted           --               Ubuntu 22.04 LTSkubeworker01            Deleted           --               Ubuntu 22.04 LTSkubeworker02            Deleted           --               Ubuntu 22.04 LTS

In order to recover the deleted clusters, use the recover command:

multipass recover kubemastermultipass recover kubeworker01multipass recover kubeworker02

In order to permanently delete the VMs, the delete command should be followed by the purge command:

multipass delete kubeworker02multipass delete kubeworker01multipass delete kubemastermultipass purge

Purging an instance also deletes all the snapshots associated with this instance. In other words, the VMs cannot be recovered after being purged.

`Resources`

Here are the links to the resources referred in this blog post:

Kubeadm - Page describing the kubeadm tool in the official Kubernetes documentation.
Installing Kubernetes via kubeadm - The official Kubernetes documentation describing the steps involved in installing a cluster through kubeadm.
Multipass overview - An overview of multipass by Canonical, a tool to provision Ubuntu VMs on local machines.
Multipass installation - Steps to install the multipass tool.
Multipass instance management - Documentation on managing instances created by multipass.
Static IP provisioning - Steps to provision a static IP for a VM, which can persist over restarts.
Multipass snapshot - Instructions related to capturing a snapshot (backup) of a multipass instance.
Multipass restore - Instructions related to restoring an instance from a snapshot.
Releases for containerd - This page holds all the releases for containerd.
Releases for runc - This page holds all the releases for runc.
Releases for the CNI plugin - This page holds all the releases for the CNI plugin.
Pod network add-ons - A list of all compatible Pod network add-ons as per the official Kubernetes documentation.
ReWeave - An actively maintained fork of the Weave Net project.



KubeCon + CloudNativeCon, Rejekts and Wasm I/O Wrap-Up: A Leap into the Future with WebAssembly, AI, and Sustainable Cloud Practices
Saloni Narang — Wed, 27 Mar 2024 07:02:02 GMT
KubeCon + CloudNativeCon 2024
Location: Paris(19th-22nd March)
The tech world has been buzzing with innovations and insights, as evidenced by the recent Wasm I/O and KubeCon + CloudNativeCon events. These conferences not only showcased groundbreaking technologies but also set the stage for future trends in the industry. Here's a comprehensive wrap-up, highlighting the significant launches, discussions, and developments that have set the tech community abuzz.
The event was dominated by discussions around making Kubernetes more accessible to AI engineers and establishing it as the go-to platform for AI workloads, ensuring seamless operation. The keynote, presented by Priyanka, featured a live demo that illustrated this integration, driving home the point that Kubernetes is evolving to meet the needs of the AI-driven future.
KubeCon also showcased the growing synergy between Kubernetes, AI, and sustainability, offering insights into the latest trends and innovations in the cloud-native ecosystem.
You can see the buzz words in the cloud native ecosystem and where WebAssembly sits in! This is very interesting, SpinKube - a framework to deploy spin apps on Kubernetes easily was launched at WasmIO and then showcased at KubeCon also caught a lot of interest from the cloud native community. Now is your time to learn about WebAssembly from the free course that we have on Kubesimplify(The only complete course on YouTube)
https://youtu.be/eYekV2Do0YU?si=HVKHd3iIxMqwU1O6
 
Kubesimplify team talks at KubeCon:
Building a Tool to Debug Minimal Container Images
Speakers: Saiyam Pathak and Kyle Quest Addressing the challenges associated with distroless and slim containers, this session covered the tool called mint for debugging container images across platforms like Docker, Containerd, and Kubernetes, enhancing developers' experience and efficiency.
CloudnativeHacks Sustainability Panel Discussion
Panelists: Saiyam Pathak, Leonard Pahlke, Saloni Narang, Kristina D., and Niki Manoledaki
The discussion delved into sustainable practices within the cloud-native space, featuring insights from the Technical Advisory Group (TAG) on sustainability and offering actionable recommendations for fostering an environmentally friendly cloud-native ecosystem.
Heating Pools with Cloud Power: A New Wave in Green Computing
Speakers: Saiyam Pathak and Mark Bjornsgaard
The theme of sustainability was also prominent, with Civo and Deepgreen illustrating their innovative approach to reusing server heat for community swimming pools, reflecting a growing trend towards green computing. This tlak showcased end to end production usecase of Deep green + Civo + Fermyon.
The Kubernetes Hunger Games: Distro Performance in the Edge
Speakers: Saiyam Pathak and Shivay Lamba
This session provided an in-depth analysis of Kubernetes distributions in edge computing settings, examining the performance and suitability of microk8s, k0s, and k3s through various benchmarks, including fio, kbench, sysbench, and Iperf.
Overall Program Highlights:
Kubestronaut Program Launch: An initiative recognizing individuals who have achieved all five of CNCFs Kubernetes certifications, promoting continued learning and expertise in the Kubernetes ecosystem.
Cloud-Native AI working group: Group formed to bring together the AI and cloud native world together to solve bigger challenges together. They also released a whitepaper that gives a brief overview of the state-of-the-art AI/ML techniques, followed by what CN technologies offer, covering the next challenges and gaps before discussing evolving solutions. The paper will equip engineers and business personnel with the knowledge to understand the changing Cloud Native Artificial Intelligence (CNAI) ecosystem and its opportunities.
KubeCon EU next year is in London and Also KubeCon India happening this year in Delhi.
eBPF took a rise alongside of WebAssembly and AI.
Quotes form team Kubesimplify:
Saiyam said "KubeCon was amazing and I got to meet a lot of people, made new connections, great questions after all my talks. The AI thing at KubeCon was interesting and I saw a lot of sponsors showcasing the AI capabilities. The AI hub was cool as well. Totally worth it going to the biggest cloud native conference with great conversations"
Saloni said "Going with the baby is always challenging but love how CNCF provides the childcare facilities because of which I was able to go to my panel talk and also connect with people in the hallway track. the project pavilion was great!"
Faeka said "I saw the ebpf trend on hike and all around.📈These are two things I really need to catch hold off as they are moving very quickly. I was the only student from India this time, and i believe we need more exposure to these conferences for students.It was a very happening overall and I learnt more about the solutions so many companies are building on top of Kubernetes."
All the talks are being uploaded on CNCF YouTube channel and our sessions will be uploaded soon. One of them is already there.
Here is the curated list of recommended talks by team KubeSimplify
https://youtube.com/playlist?list=PL5uLNcv9SibDMdNvINsQYRTl0GNXMbHVX&si=jjwplR91zXCVZs86
 
Wasm I/O 2024 Wrap-Up: Harnessing WebAssembly for Future Tech
Location: Barcelona(14th and 15th March)
Wasm I/O 2024 focused on the evolution and application of WebAssembly (WASM) in modern computing, emphasizing machine learning, sustainability, and operational efficiency. This one highlighted more of production applications and component model. During the conference, SpinKube was launched that aims to simplify deployment of spin apps on Kubernetes.
Wasm I/O also highlighted the potential of WebAssembly in enhancing AI applications and how it contributes to sustainable software with its key features of startup times, low footprint.
Google also talked how they are using Wasm in Production as all levels and then very interestingly how WASM is used at edge.
Wasm I/O talks by team KubeSimplify:
Accelerating ML Inferencing with WebAssembly & Spin 2.0
Speakers: Saiyam Pathak and Radu Matei
This session explored the integration of AI with WebAssembly, providing an introduction to AI concepts, machine learning types (supervised and unsupervised), and the steps to develop an AI inference application. The use of SpinKube for local deployment and connection to remote GPUs, as well as deployment strategies on Kubernetes, were detailed, showcasing the scalability and efficiency of WASM in AI contexts.
Create Production-Grade Wasm Applications on Kubernetes
Speakers: Saiyam Pathak and Sven Pfennig Focusing on practical applications, This workshop started with basic introduction to Kubernetes and WebAssembly, then deploying a customised WASm version of potatohead application on Kubernetes. Enabling logging, metrics and doing some fancy chaos experiments with the application deployed.
Sustainability with WASM? - Faster, Greener Computing
Panelists: Saloni Narang, Saiyam Pathak, Danielle Lancashire, Shivay Lamba, and Rishit Dagli This panel discussion centered around the environmental impact of computing and how WASM contributes to sustainability. The conversation covered the software-hardware intersection, programming languages' effects on energy consumption, and how WASM aids in reducing this footprint, promoting a sustainable development model in the tech industry.
Cloud native Rejekts - Human sized conference
Paris(17th-18th March)
Cloud Native Rejekts EU 2024 conference unfolded as a beacon for technologists, from seasoned experts to novices eager to dive into the cloud-native ecosystem. This year's edition was held at a dope location that was a game centre and the talks ranged from community wellbeing to the depth of cutting-edge technologies. Kubesimplify was again proud to be the community SPONSOR for this event.
The conference demystified eBPF, offering a beginner-friendly introduction and its usage in different projects. Security discussions were equally robust, touching upon pivotal aspects of Kubernetes, containers, and the emerging Gateway API, alongside the vital role of OpenTelemetry in enhancing observability across cloud-native applications.
The talks recording will be out soon on their YouTube channel, all the talks were amazing but to cover the spectrum do watch these top three:
1)Kubernetes: container images and OCI Registry, resilience and security
2)eBPF: understanding for beginners, frameworks and observability solutions
3)Community: growth and initiatives, human aspects such as burnout and empathy
The knowledge shared at Cloud Native Rejekts EU 2024 promises to inspire and educate, fostering growth and innovation within the cloud-native landscape.
By breaking down the content from Wasm I/O and KubeCon + CloudNativeCon 2024, we can appreciate the profound advancements and discussions shaping the future of WebAssembly, AI, Kubernetes, and sustainable computing. These events have not only highlighted current technological capabilities but also charted the path for future innovations and practices in the tech industry.
Some pictures from both the events:
As always it was great meeting everyone in person, feel free to connect with us if you were not able to meet in person as we would be happy to chat.
For organisations looking to partner with Kubesimplify,DMus.


Why are network policies in Kubernetes so hard to understand?
Saiyam Pathak — Sat, 23 Mar 2024 16:34:14 GMT
In Kubernetes, the concept of network policies allows you to control the traffic flow within a cluster. Essentially, by creating policies, you determine which pods can access others, streamlining the process of restricting traffic between different applications within the cluster.
You will also run into many microservices in different namespaces within a Kubernetes environment. These applications are run as pods, which in turn run containers. These containers are your applications and are capable of communicating with every other pod, either directly or through services. However, this open communication model isn't always secure. Fortunately, Kubernetes offers the concept of network policy, implemented by various network providers, to provide out-of-the-box functionality for controlling this aspect securely.
The community often voices that network policies are complex, but by exploring concrete examples, we can gain a clearer understanding of how they work in action.
Prerequisites
To follow along with this tutorial, you need to ensure you have the following in place:
A Civo Account
A Kubernetes cluster
Kubectl installed
Civo CLI installed
Creating a Kubernetes cluster with Cilium
To begin with, lets create a Civo Kubernetes cluster with Cilium as the CNI. You can create the cluster from the UI or the CLI.
 For the purpose of this tutorial, we will be using Civo Kubernetes, but you can go with any Kubernetes cluster and CNI where network policies will work.
Interacting with the cluster
Once you have the cluster created, you can export the KUBECONFIG variable in your terminal and point it to the downloaded kubeconfig file for the cluster. From here, you should be able to interact with the cluster:
kubectl get nodesNAME                                                   STATUS   ROLES    AGE   VERSIONk3s-networkpolicies-7aed-fb151a-node-pool-71b0-krmn8   Ready       71s   v1.28.2+k3s1k3s-networkpolicies-7aed-fb151a-node-pool-71b0-jl4q3   Ready       69s   v1.28.2+k3s1k3s-networkpolicies-7aed-fb151a-node-pool-71b0-6ko85   Ready       67s   v1.28.2+k3s1
Create 2 namespaces dev1 and dev2:
kubectl create ns dev1namespace/dev1 createdkubectl create ns dev2namespace/dev2 created
Create a pod demo1 and pod demo2 in respective namespaces with NGINX image:
kubectl run demo1 --image=nginx -n dev1pod/demo1 createdkubectl run demo2 --image=nginx -n dev2pod/demo2 createdkubectl get pods -owide -n dev1NAME    READY   STATUS    RESTARTS   AGE   IP           NODE                                                   NOMINATED NODE   READINESS GATESdemo1   1/1     Running   0          97s   10.0.1.147   k3s-networkpolicies-7aed-fb151a-node-pool-71b0-6ko85              kubectl get pods -owide -n dev2NAME    READY   STATUS    RESTARTS   AGE   IP           NODE                                                   NOMINATED NODE   READINESS GATESdemo2   1/1     Running   0          94s   10.0.1.185   k3s-networkpolicies-7aed-fb151a-node-pool-71b0-6ko85              
Testing the connectivity
Lets now test the connectivity of one pod from another:
kubectl exec demo1 -n dev1 -- curl 10.0.1.185  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current                                 Dload  Upload   Total   Spent    Left  Speed  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0Welcome to nginx!Welcome to nginx!
If you see this page, the nginx web server is successfully installed andworking. Further configuration is required.
For online documentation and support please refer to"http://nginx.org/">nginx.org.
Commercial support is available at"http://nginx.com/">nginx.com.
Thank you for using nginx.
100   615  100   615    0     0   405k      0 --:--:-- --:--:-- --:--:--  600k
In this code, we exec into pod demo1 in the dev1 namespace and try to curl the IP of pod demo2 in dev2 namespace, this shows that any pod can communicate with any other pod in any namespace.
Now how can we fix this? You are right! Using NetworkPolicy.
Creating a Network Policy
Lets create a network policy in the dev2 namespace so that no traffic can reach the pods in the dev2 namespace.
cat << EOF | kubectl apply -f -apiVersion: networking.k8s.io/v1kind: NetworkPolicymetadata:  name: deny-all  namespace: dev2spec:  podSelector: {}  policyTypes:  - IngressEOFnetworkpolicy.networking.k8s.io/deny-all created
Above is the manifest for all the pods in dev2 namespace. The purpose of this policy is to restrict all incoming traffic to the pods within the dev2 namespace. Here's a breakdown of how it works:
apiVersion: networking.k8s.io/v1: Specifies the API version for the network policy resource.
kind: NetworkPolicy: This specifies the kind of Kubernetes resource you're defining, which in this case is a Network Policy.
metadata: Contains metadata about the network policy, including its name (deny-all) and the namespace (dev2) it is applied to.
spec: Defines the specifications of the Network Policy.
podSelector: This is set to an empty object ({}), which means the policy applies to all pods within the specified namespace (dev2 in this case). You could specify label selectors here if you wanted to target specific pods.
policyTypes: Specifies the types of policies. In this case, it includes - Ingress, which means the policy will apply to incoming traffic to the pods. By not specifying Egress in the policy types, this policy does not restrict egress (outgoing) traffic from the pods.
Ingress: Since no rules are defined under the ingress section (which is implicitly understood from the lack of an ingress field under spec), it means no inbound connections are allowed to any pods in the dev2 namespace. You would define rules here if you wanted to allow specific types of ingress traffic.
The following outcome should appear:
kubectl exec demo1 -n dev1 -- curl --connect-timeout 5 10.0.1.185  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current                                 Dload  Upload   Total   Spent    Left  Speed  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:01 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:02 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:03 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:04 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:05 --:--:--     0curl: (28) Failed to connect to 10.0.1.185 port 80 after 5001 ms: Timeout was reachedcommand terminated with exit code 28
You can see that after applying this policy no traffic can reach the pods in the dev2 namespace.
Receiving incoming TCP traffic
Next, lets try to cover a couple more scenarios to understand the concept more clearly. Next, we will create a network policy that would allow the traffic from pods in dev1 namespace to dev2 namespace over port 80.
cat << EOF | kubectl apply -f -apiVersion: networking.k8s.io/v1kind: NetworkPolicymetadata:  name: kube-demo  namespace: dev2spec:  podSelector: {}  ingress:  - from:    - namespaceSelector:        matchLabels:          kubernetes.io/metadata.name: dev1    ports:    - protocol: TCP      port: 80EOFnetworkpolicy.networking.k8s.io/kube-demo created
Heres a breakdown of what is happening in the above code:
spec: Defines the specifics of the Network Policy.
podSelector: An empty object ({}) is specified, meaning this policy applies to all pods within the dev2 namespace. If you wanted this policy to apply to specific pods, you would use label selectors here.
ingress: Specifies the rules for incoming traffic to the selected pods.
from: Defines the sources from which the pods can receive traffic.
namespaceSelector: Specifies that the incoming traffic is allowed from pods in namespaces that match the specified labels. In this case, it's allowing traffic from the dev1 namespace, as indicated by the label selector kubernetes.io/metadata.name: dev1.
ports: Specifies the ports and protocols for the incoming traffic that is allowed.
protocol: The allowed protocol for ingress traffic, which is TCP in this case.
port: The port on which incoming traffic is allowed, which is port 80.
This network policy allows pods in the dev2 namespace to receive incoming TCP traffic on port 80 from any pod in the dev1 namespace. No other ingress traffic is permitted by this policy, effectively isolating the pods in dev2 from unwanted or unsolicited incoming traffic from pods in other namespaces, except for the allowed traffic from dev1.
Network Policy Example
Interestingly if you have a previous policy of deny all and this policy as well, it will be a combination, and the resultant will allow traffic on port 80 from pods in dev1 namespace to dev2 namespace.
You can check the output below:
kubectl exec demo1 -n dev1 -- curl --connect-timeout 5 10.0.1.185  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current                                 Dload  Upload   Total   Spent    Left  Speed  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--   100   615  100   615    0     0   267k      0 --:--:-- --:--:-- --:--:--  300kWelcome to nginx!Welcome to nginx!
If you see this page, the nginx web server is successfully installed andworking. Further configuration is required.
For online documentation and support please refer to"http://nginx.org/">nginx.org.
Commercial support is available at"http://nginx.com/">nginx.com.
Thank you for using nginx.
To check that traffic is not allowed from any other namespace or on any other port, let's create a pod in the dev2 namespace listening on a different port than 80 and a pod in default namespace as well:
kubectl run default --image=nginxpod/default createdkubectl get pods -owideNAME                                 READY   STATUS      RESTARTS   AGE    IP           NODE                                                   NOMINATED NODE   READINESS GATESdefault                              1/1     Running     0          26s    10.0.2.237   k3s-networkpolicies-7aed-fb151a-node-pool-71b0-jl4q3              kubectl exec default -- curl --connect-timeout 5 10.0.1.185  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current                                 Dload  Upload   Total   Spent    Left  Speed  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:01 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:02 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:03 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:04 --:--:--     0     0    0     0    0     0      0      0 --:--:--  0:00:05 --:--:--     0curl: (28) Failed to connect to 10.0.1.185 port 80 after 5000 ms: Timeout was reachedcommand terminated with exit code 28
Now lets try to create a pod and service to listen on port 8080:
kubectl run http-echo --image=hashicorp/http-echo -n dev2 -- -listen=:8080 -text="Hello from http-echo"pod/http-echo createdkubectl get pods -n dev2 -owideNAME         READY   STATUS    RESTARTS   AGE   IP           NODE                                                   NOMINATED NODE   READINESS GATESdemo2        1/1     Running   0          80m   10.0.1.185   k3s-networkpolicies-7aed-fb151a-node-pool-71b0-6ko85              http-echo   1/1     Running   0          26s   10.0.1.115   k3s-networkpolicies-7aed-fb151a-node-pool-71b0-6ko85              
Creating a service
Create the service using the following:
kubectl expose pod http-echo -n dev2 --port=8080service/http-echo exposedkubectl get svc -n dev2NAME        TYPE        CLUSTER-IP       EXTERNAL-IP   PORT(S)     AGEhttp-echo   ClusterIP   10.98.167.221            8080/TCP   28s
kubectl exec demo1 -n dev1 -- curl --connect-timeout 5 10.98.167.221  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current                                 Dload  Upload   Total   Spent    Left  Speed  0     0    0     0    0     0      0      0 --:--:--  0:00:05 --:--:--     0curl: (28) Failed to connect to 10.98.167.221 port 80 after 5000 ms: Timeout was reachedcommand terminated with exit code 28
The kube-demo NetworkPolicy that was applied to all pods in the dev2 namespace (since podSelector is empty), allows them to receive incoming TCP traffic on port 80 only from pods within the dev1 namespace (identified by the label kubernetes.io/metadata.name: dev1). All other incoming traffic from different namespaces, or different ports, will be denied by default, as this is the standard behavior of Kubernetes NetworkPolicies when they are applied to a set of pods. We proved that with a pod in the default namespace and also by running a pod on port 8080 and trying to connect to it also failed.
Summary
Throughout this tutorial, you should now have a better understanding of how you can apply network policies within your Kubernetes cluster to limit the ingress/egress traffic for the pods. Another interesting way to learn more about this topic is by using this tool which allows you to create network policies for Kubernetes and gain a better understanding of the concept.
If you want other resources to keep learning more about this topic, I recommend checking out the following:
Kubernetes Documentation on network policies
Kubernetes network policy recipes
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us!


Practical Guide to Kubernetes API
Kunal Verma — Tue, 19 Mar 2024 13:31:58 GMT
You're probably familiar with Kubernetes, the open-source platform designed by Google and now maintained by the Cloud Native Computing Foundation (CNCF), which automates the deployment, scaling, and management of containerized applications.
But, did you know that under-the-hood, Kubernetes is an API.
Thats right! Every action you take within Kubernetes, from creating pods to monitoring services, is ultimately an API interaction.
The Kubernetes API serves as the backbone of the platform, providing a unified interface for managing and interacting with Kubernetes resources. While you may already be using kubectl, the official Kubernetes CLI tool to interact with your cluster, understanding the API grants you deeper control over your cluster, allowing you to automate tasks, customize configurations, and integrate Kubernetes with other tools and systems.
Now, Kubernetes API is massive  with hundreds of endpoints and concepts involved. Truth be told, its a bit more advanced than just a bunch of HTTP endpoints thrown together and for someone new, lets just say things can get a bit overwhelming at first, if not approached it correctly.
That's where this blog comes in. In this practical guide, we'll cover the fundamental aspects of the Kubernetes API to help you navigate it confidently. Whether you're a developer, a system administrator, or simply curious about Kubernetes, this guide will provide you with the knowledge and skills needed to get started on the path of harnessing the power of the Kubernetes API effectively.
Understanding the Basics
Before proceeding to the main API concepts, let us first get some basics right about the Kubernetes API.
RESTful Nature
The Kubernetes API is RESTful in nature, which simply means that it follows the REST (Representational State Transfer) architectural style and standards for communication.
Well not go into the details of the REST architecture here, but its important to understand that RESTful APIs adheres to a set of principles and getting familiar with them would in turn help us in understanding the nature of the Kubernetes API itself. For example:
Stateless Communication - Each request from a client to the server must contain all the information needed to understand and fulfill the request, without relying on any previous interactions. That means, every time you talk to the Kubernetes API, you don't need to remember past API calls. Each interaction is independent, like sending a new message without needing to refer back to old ones.
Uniform Interface - Due to its RESTful nature, the Kubernetes API is pretty consistent and has a standardized interface of communication. This actually is helpful for us, because one just needs to understand a limited number of patterns and then apply that knowledge to understand the rest of the API. So, thats a win for us!
Self-descriptive Messages - RESTful APIs use descriptive messages to communicate between the client and server. This means that each interaction includes information about what action is being requested and how to process it. For example, when you send a request to the Kubernetes API, the response will include clear indications of whether the request was successful, along with any relevant data or error messages. This self-descriptive nature simplifies the communication process, making it easier to understand and troubleshoot interactions with the Kubernetes API.
Exposing the K8s API
Apart from it being RESTful in nature, its essential to know that API server component of the control plane is the one that exposes the Kubernetes API to users and other components within the cluster. Officially, the implementation is called kube-apiserver and this enables the end users, different parts of your cluster, and external components communicate with one another.
The API server acts as the first point of contact for any external user or request to the cluster and all the internal operations are channeled through the API server component as well.
📍 Not familiar with the API server or in general the Kubernetes architecture? Feel free to check out the Kubernetes 101 Workshop for more clarity.
Ways of Accessing the API
We understand that at the end of the day, every operation performed in Kubernetes is an API call and involves accessing the core K8s API is some way. Interestingly, there are different ways to access the Kubernetes API, catering to different use cases and preferences:
Via kubectl: kubectl is the official command-line tool for Kubernetes and provides a convenient way to interact with the Kubernetes API. It simplifies tasks like creating, updating, and deleting resources using intuitive commands.
 For instance, here is a command to fetch all the running pods in a cluster:
 $ kubectl get pods NAME    READY   STATUS    RESTARTS   AGE demo    1/1     Running   0          27s demo2   1/1     Running   0          23s demo3   1/1     Running   0          17s
Through Simple REST Calls via curl: If you are comfortable with HTTP requests, the Kubernetes API can be also be accessed directly through simple REST calls using the curl CLI command. This allows for more granular control and customization of API interactions.
 The same request of fetching all the running pods using the curl command looks something like:
 # Simple Request $ curl -s http://127.0.0.1:8080/api/v1/namespaces/default/pods {   "kind": "PodList",   "apiVersion": "v1",   "metadata": {     "resourceVersion": "3606"   },   "items": [     {       "metadata": {         "name": "demo",         "namespace": "default",         "uid": "9cbf8ea4-9fbf-4824-a170-6cace6888a57",         "resourceVersion": "2693",         "creationTimestamp": "2024-03-14T05:51:03Z",         "labels": {           "run": "demo"         }, ....
 # Formatted Request $ curl -s http://127.0.0.1:8080/api/v1/namespaces/default/pods | jq '.items[].metadata.name' "demo" "demo2" "demo3"
Using Client Libraries: Kubernetes also offers a set of client libraries in various programming languages (such as Python, Go, Java) for those looking to develop applications that interact with the Kubernetes API. These libraries abstract away the complexities of making HTTP requests and makes it easier to build applications that interact with Kubernetes.
 For instance, have a look at this code snippet in Go that fetches all the running pods in a cluster.
API Structure Breakdown
Now that we have a basic familiarity with the nature of the Kubernetes API, we'll now break down the internal structure of the Kubernetes API to give you a clear understanding of how it works.
Resources and Verbs
Alright, let us start from up top!
As we are trying to understand a RESTful API here, the communication primarily revolves around resources and verbs. By definition:
Resources represent the entities you want to interact with, while verbs specify the actions you can perform on those resources.
In simple terms, resources are the "things" you want to work with. In case of Kubernetes, we have pods, services, or deployments etc. Verbs, on the other hand, are the "actions" you can take on those things, such as create, get, update, or delete.
In conclusion, each resource is treated as a separate entity and is basically the endpoint of the API, that can be accessed and manipulated independently.
Now, let's apply this concept to the Kubernetes API. In Kubernetes, these resources or API endpoints are officially called as resources types, though in practice, we simply refer to them as resources. These are the ones that youll find at the end of the request URI. For example, take a look at this request below:
$ kubectl get --raw /api/v1/namespaces/default/pods
Here, /pods represents the pods resource type or endpoint of the API.
Extending this a bit further, a single instance of a resource type is called a resource, which often represents a Kubernetes object such as a pod, deployment, namespace etc. For example, take a look at this request below:
$ kubectl get --raw /api/v1/namespaces/default/pods/nginx-7c7db887d8-dkkcg
Here, we are querying the API to fetch the pod - nginx-7c7db887d8-dkkcg, which is a Kubernetes object or resource and is a single instance of the pods resource type.
💡 Note
In the examples above, you have noticed we are using kubectl with the -raw flag to send request to the API. This is the kubectl raw mode!
kubectl raw mode is a special feature (pretty handy one as well) allows you to interact with the Kubernetes API directly, bypassing some of the built-in kubectl functionality.
Why its useful?
This is useful when you need more flexibility and control over your API requests, such as when troubleshooting or debugging complex issues, or when you want to interact with Kubernetes resources in ways that aren't supported by the standard kubectl commands.
Lets move onto Verbs. As mentioned above, Verbs are the actions you are allowed to perform on the resources (or, resource types in case of Kubernetes).
Talking about the Kubernetes API, all the standard HTTP verbs are supported with some additional ones added to the list. Below is a list of all the operations you can perform (taken from sig-architecture/api-conventions.md):
GET /: Retrieves a list of resources of type . For instance, GET /pods returns a list of Pods.
POST /: Creates a new resource based on the JSON object provided by the client.
GET //: Fetches a single resource with the given name. For example, GET /pods/first retrieves a Pod named first.
DELETE //: Deletes a single resource with the given name.
DELETE /: Deletes a list of resources of type . For instance, DELETE /pods removes a list of Pods.
PUT //: Updates or creates the resource with the given name using the JSON object provided by the client.
PATCH //: Selectively modifies specified fields of the resource.
GET /?watch=true: Receives a stream of JSON objects corresponding to changes made to any resource of the given kind over time.
In summary, when we are dealing with a RESTful API like Kubernetes, communication primarily revolves around resources and verbs.
Resources denote the entities you interact with, while verbs signify the actions you can perform on them. In the Kubernetes API, these resources, or API endpoints, are referred to as resource types. Each resource type represents a distinct entity that can be accessed and manipulated independently. Furthermore, a single instance of a resource type is termed a resource, often representing a Kubernetes object like a pod or deployment.
Transitioning to verbs, they dictate the actions permissible on resources. In the Kubernetes API, standard HTTP verbs are supported, with additional ones included (such as PATCH). These include operations such as retrieving a list of resources, creating new resources, updating or deleting specific resources, and selectively modifying resource fields.
API Groups and Versions
In previous section we studied about resource types. As mentioned previously, Kubernetes API is massive and there are lot of different resource types involved  as we can now imagine!
For the purpose of increasing simplicity and extending the APIs capabilities, the resource types are carefully organized into API groups, with each group serving a different purpose.
Below are the two main API groups that are essential to understand:
The Core Group: The core (also called legacy) group is found at REST path - /api. This particular endpoint is only used by core K8s resources such as pods, secrets, configmaps, etc. You'll typically find this mentioned in a yaml file as - apiVersion: v1 field.
 For instance, consider the pod specification yaml file below:
 apiVersion: v1 kind: Pod metadata:   name: nginx spec:   containers:   - name: nginx     image: nginx:1.14.2     ports:     - containerPort: 80
Named Group: The named groups are a bit more modern and generic which can be found at the REST path - /apis/. This endpoint is used by all the other resources (including custom resources) and deals with specific areas like networking or storage. In a typical Kubernetes yaml file, you'll spot this as apiVersion: $GROUP_NAME/$VERSION.
 For instance, take a look at this yaml specification for a Kubernetes Job:
 apiVersion: batch/v1 kind: Job metadata:   name: pi spec:   template:     spec:       containers:       - name: pi         image: perl:5.34.0         command: ["perl",  "-Mbignum=bpi", "-wle", "print bpi(2000)"]       restartPolicy: Never   backoffLimit: 4
One thing you may have already noticed from the section above is how we also mention a specific version with each API group (whether core or named groups). This is a standard practice followed throughout the entire Kubernetes API. Each API group is versioned independently that evolves over time, moving through different stages of development and use:
alpha - Experimental, potentially unstable.
beta - More tested but still subject to change.
stable or GA (General Availability) - Reliable and ready for production use.
You can refer the full list of all the API groups with their versions in the K8s API reference or use a simple command of kubectl api-resources to do the same:
$ kubectl api-resourcesNAME                              SHORTNAMES   APIVERSION                        NAMESPACED   KINDbindings                                       v1                                true         Bindingcomponentstatuses                 cs           v1                                false        ComponentStatusconfigmaps                        cm           v1                                true         ConfigMapendpoints                         ep           v1                                true         Endpointsevents                            ev           v1                                true         Eventlimitranges                       limits       v1                                true         LimitRangenamespaces                        ns           v1                                false        Namespacenodes                             no           v1                                false        Nodepersistentvolumeclaims            pvc          v1                                true         PersistentVolumeClaimpersistentvolumes                 pv           v1                                false        PersistentVolumepods                              po           v1                                true         Pod...
💡 Interesting Fact
Did you know that in the latest Kubernetes v1.29 release, there are a total of 49 enhancements which includes:
11 features - stable or GA stage
19 features - beta stage
19 features - alpha stage
Check out the detailed breakdown of K8s 1.29 in this video and you may also refer the official release blog to know more!
In summary, the resource types in Kubernetes API are organized into API Groups for easier management and increasing Kubernetess capabilities.
The core API group, accessed at /api, handles fundamental Kubernetes resources like pods and secrets, often denoted by apiVersion: v1 in YAML files. On the other hand, named API groups, found at /apis/, cater to more specialized resources such as networking or storage, indicated by apiVersion: $GROUP_NAME/$VERSION.
Throughout the Kubernetes API, each API group is versioned independently, moving through stages like alpha, beta, and stable or GA (General Availability). This ensures that the API evolves reliably for production use.
Kind (Object Schema)
If you have worked with Kubernetes before and know your way around a typical Kubernetes yaml manifest, you've likely come across the kind field. For example - kind: Pod, kind: Ingress, kind: Deployment and so on.
apiVersion: batch/v1kind: Jobmetadata:  name: pispec:  template:    spec:      containers:      - name: pi        image: perl:5.34.0        command: ["perl",  "-Mbignum=bpi", "-wle", "print bpi(2000)"]      restartPolicy: Never  backoffLimit: 4
Viewing this from a beginners perspective, you might assume that kind denotes the name of the resource being created in Kubernetes i.e. a Pod, Ingress, Deployment etc. written in PascalCase format. But in reality, thats actually not the case!
In terms of Kubernetes, each resource (or, resource type when referring to the K8s API) is represented by an object (Kubernetes object to be specific) having a specific schema associated with it, called Kind. In simpler terms, a schema is like a blueprint that defines how a particular resource (or, object) looks and behaves.
Essentially, Kind specifies the structure, properties, and behavior of a particular resource, including what attributes it has and how those attributes can be used or modified. Essentially, it outlines the rules and guidelines for working with that specific resource type within a Kubernetes cluster.
Now, as per the sig-architecture API convention, kinds are grouped into three categories:
Objects (Pod, Service, etc) - persistent entities in the system.
Lists (PodList, APIResourceList, etc) - collections of resources of one or more kinds.
Simple - specific actions on objects (status, scale, etc.) or non-persistent auxiliary entities (ListOptions, Policy, etc).
You may think that  all this is good to know, but what is the actual importance of Kind in the Kubernetes API and why are we discussing this today?
Turns out, this particular field is important when it comes to client - server communication. It allows proper serialization and deserialization of the specified object (mentioned in the Kind field) before transmitting them over a network or storing them.
📍A Quick Tour - Concept of Serialization & Deserialization
Serialization and Deserialization refers to the process of converting data (such as objects, structures, or variables) into a format that can be easily transmitted over a network or stored in a database, and then converting it back to its original form when received by another system.
Talking in terms of the Kubernetes API to give you a better idea, when it comes to sending a request to the cluster to perform a certain operation on a resource  such as create, update, or retrieve, we need to serialize the data in the request i.e. convert it into a format that the Kubernetes API understands, typically JSON or YAML. Sounds familiar, right?
Once the Kubernetes API receives this serialized data, it deserializes it i.e. converts it back into its original form. This allows the Kubernetes API to process the request, validate the data, and perform the necessary actions, such as creating or updating resources within the cluster.
Theres more!
Interestingly, when the Kubernetes API sends the response back to the client, it serializes the data before transmitting it over the network. The client then deserializes the response, which allows it to interpret and use the data as needed.
Demo - List all the Running Pods in a Cluster
Alright, throughout the previous sections we have talked a lot about the theory of the Kubernetes API and now have a basic idea of how its structured.
Let's roll up our sleeves and get our hands dirty with the Kubernetes API by making a request that lists down all the current running pods in the cluster.
Prerequisites
To follow along, here is a list of all the prerequisites needed:
minikube installed
kubectl CLI installed
curl or any CLI tool for sending HTTP requests
JSON Output formatter - jq (a very handy tool)
Basic knowledge of working with Kubernetes via kubectl
Step 1 - Creating a Kubernetes Cluster
Here, well use minikube to bootstrap a single node K8s cluster using the following command:
$ minikube start😄  minikube v1.32.0 on Darwin 14.4 (arm64)  Using the docker driver based on existing profile👍  Starting control plane node minikube in cluster minikube🚜  Pulling base image ...🔥  Creating docker container (CPUs=2, Memory=7792MB) ...🐳  Preparing Kubernetes v1.28.3 on Docker 24.0.7 ...🔗  Configuring bridge CNI (Container Networking Interface) ...🔎  Verifying Kubernetes components...     Using image gcr.io/k8s-minikube/storage-provisioner:v5🌟  Enabled addons: storage-provisioner, default-storageclass...
As the cluster creation process finishes, use the following command to check the cluster information:
$ kubectl cluster-infoKubernetes control plane is running at CoreDNS is running at 
Here, well find the host URL i.e. address of the control plane, to which well making our HTTP request. In this case, its - https://127.0.0.1:57403.
Step 2 - Authenticating the API Server to the Client
As we have already studied above, the API server component in the control plane is the one thats responsible to expose the API to both the client and other components within the cluster.
Now, Kubernetes by default restricts access to its API endpoints. That means, in order for us to send any request, we first need establish trust both ways i.e. between the API Server and the client, and vice versa. Let us understand the first one here!
In Kubernetes, a method to authenticate the API server to the client is by using the CA certificate (Certificate Authority).
📍 The CA certificate is a trusted certificate issued by the Kubernetes cluster that verifies the identity of the API server to the client.
In our case, we are using a local cluster bootstrapped by minikube, which has a CA certificate signed by minikubeCA (minikubes own Certificate Authority). Therefore, in order to establish trust between the API server and the client, we need to manually point out the location of minikubes CA certificate in our request, using the --cacert flag provided by curl:
$ curl --cacert ~/.minikube/ca.crt https://127.0.0.1:57403/api/v1/namespaces/default/pods{  "kind": "Status",  "apiVersion": "v1",  "metadata": {},  "status": "Failure",  "message": "pods is forbidden: User \\"system:anonymous\\" cannot list resource \\"pods\\" in API group \\"\\" in the namespace \\"default\\"",  "reason": "Forbidden",  "details": {    "kind": "pods"  },  "code": 403}
Now, interestingly this would fail when executed, because an additional authentication is still remaining to be done!
Step 3 - Authenticating the Client to the API Server
Just as the API server authenticates itself to the client, the client also needs to authenticate itself to the API server. This ensures mutual trust between both parties.
Now, Kubernetes provides several authentication methods for this purpose, but well keep it simple and authenticate the request using client certificate and key.
📍 What is a Client Certificate and Client Key?
The client certificate is a digitally signed document issued by a trusted Certificate Authority (CA) that uniquely identifies the client (user). It contains information such as the client's identity (common name), a public key, and other metadata. When the client sends a request to the API server, it presents this certificate as proof of its identity.
Furthermore, the client key is the corresponding private key that pairs with the client certificate. It is securely stored and known only to the client. The key is used for cryptographic operations, such as encrypting data and generating digital signatures. When the client sends a request, it uses this key to prove ownership of the client certificate and to establish a secure connection with the API server.
In conclusion, the client certificate and key form a crucial part of the mutual TLS (Transport Layer Security) authentication, which ensures that both the client and the API server can trust each other's identities.
Luckily, in minikube, these credentials are typically generated during cluster initialization and stored securely and can be found here:
$ cat ~/.minikube/profiles/minikube/client.crt-----BEGIN CERTIFICATE-----MIIDITCCAgmgAwIBAgIBAjANBgkqhkiG9w0BAQsFADAVMRMwEQYDVQQDEwptaW5pa3ViZUNBMB4XDTI0MDMxNDA2MzAyOVoXDTI3MDMxNTA2MzAyOVowMTEXMBUGA1UEChMOc3lzdGVtOm1hc3RlcnMxFjAUBgNVBAMTDW1pbmlrdWJlLXVzZXIwggEiMA0G...$ cat ~/.minikube/profiles/minikube/client.key-----BEGIN RSA PRIVATE KEY-----MIIEpQIBAAKCAQEAniVfcQgFFSa+OTgfD1LRO1p2FN4vRBqRynNv5n43iHpaYXtWjIz4rPh230uXXfdpIGb9OBJ6Vrg1LN6eXtsS8e0mJgRR3n3vi0xecax+eB4kWATEKdN4LwLDWociXXgk7TK6bkU5y8kXIn7lwnpq57sput+NV4JevFAlBdy2tKtci6UD...
Overall, to make an authenticated request to the Kubernetes API, here are the credentials we need to provide:
minikube CA certificate
client certificate
client key
Step 4 - Making the HTTP request Using curl
Now, let use curl to send a request to the API, that will list down all the running pods in your cluster.
📍NOTE
Before making the request, make sure you already have a few pods running in your newly created cluster to see some output in the end.
You can use kubectl in this case:
$ kubectl run demo --image=nginxpod/demo created
Use the following command to make the HTTP request:
$ curl  \--cacert ~/.minikube/ca.crt \--cert ~/.minikube/profiles/minikube/client.crt --key ~/.minikube/profiles/minikube/client.key{  "kind": "PodList",  "apiVersion": "v1",  "metadata": {    "resourceVersion": "3208"  },  "items": [    {      "metadata": {        "name": "demo",        "namespace": "default",        "uid": "4d1b064d-6e25-4f9b-81d3-b72af5d68451",        "resourceVersion": "3130",        "creationTimestamp": "2024-03-15T08:24:17Z",        "labels": {          "run": "demo"        },...
Great, it works! Although, the output is pretty long and not so good looking. Let us use the jq tool to format it and print only the names of all the pods (without any other metadata):
$ curl  \--cacert ~/.minikube/ca.crt \--cert ~/.minikube/profiles/minikube/client.crt \--key ~/.minikube/profiles/minikube/client.key | jq '.items[].metadata.name'"demo""test-deploy-859f95ffcc-8p8t8""test-deploy-859f95ffcc-fcdld"
Congratulations 🎉 Youve successfully made an authenticated API request to Kubernetes!
Tips for Further Exploration
Here are a few additional things you can try out to solidify your understanding of the API:
List down all the resources (or, resource types) in your cluster along with their short names, and API groups:
  $ kubectl api-resources  NAME                              SHORTNAMES   APIVERSION                             NAMESPACED   KIND  bindings                                       v1                                     true         Binding  componentstatuses                 cs           v1                                     false        ComponentStatus  configmaps                        cm           v1                                     true         ConfigMap  endpoints                         ep           v1                                     true         Endpoints  events                            ev           v1                                     true         Event  limitranges                       limits       v1                                     true         LimitRange  namespaces                        ns           v1                                     false        Namespace  ...
List down all the API versions supported by your cluster:
  $ kubectl api-versions  admissionregistration.k8s.io/v1  apiextensions.k8s.io/v1  apiregistration.k8s.io/v1  apps/v1  ...
Sending an API request using the kubectl raw mode:
  $ kubectl get --raw /api/v1/namespaces/default/pods  {    "kind": "PodList",    "apiVersion": "v1",    "metadata": {      "resourceVersion": "3208"    },    "items": [...]  $ kubectl get --raw /api/v1/namespaces/default/pods | jq '.items[].metadata.name'  "demo"  "test-deploy-859f95ffcc-8p8t8"  "test-deploy-859f95ffcc-fcdld"
View the under-the-hood API calls made by the kubectl command:
  $ kubectl get -v 6 -n default pods  I0315 14:17:53.829941   32342 loader.go:395] Config loaded from file:  /Users/kunalverma/.kube/config  I0315 14:17:53.830498   32342 cert_rotation.go:137] Starting client certificate rotation controller  I0315 14:17:53.854051   32342 round_trippers.go:553] GET limit=500> 200 OK in 21 milliseconds  NAME                           READY   STATUS    RESTARTS      AGE  demo                           1/1     Running   0             23m  test-deploy-859f95ffcc-8p8t8   1/1     Running   3 (33m ago)   131m  test-deploy-859f95ffcc-fcdld   1/1     Running   3 (33m ago)   131m
  Here, we are using the -v flag to set the verbosity of the output. You may even use -v 8 to dig a bit deeper and view the complete response body.
Wrapping Up
In this guide, we covered the fundamentals of interacting with the Kubernetes API, from understanding its RESTful nature to exploring its internal structure. We leaned about the primary components that make up the API - Resources, Verbs, API Groups, Versions and Kinds i.e. object schema. Lastly, we went through an entire process of making an HTTP request to the API via curl to fetch all the running pods in the cluster.
As you continue your journey with Kubernetes, I encourage you to explore further by diving into more hands-on practice. Experiment with different API endpoints, try out various authentication mechanisms, and even build automation scripts or applications that interact with the Kubernetes API.
By gaining practical experience and deepening your understanding of the Kubernetes API, you'll be better equipped to manage and orchestrate containerized applications effectively in Kubernetes clusters.
Happy Learning!
Resources
Kubernetes Documentation - API Overview
Kubernetes Documentation - API Concepts
SIG Architecture - API Conventions
Exploring Kubernetes API with Curl
Working with Kubernetes API - Learning Series
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us!


Kubesimplify at WasmIO and KubeCon EU 2024
Saloni Narang — Sat, 09 Mar 2024 14:26:15 GMT
Get ready for an action-packed fortnight in the WebAssembly realm! Saiyam Pathak(Founder of Kubesimplify) and Saloni Narang(DevRel) are on their way to the Wasm I/O event in Barcelona, Spain, and then off to the Cloud native Rejekts and then to KubeCon + CloudNativeCon EU 24 in Paris, France. Each event is going to be amazing as they both are speaking at the events and Kubesimplify is a proud community sponsor for Rejekts. Continue reading to discover all the sessions at both conferences!
You can talk with Saiyam/Saloni and understand Kubesimplify's mission and collaboration opportunities for DevRel, Content and Interviews onsite.
Wasm I/O 24
Accelerating ML Inferencing with WebAssembly & Spin 2.0: Saiyam Pathak and Radu Matei
  Session Link
  Dive into ML inferencing and the evolution of AI with WASM and Spin 2.0. Discover the potential of WASM for AI app development, performance enhancements, and cross-platform security.
Create Production-Grade Wasm Applications on Kubernetes [Workshop]: Saiyam Pathak and Sven Pfennig
  Session Link
  Learn to deploy and manage robust Wasm applications on Kubernetes. This workshop covers everything from deployment to monitoring, focusing on production-grade solutions.
Sustainability with WASM? - Faster, Greener Computing [Panel]: Saloni Narang, Saiyam Pathak, Danielle Lancashire, Shivay Lamba and Rishit Dagli
  Session Link
  Explore the role of WASM in achieving greener computing. Our panel discusses the environmental benefits of WASM, optimizing resources for sustainable development.
KubeCon + CloudNativeCon EU 24
Colocated Event: The Kubernetes Hunger Games: Distro Performance in the Edge: Saiyam Pathak and Shivay Lamba
  Session Link
  Join us for an analysis of Kubernetes distributions in edge computing environments. Discover how different distros cater to diverse edge computing needs.
CloudnativeHacks Sustainability Panel Discussion: Saiyam Pathak, Leonard Pahlke, Saloni Narang, Kristina D. and Niki Manoledaki
  Session Link
  Join our panel as we dive into the sustainability practices within cloud-native ecosystems. Learn about the advancements in TAG sustainability.
Heating Pools with Cloud Power: A New Wave in Green Computing: Saiyam Pathak and Mark Bjornsgaard
  Session Link
  Discover how Civo & DEEP GREEN's project uses server heat for community pools. This case study highlights the synergy between cooling technologies and cloud solutions.
Building a Tool to Debug Minimal Container Images: Saiyam Pathak and Kyle Quest
  Session Link
  Learn about tools that enhance the debugging of minimal container images across Kubernetes, Docker, and ContainerD, filling gaps in existing developer tooling.
Kubesimplify is thrilled to engage with the community, share insights, and learn from fellow enthusiasts. Be sure to add these sessions to your schedule, and Saiyam Pathak + Saloni Narang can't wait to connect with you in Barcelona and Paris!
Be ready for KubeCon and WasmIO by learning about WebAssembly and Kubernetes from our our workshops!
https://www.youtube.com/watch?v=PN3VqbZqmD8
 
https://www.youtube.com/watch?v=eYekV2Do0YU
 
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Platform Engineering Demystified -  Navigating the Basics
Kunal Verma — Sat, 24 Feb 2024 17:30:50 GMT
Picture this: You are working as a Developer in a fast-growing tech startup. A startup which is centered around the whole idea of operating through DevOps - the whole philosophy of developers & operations collaboration, establishing an automated workflow and all that fun stuff. The main aim being - creating a seamless and automated pipeline for faster software delivery to customers.
But, as the startup expands its portfolio of projects, you notice a subtle shift in the work environment! What was once a very streamlined and efficient process, powered by the DevOps principles now looked more like a complicated puzzle, with missing pieces and connections out of sync. Rolling out new updates became a challenge, dev environments for different teams lacked uniformity, automation scripts written in different languages and formats scattered all over the place, and finding & fixing bugs felt like solving a mystery in itself.
Overall, what was once a smooth collaboration between developers and operations was turning into a bit of a struggle, leaving everyone in team stressed and in need of a new solution!
Sounds pretty familiar, right? Because this scenario isn't unique anymore. Its the story of many companies navigating the path of expansion and innovation today. The very philosophy that initially fueled productivity  DevOps  now seems to be causing more headaches than solutions. And that's a hard yet compelling truth of modern software delivery!
Interestingly, the solution to this (atleast, its believed to be) already exists and is being adopted very rapidly by big tech giants and even early-stage startups. And that solution is - Platform Engineering.
Platform Engineering is not just a buzzword; it's a strategic shift from the pitfalls of today's DevOps reality to a modern software delivery approach.
In this blog post, well demystify the basic concepts of Platform Engineering, building a solid foundation by understanding how its a natural and necessary evolution to DevOps.
The rise of DevOps - An Answer to Traditional Software Delivery
To understand today's software reality, we need to take a step back and understand the reasons and events that led to the advent (rise) of DevOps in the whole software development workflow.
If we take a look at the traditional approach to software development and delivery back in the time (probably in the late 90s and early 2000s), making software was a bit like building a house in separate parts. On one side we had the developers, fully dedicating their time to "crafting the code" and on the other side, the operations team - responsible for the the actual software delivery, the underlying infrastructure and the maintenance of that infrastructure.
As software grew in size and complexity, this "siloed approach" led to communication gaps, a lot of delays, and a lack of synergy among these internal teams. On one hand, if developers wanted to get anything done to run their applications, they had to go through the operations team and on the other hand, the operations were totally dependent on the developers for any issues in the production environment. This approach was termed as - throw over the fence workflow, which eventually led to poor experiences on both sides of the fence. As an industry, we all agreed this was not the ideal we should aspire for!
Thus, DevOps quickly established itself as a solution; a cultural shift that aimed to break down this "silo" and foster collaboration between both sides of the fence. The main aim being - to align the Development and Operations teams towards the goal of delivering software at higher speeds. Complimented with the modern approach of Continuous Integration and Continuous Delivery (CI/CD), the aim was to introduce automation and smooth collaboration in the software lifecycle for a faster and more reliable software delivery, plus reducing the cognitive load for both the teams at the same time.
Going Beyond DevOps - Pitfalls in Today's Software Reality
Now, although DevOps as a philosophy started with a great intent and did solve many problems, we also saw scenarios where following DevOps clearly didnt make sense and it seemed to be causing more headaches than solutions. Let us analyse how this happened!
We all agree that the decade of 2010 saw a tremendous growth in cloud adoption and containers. I often like to refer this phase as a revolution in the software industry and there are a lot of factual data available that proves this point.
To give you a glimpse of this adoption phase:
AWS went from 100K customers in 2010 to a million in 2015. Docker which was introduced in 2013 had 100K companies using/evaluating in 2014, and within a year, a million companies were using containers in the form of Docker 😲
Ill not get into the details, but essentially, this enabled organisations to build softwares with a bit more complex architectures that can be easily scaled and distributed very well at the same time. But, with this increasing complexity of software the areas of concern for an organisation werent just limited to a faster application development and delivery. With an increasing adoption of cloud native philosophies such as security, reliability, availability and the development of a diverse variety of solutions to achieve these, organisations had to adapt their existing strategies to tackle these broader areas for long-term success in the ever-changing tech ecosystem.
Thus, this also meant that setups became a lot more complex. Those were long gone when you just needed to run a single script to deploy a simple application on a server, connected with a database service.
Let us see an analogy to understand this in a better way.
Imagine you have a single DevOps team (developers + operations combined - lets call it Team A) to manage the entire workflow of software delivery. This includes everything from application development and infrastructure provisioning to application deployment and continuous maintenance of the production environment. It's a continuous cycle.
To streamline these operations, the team wisely chooses a specific toolchain - Kubernetes for orchestration, Terraform of infrastructure provisioning, Jenkins for the automation pipeline and so on for all the different stages of a DevOps cycle.
Now, with increase in software complexity, you would agree that achieving efficiency demands scaling up, so now we have another DevOps team in the process. Lets call it Team B.
The interesting part is, with a lot of solutions around for the same problem, they now have the freedom to choose a different toolchain to manage their segment of the underlying software infrastructure, in their own way. For instance, they might go for Amazon EKS for orchestration, Pulumi for infrastructure provisioning, and so forth.
So now, we find ourselves with two DevOps teams, each operating with a distinct toolchain, all under the umbrella of the same organization.
Sounds efficient? Well, lets elevate this scenario and imagine 10x more teams working in the similar fashion to build different parts of a complex software.
Not sounding much efficient now, right? And the truth is, it isnt at all and here are a couple of reasons why:
Re-defining roles in You Build it, You Run it
  The idea of "You Build it, You Run it" means developers are not just writing code but also responsible for managing the entire process, from testing to deploying and running applications. While this may seem efficient, it puts extra pressure on developers to learn and handle infrastructure tasks. This can divert their focus from their main expertise - coding and building application logic, thus may result in decreased application quality and overall user experience.
Hard to Scale
  When multiple DevOps teams use different toolchains for similar tasks - for example, Team A is using Terraform for defining their cloud infrastructure, but Team B is performing the same task with Pulumi. This leads to a lack of consistency. As the number of teams grows, managing and coordinating becomes complex, resulting in a bottleneck for software delivery. This lack of uniformity in the organisation hinders scalability and efficiency.
Lack of Expertise
  With the "You Build it, You Run it" approach, developers might need to stretch their expertise beyond coding into the complexities of infrastructure and operations. This dual role can spread skills thin, potentially leading to gaps in specialized knowledge required for efficient problem-solving.
Increase in TicketOps
  Now, developers, dealing with both coding and operations, might find themselves seeking help more often. Therefore, relying on Ops teams for rescue missions increases the number of support tickets. This dependency on Ops can slow downthe workflow, creating a bottleneck in issue resolution.
Inconsistency across the Organisation
  The diversity of toolchains used by different DevOps teams within the same organization leads to inconsistency. Each team might prefer different tools and services, making it challenging to maintain a standardized approach. This lack of uniformity can result in confusion, inefficiencies, and hinder collaboration.
Overall, while the initial concept of combining development and operations aimed for efficiency, a consistent pattern of the observed challenges lead to re-evaluation of this approach. The goal being simple - to find a balance that ensures efficiency, scalability, and expertise while maintaining consistency across the organization!
The Era Of Platforms
Alright, I hope we agree that DevOps started with a great intent, but due to increased software complexity with time, we are in the need of a different and a modern strategic approach to meet todays software needs.
Essentially, the most basic need is to standardise the vast pool of toolchains being used by different teams under the same organisation, to streamline operations and reduce overall complexity of software production. And, this can be achieved by building a Platform.
In simple terms, a platform is developer-friendly interface that grants easy access to underlying infrastructure technologies like VMs or cloud services, eliminating the need for an in-depth understanding of the technical nitty-gritty behind them. It basically makes things easier by combining different tools and services in one place, so developers can focus on what they do best i.e. building the application logic, instead of dealing with all the technical stuff underneath such as, the infrastructure itself.
Interestingly, this particular concept of building an internal platform that can be commonly accessed by different team is not very old! With the increasing adoption of cloud technologies as well as the growing complexity of applications, many advanced organisations and big tech giants like Google, Facebook, Airbnb, and Netflix have already been working on and adopted such an approach to increase their teams productivity, speed and flexibility for building complex softwares with maximum efficiency.
Although, well be looking into platforms from a bit more technical standpoint in the next section, let's quickly highlight the advantages this methodology brings to the table:
Developer-Centric Access: Platforms follow a more developer-friendly and self-serving approach, offering easy access to standardized tools without requiring deep expertise. This enables developers to concentrate on building applications instead of getting entangled in infrastructure complexities, resulting in faster learning and time to market.
Standardization Principle: Standardization - thats the main and the most basic principle on which a platform methodology works upon. Platforms essentially provide a well-defined ecosystem to developers by gluing together applications and the infrastructure underneath them. This ensures that every distributed team within an organization adheres to the same set of tools and processes, fostering better collaboration and reducing cognitive load on team members.
"You Build it, You Run it" Simplified: Developers should be able to deploy and run their apps and services end to end - that was the principle of You Build it, You Run it and honestly, the main challenge as well at the same time (as we discovered above). With Platforms coming into the picture, developers now have the flexibility to focus on their core expertise i.e. building the application logic itself (coding), without the need to worry or delve into the complexities of the underlying infrastructure and the technologies associated with it. Thus, fully dedicating their time and efforts in developing new features, building an efficient application logic, fixing bugs instead of worrying about - how do I provision an EKS cluster to run and test my application?.
Platform Engineering - The Evolution of DevOps
So, what is Platform Engineering? It's the art of designing, building, and maintaining platforms - simple as that! Well, atleast thats the most basic way to define a rather complex philosophy with a lot of different parts to it, but well eventually get into those.
The primary goal? To improve the developer experience and productivity by providing self-service capabilities with automated infrastructure operations. As we learned in the previous section, this goal can be achieved by standardizing the usage of tools and processes across the organisation with a unified (common) internal platform, that caters to all the non-functional requirements of the application (the actual software), based on the developers request.
📍What are the non-functional requirements of an Application?
Non-functional requirements are like the behind-the-scenes tasks needed to get a finished software to users. They're not the actual business logic or code but important for making sure everything runs smoothly and is complete.
Think of them as the backstage crew making sure the show goes on seamlessly for the audience!
Here are some of the basic non-functional requirements needed by todays software:
To achieve this in an organisation, we have a dedicated platform engineering team. This team is responsible for the entiregroundwork of designing, building and maintaining these internal platforms.
To delve a bit deeper, lets elaborate this a bit more by understanding the components of Platform Engineering.
Components Of Platform Engineering
TLDR; You essentially need 3 things to implement platform engineering the right way:
the platform itself,
a platform team (comprising of platform engineers) and,
a correct ideology to follow for maximum results.
Let us elaborate these a bit more.
Internal Developer Platform (IDP) - The Platform Itself
The most integral part of implementing the Platform Engineering philosophy in an organisation is the platform itself, which is referred to as the Internal Developer Platform or IDP.
If we go by the official definition of an IDP:
An Internal Developer Platform (IDP) is built by a platform team to enable developer self-service. An IDP consists of many different techs and tools, glued together in a way that lowers cognitive load on developers without abstracting away context and underlying technologies.
In simple terms, IDPs are configured by a separate platform engineering team and used by developers. Again, the goal is to standardize the the use of tools and processes throughout the organization with this unified (common) internal platform. Therefore, the platform team is fully responsible for specifying what resources start up with what environment or at what request. The tools, the underlying the services and the necessary permissions needed to run those services all are the responsibilities of platform engineers. Thus, giving a whole lot of flexibility to the developers. They can now effortlessly request their preferred tool or service, dynamically configure it based on specific use cases, and concentrate on building the actual application at an accelerated pace.
The Platform Team - Actual Builders
To elaborate a bit more on the platform team, it consists of platform engineers who primarily build, run, configure and maintain the IDP. This team focuses on standardization by design, infrastructure, service level permissions, and configure the IDP to automate recurring or repetitive tasks, such as spinning up resources or environments for developers. In the end, the Developer teams gain flexibility of changing configurations, deploying the application, spinning up fully provisioned environments, and much more.
Hold On A Second!
But, hold on a second! If I'm getting this right, there's now a separate Platform team which is responsible for building this common platform (IDP) with a set of fixed tools/services (that too company-wide) which they decide and as a developer, I can ask for what I need and how much I need based on my own requirements. Doesnt this defeat the purpose of the collaborative approach of DevOps itself, where we brought together the responsibilities of both development and operations to avoid the old "siloed" way of doing things? Because it seems like now, as a developer, I'd be constantly reaching out to the platform team for the services I want.
It absolutely does sound that way and we definitely agree to not walk on that path again, right?
Theres actually more to this and the interesting part lies here!
What a platform team actually does is, apply their Ops expertise in configuring the actual underlying infrastructure tools and service - lets say Kubernetes for orchestration, Google cloud as a cloud platform, database service etc - essentially, all the tools necessary to meet the non-functional requirements of an application, as discussed earlier. On top of this, they build an abstraction layer in the form of a user interface, consisting of those same tools and services, ready to be used and without the underlying complexity - which we refer to as an Internal Developer Platform (IDP).
Moving forward, from here on the developers can self-serve their needs of different tools and services through this internal platform, without worrying about:
the nitty-gritty of the underlying infrastructure,
learning that specific tool or service end-to-end in order to implement it and,
constantly requesting new resources/services directly from the platform team.
This is a game changer and I hope you can see that too now!
When using these IDPs, developers now have the flexibility to choose the right level of abstraction for running their apps and services, based on their preferences. For instance, do they like messing around with Helm charts, YAML files and Terraform modules? Great, they can do so. Are they a junior frontend developer who doesnt care if the app is running on GKE or EKS? Fantastic, they can just self-serve an environment that comes fully provisioned with everything they need to deploy and test their code, without worrying where it runs!
To end this section, we mentioned above about following the correct ideology when building these internal developer platforms for maximising the results and efficiency of all the teams. What is that correct ideology? Well, that is something well be covering in a future article because of the topic complexity and importance.
Leveraging Platform Engineering to Address DevOps Challenges
In one of the previous sections, we talked about the challenges and pitfalls associated with DevOps when used in todays software workflow. Let us briefly see how the principles of Platform Engineering come out as a solution to those challenges.
"You Build it, You Run it" Harmony: We talked about the cognitive load on developers how due to Ops being embedded in Dev. With Platform Engineering, this principle gets refined. Developers can now focus on building the applications using readily available, self-serviced infrastructure tools and services provided by a common internal platform, while a dedicated platform team manages the entire underlying operational aspects. This establishes a collaborative bridge between Dev and Ops, without overwhelming either side with additional responsibilities.
No More Expertise Stress: Internal Developer Platforms act as a technical facilitator, reducing the burden on developers to have extensive expertise in all areas of the software lifecycle (development and operational knowledge). The platform team handles the intricate technical aspects, allowing developers to focus on their core responsibilities i.e building the actual application logic.
Bye-bye TicketOps: Internal Developer Platforms empower developers through self-service capabilities, allowing them to access necessary tools and resources whenever and however they need. This essentially eliminates the need for constant ticket submissions or requests for new services to the platform team. Thus, fostering a more streamlined and efficient workflow.
Consistency for All: Platforms bring order to the tech chaos. Everyone in the team follows the same rules and tools across the organisation, ensuring a smooth and consistent approach. This results in accelerated and efficient software delivery, as everyone operates within the same framework.
Wrapping Up - Conclusion
Yes, it might seem surprising to reach the conclusion already, but our main goal was to lay the ground work for understanding the fundamental concepts of platform engineering and, most importantly, to answer the crucial question: why is it needed?
The knowledge gained here will definitely help building a solid foundation for upcoming articles that will delve deeper into the intricate world of Platform Engineering.
Rest assured, there's much more to explore in Platform Engineering besides it being a hype and a natural evolution of DevOps.
If you have any further questions, feel free to reach out to me on Twitter/X.
Happy learning and see you in the next one!
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Automated GitHub Releases with GitHub Actions and Conventional Commits
Krish Gupta — Mon, 12 Feb 2024 12:30:45 GMT
Releases are a very important way to:
Track versioning
Showcase changes
Acknowledge contributors
Distribute Binaries
But, who does releases manually? That is boring. True engineers spend 6 hours automating tasks that take 6 minutes! So let's build a CI/CD Pipeline to automate this:
Problems
Let's break this problem down into smaller bits. I need to:
Check if my application works so that I don't release any broken code
Store the previous version to iterate only forwards
Figure out how I want to bump the version (spoiler: we use conventional commits)
Showcase all the commits in the release
Acknowledge the contributors by using GitHub's release template
Upload the binary assets to GitHub in the release
Check if my application works so that I don't release any broken code
Let's create a simple GitHub action that checks if my application is up to standards:
name: CIon:  push:    branches:      - main  pull_request:permissions:  contents: write  packages: write  pull-requests: writeenv:  GO_VERSION: 1.21.3  APP_NAME: go-todo-apijobs:  lint:    name: Lint    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Lint ${{ env.APP_NAME }}        run: go vet ./...  build:    name: Build    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Build ${{ env.APP_NAME }}        run: |          chmod +x ./scripts/build.sh          ./scripts/build.sh  test:    name: Test    needs: build    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: "Warning: No test cases"        run: echo "Reminder to create test cases"      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Test ${{ env.APP_NAME }}        run: go test -v ./...
That looks like a lot of stuff, let me break it down:
We give the workflow a name: CI
Set it to trigger on push to the main branch and pull_request (I know we only release from the pushes, but having CI for preview environments is like a bonus cherry on top)
We give it some permissions, we want it to be able to:
push (contents permission)
pull-requests (to create the pull request that will merge in the release)
OPTIONAL: packages used to publish to the GitHub Container Registry
We create two environment variables simply to reuse them later:
GO_VERSION: If I update my go versions I don't want to go and change it everywhere, so this env variable is for standardising the GO_VERSION
APP_NAME: This comes in handy when I want to push to dockerhub and create the binaries with proper naming! We also use it to name the steps and jobs so that they don't look like 'Linting project-name' rather than linting.
We create 3 jobs:
lint:
We run actions/checkout
Setup Go with actions/setup-go, with our ENV variable of GO_VERSION replace this with your programming language
We install modules by running go mod download and go mod verify
We run go vet ./... to test the app. Replace with your test command.
build:
We duplicate the lint, rename lint to build and replace go vet ./... with the build script
test:
We duplicate the lint, rename lint to test and replace go vet ./... with the go test -v ./...
Store the previous version to iterate only forwards
Let's create a file called package.yaml inside .github with one property called version. If you are using Node.Js then don't do it (we will use package.json instead).
Figure out how I want to bump the version
There is a specification called Conventional Commits for making commit messages both human-readable and machine-readable. The commit message is:
[optional scope]: [optional body][optional footer(s)]
Inside 1.2.11 as the version of a software:
1 is for major releases, only used when breaking changes are introduced
2 is for
The important thing is  here, if the type is one of:
feat: Feature enhancement / Minor release, if the previous version is 0.2.0 it becomes 0.3.0
fix: Bug Fix / Patch release, if the previous version is 0.2.1, it bumps to 0.2.2
FYI, if you get to 0.9.0 the next release will be 0.10.0 and not 1.0.0. But then how do we get to 1.0.0?
We can make a commit like feat!: breaking change to push the version to 1.0.0 or you can also do:
[optional scope]: BREAKING CHANGE: update description[optional footer(s)]
💡
FYI, you should only do a 'major' release when you make a change that is big enough to make previous ways of using it not work anymore.
So now, we can use the tooling for conventional-commits to bump the version and showcase the changes.
Showcase all the commits in the release
We will use the TriPSs/conventional-changelog-action action for this:
  changelog:    name: Changelog    needs:      - lint      - build      - test    if: github.event_name != 'pull_request'    runs-on: ubuntu-latest    outputs:      skipped: ${{ steps.changelog.outputs.skipped }}      tag: ${{ steps.changelog.outputs.tag }}      clean_changelog: ${{ steps.changelog.outputs.clean_changelog }}      version: ${{ steps.changelog.outputs.version }}    env:      PR_BRANCH: release-ci-${{ github.sha }}    steps:      - uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Create Branch        run: |          git checkout -b ${{ env.PR_BRANCH }}      - name: Create Changelog        uses: TriPSs/conventional-changelog-action@dd734f74fce61a6e02f821ee1b5930bc79a23534 # v5        id: changelog        with:          github-token: ${{ github.token }}          git-user-name: "github-actions[bot]"          git-user-email: "github-actions[bot]@users.noreply.github.com"          git-branch: ${{ env.PR_BRANCH }}          skip-git-pull: true          output-file: false          version-file: .github/package.yaml          create-summary: true      - name: Create Changelog PR        if: steps.changelog.outputs.skipped == 'false'        run: |          gh pr create --base main --head ${{ env.PR_BRANCH }} --title 'chore(release): ${{ steps.changelog.outputs.tag }} [skip-ci]' --body '${{ steps.changelog.outputs.clean_changelog }}'        env:          GH_TOKEN: ${{ github.token }}      - name: Approve Changelog PR        if: steps.changelog.outputs.skipped == 'false'        run: |          gh pr review --approve ${{ env.PR_BRANCH }}        env:          GH_TOKEN: ${{ secrets.GH_OWNER_TOKEN }}      - name: Merge Changelog PR        if: steps.changelog.outputs.skipped == 'false'        run: |          gh pr merge --squash --auto --delete-branch ${{ env.PR_BRANCH }}        env:          GH_TOKEN: ${{ secrets.GH_OWNER_TOKEN }}
Breaking it down:
We have a changelog job that depends on lint, build and test
It runs only if the event name is not pull_request
We have defined the outputs so that we access them in the further jobs!
We declare a PR_BRANCH an environment variable to reuse the PR branch across all the jobs
We created the branch for the PR git checkout -b ${{ env.PR_BRANCH }}
TriPSs/conventional-changelog-action has all the options specified, you can look at the docs to see what they do!
The only important options are version-file which should be package.json for NodeJs Users, and .github/package.yaml (the file we created) for most other users. output-file is for changelog, I don't like having a changelog file because I can just use the GitHub Releases page to replace it.
Then we create the PR to update the version gh pr create
Finally, we merge it with A SECRET called GH_OWNER_TOKEN
💡
Make sure to add the GH_OWNER_TOKEN with the permissions necessary to merge the PR
This does 3 things:
Creates the tag on GitHub
Makes the PR to update the version on GitHub
Create an automated change log
The rest of the problems in bulk
Now we finally add a release job with softprops/action-gh-release
  release:    name: Release    needs: changelog    if: github.event_name != 'pull_request' && needs.changelog.outputs.skipped == 'false'    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Cross-Build ${{ env.APP_NAME }}        run: |          chmod +x ./scripts/build.sh          CROSS_BUILD=true APP_NAME=${{ env.APP_NAME }} VERSION=${{ needs.changelog.outputs.version }} ./scripts/build.sh      - name: Create Release        uses: softprops/action-gh-release@de2c0eb89ae2a093876385947365aca7b0e5f844 # v1        with:          token: ${{ secrets.GH_OWNER_TOKEN }}          tag_name: ${{ needs.changelog.outputs.tag }}          prerelease: false          draft: false          files: bin/*          generate_release_notes: true          name: ${{ needs.changelog.outputs.tag }}          body: |                          🤖 Autogenerated Conventional Changelog
            ${{ needs.changelog.outputs.clean_changelog }}            
Breaking down what we've done here:
We only execute this job changelog.outputs.skipped is not false
We set up to repeat all the things we did during build
softprops/action-gh-release is being used to create the release on GitHub
We tell it the tag we get from the changelog job
generate_release_notes is set to true to generate the GitHub-style release notes + it creates those cool contributor shoutouts
files tells it which files to upload with the release
body uses the changelog from the changelog step
Let's look at the progress
This is the workflow we have right now. It builds, tests, figures out what release to make and makes a release on GitHub.
name: CIon:  push:    branches:      - main  pull_request:permissions:  contents: write  packages: write  pull-requests: writeenv:  GO_VERSION: 1.21.3  APP_NAME: go-todo-apijobs:  lint:    name: Lint    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Lint ${{ env.APP_NAME }}        run: go vet ./...  build:    name: Build    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Build ${{ env.APP_NAME }}        run: |          chmod +x ./scripts/build.sh          ./scripts/build.sh  test:    name: Test    needs: build    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: "Warning: No test cases"        run: echo "Reminder to create test cases"      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Test ${{ env.APP_NAME }}        run: go test -v ./...  changelog:    name: Changelog    needs:      - lint      - build      - test    if: github.event_name != 'pull_request'    runs-on: ubuntu-latest    outputs:      skipped: ${{ steps.changelog.outputs.skipped }}      tag: ${{ steps.changelog.outputs.tag }}      clean_changelog: ${{ steps.changelog.outputs.clean_changelog }}      version: ${{ steps.changelog.outputs.version }}    env:      PR_BRANCH: release-ci-${{ github.sha }}    steps:      - uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Create Branch        run: |          git checkout -b ${{ env.PR_BRANCH }}      - name: Create Changelog        uses: TriPSs/conventional-changelog-action@dd734f74fce61a6e02f821ee1b5930bc79a23534 # v5        id: changelog        with:          github-token: ${{ github.token }}          git-user-name: "github-actions[bot]"          git-user-email: "github-actions[bot]@users.noreply.github.com"          git-branch: ${{ env.PR_BRANCH }}          skip-git-pull: true          output-file: false          version-file: .github/package.yaml          create-summary: true      - name: Create Changelog PR        if: steps.changelog.outputs.skipped == 'false'        run: |          gh pr create --base main --head ${{ env.PR_BRANCH }} --title 'chore(release): ${{ steps.changelog.outputs.tag }} [skip-ci]' --body '${{ steps.changelog.outputs.clean_changelog }}'        env:          GH_TOKEN: ${{ github.token }}      - name: Approve Changelog PR        if: steps.changelog.outputs.skipped == 'false'        run: |          gh pr review --approve ${{ env.PR_BRANCH }}        env:          GH_TOKEN: ${{ secrets.GH_OWNER_TOKEN }}      - name: Merge Changelog PR        if: steps.changelog.outputs.skipped == 'false'        run: |          gh pr merge --squash --auto --delete-branch ${{ env.PR_BRANCH }}        env:          GH_TOKEN: ${{ secrets.GH_OWNER_TOKEN }}  release:    name: Release    needs: changelog    if: github.event_name != 'pull_request' && needs.changelog.outputs.skipped == 'false'    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Setup Go        uses: actions/setup-go@0c52d547c9bc32b1aa3301fd7a9cb496313a4491 # v5        with:          go-version: ${{ env.GO_VERSION }}      - name: Install Dependencies        run: go mod download      - name: Verify Dependencies        run: go mod verify      - name: Cross-Build ${{ env.APP_NAME }}        run: |          chmod +x ./scripts/build.sh          CROSS_BUILD=true APP_NAME=${{ env.APP_NAME }} VERSION=${{ needs.changelog.outputs.version }} ./scripts/build.sh      - name: Create Release        uses: softprops/action-gh-release@de2c0eb89ae2a093876385947365aca7b0e5f844 # v1        with:          token: ${{ secrets.GH_OWNER_TOKEN }}          tag_name: ${{ needs.changelog.outputs.tag }}          prerelease: false          draft: false          files: bin/*          generate_release_notes: true          name: ${{ needs.changelog.outputs.tag }}          body: |                          🤖 Autogenerated Conventional Changelog
            ${{ needs.changelog.outputs.clean_changelog }}            
Now we have an awesome release automation that releases every single time we push to GitHub 🚀
Bonus: Publishing to DockerHub & GHCR
If you have a Dockerfile, let's build an image and push it to DockerHub & GHCR.
  deploy:    name: Deploy Image    needs: changelog    if: github.event_name != 'pull_request' && needs.changelog.outputs.skipped == 'false'    runs-on: ubuntu-latest    steps:      - name: Checkout        uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4      - name: Login docker.io        uses: docker/login-action@343f7c4344506bcbf9b4de18042ae17996df046d # v3        with:          registry: docker.io          username: ${{ secrets.DOCKER_USERNAME }}          password: ${{ secrets.DOCKER_PASSWORD }}      - name: Login ghcr.io        uses: docker/login-action@343f7c4344506bcbf9b4de18042ae17996df046d # v3        with:          registry: ghcr.io          username: ${{ github.repository_owner }}          password: ${{ secrets.GH_OWNER_TOKEN }}      - name: Setup Docker Metadata        uses: docker/metadata-action@dbef88086f6cef02e264edb7dbf63250c17cef6c # v5        id: meta        with:          images: |            docker.io/${{ secrets.DOCKER_USERNAME }}/${{ env.APP_NAME }}            ghcr.io/${{ github.repository_owner }}/${{ env.APP_NAME }}          tags: |            latest            ${{ needs.changelog.outputs.version }}            ${{ github.sha }}      - name: Build and Push Docker Image        uses: docker/build-push-action@4a13e500e55cf31b7a5d59a38ab2040ab0f42f56 # v5        with:          context: .          push: true          tags: ${{ steps.meta.outputs.tags }}          labels: ${{ steps.meta.outputs.labels }}
We created a deploy job in the workflow, here is what it does:
Once we have successfully made the release
We authenticate to 2 Docker Registries using the official docker/login-action
Using docker/metadata-action we set up metadata of the action, including:
Image name, we define the image name using APP_NAME variable we declared earlier for both the docker hub and ghcr.
We set three tags for each of the images: latest, the version of the release and the SHA of the commit
Finally, we build and push to the registries using the docker/build-push-action with the Dockerfile in the root of the repository.
💡
Make sure to create the DOCKER_USERNAME and DOCKER_PASSWORD to push to DockerHub.
Here's the source code for the workflow file, and this is an example run.  
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Reducing Image Size using Multi-stage builds for a Go application
Krish Gupta — Mon, 05 Feb 2024 04:30:18 GMT
Introduction
Anyone whos built their own containers, either for local development or for cloud deployment, knows the advantages of keeping container sizes small. In most cases, keeping the container image size small translates to real dollars saved by reducing bandwidth and storage costs on the cloud. In addition, smaller images ensure faster transfer and deployments when using them in a CI/CD server.
- The Official Docker Blog
Therefore smaller images translate to:
Lesser Bandwidth Consumption
Lesser Storage Consumption
Faster Deployments on CI/CD
Which all translates to saving money 🤑
What are multi-stage builds?
Docker has a very cool feature called multi-stage builds, it allows you to build the image with one base and run with another. Let's have a look:
With multi-stage builds, the image get rid of:
The heavy base image you needed to build the application
The source code
The dependencies need to "build" the application.
Golang programs are the perfect lab rats to carry out this on, why?
Golang applications compile into "binaries" with customizable OS/ARCH
The binary is standalone and does not need "go" in the environment or any of the dependencies
The lab-rat
So for this 'article', I am creating a hello-world gin-gonic API, you can skip to the next section if you already have a Golang project.
Let's first initialize the application:
go mod init go-gin-api-templatego get -u github.com/gin-gonic/gintouch main.go
Now, let's write the code, inside of main.go.
package mainimport "github.com/gin-gonic/gin"func main() {    router := gin.Default()    router.GET("/", func(c *gin.Context) {        c.JSON(200, gin.H{            "message": "Hello World!",        })    })    router.Run(":8080")}
This does 4 things:
Imports gin-gonic as a dependency
Use gin gonic to create a router
Set the / endpoint handler to a "Hello World" returner
Listens on port:8080 for requests
Now let's create the docker image how we would do it usually (without multistage builds)
FROM --platform=linux/amd64 golang:1.21.6-alpine@sha256:fd78f2fb1e49bcf343079bbbb851c936a18fc694df993cbddaa24ace0cc724c5WORKDIR /appCOPY . .RUN go getRUN go build -tags=jsoniter -o app .EXPOSE 8080CMD ["./app"]
In this Dockerfile:
I am using golang-alpine as the base image
Copying the entire source code (COPY . .)
Installing dependencies (RUN go get)
Building in the app as the filename 'app' (RUN go build -tags=jsoniter -o app .)
Now let's build this image:
Running it with docker run -p 8080:8080 and looks like it works 🔥
For reference, I built the image twice and the size is 582MB
The image is one GitHub Container Registry and the Repository Source/Template is also on GitHub
Implementing multi-stage builds
To implement that let's start with the original Dockerfile we already have:
FROM --platform=linux/amd64 golang:1.21.6-alpine@sha256:fd78f2fb1e49bcf343079bbbb851c936a18fc694df993cbddaa24ace0cc724c5WORKDIR /appCOPY . .RUN go getRUN go build -tags=jsoniter -o app .EXPOSE 8080CMD ["./app"]
Now, first of all, let's give the first base image a name,
- FROM --platform=linux/amd64 golang:1.21.6-alpine@sha256:fd78f2fb1e49bcf343079bbbb851c936a18fc694df993cbddaa24ace0cc724c5+ FROM --platform=linux/amd64 golang:1.21.6-alpine as builder
Next up let's after RUN go build -tags=jsoniter -o app . let's add some code to configure the second base image:
FROM alpine:latestWORKDIR /appCOPY --from=builder /app/app .
So now the final Dockerfile is:
FROM --platform=linux/amd64 golang:1.21.6-alpine as builderWORKDIR /appCOPY . .RUN go getRUN go build -tags=jsoniter -o app .FROM alpine:latestWORKDIR /appCOPY --from=builder /app/app .CMD ["./app"]EXPOSE 8080
Let's run docker build now!
Diving into each line of the Dockerfile
Let's look at what we are doing in the multi-stage build Dockerfile
FROM --platform=linux/amd64 golang:1.21.6-alpine as builderWORKDIR /appCOPY . .RUN go getRUN go build -tags=jsoniter -o app .FROM alpine:latestWORKDIR /appCOPY --from=builder /app/app .CMD ["./app"]EXPOSE 8080
In the first line, FROM --platform=linux/amd64 golang:1.21.6-alpine as builder we are taking the Golang image from dockerhub and using it as our base image under the name of builder
Second line we switch to the workdir /app, and third, we copy all the files we have in the folder to /app in the builder
In the fourth line, we execute go get to install all of our dependencies
Finally, in the fifth line, we build the application with the go build command (go build -tags=jsoniter -o app .). This created an executable binary at /app/app inside the builder (golang image).
Then from the 6th line we move to a new base image (alpine), consider it like a total image we are creating. Alpine is a very lightweight image which is why we chose it.
Inside Alpine, we /app/app from builder to use it
We tell docker that this image is run using the ./app binary in Alpine's workdir (the end path being: /app/app)
Finally, we write EXPOSE 8080 so that the user knows that they have to run it with: docker run -p 8080:8080 image-name
Things to be careful of
You need to make sure that the final base image has all the dependencies or requirements.
This cannot be used if your project is not standalone, for example: NodeJs projects (It's alright if you use it to transpile typescript)
Some tooling such as TestContainers does not support it. Refer to testcontainers/testcontainers-java#1112
FYI It does increase the build time, it took around ~35 seconds to build the first image and ~60 seconds to build the second one
Result
So the original image was roughly 582MB and now it comes around to 18.8MB
That is ~30.9 times less than the original image 😱 This is how docker multi-stage builds can help us in reducing the image size!
Once again the image and the source are available on GitHub
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.


Automate the creation of Kubernetes self-managed HA cluster
Dipankar Das — Mon, 29 Jan 2024 12:30:51 GMT
Introduction
[WARN] you should know about kubernetes!
here are some suggested reads and Kubernetes workshop that you can watch to get upto speed with Kubernetes
https://www.youtube.com/watch?v=PN3VqbZqmD8
 
https://blog.kubesimplify.com/kubernetes-containerd-setup
https://blog.kubesimplify.com/pods-in-kubernetes
https://blog.kubesimplify.com/understanding-etcd-in-kubernetes-a-beginners-guide
With the rise of managed Kuberentes, most companies opt for it but yet there are cases when you would need complete control of your Kubernetes cluster, managing the control plane, handling the backups when you are deploying at edge or in your own data center or otherwise. In such cases you have to go for a self managed Kubernetes cluster on the nodes which are wither VM's in the cloud or bare metal instances. In this post, we will dive intro creation of a complete self managed HA Kubernetes cluster with different configurations.
Let's try to understand from the design prospective, when we talk about a highly available Kubernetes clusters, it means that it can withstand failures even when the control plan goes down or etcd goes down. In the below design architecture can see that for a HA Kubernetes cluster there are
3 Controlplane nodes
3 etcd nodes that are outside of the cluster
HA Proxy for loadbalancing the traffic going to the controlplane nodes
Let's start rolling....
Assumptions & Prerequisites
For the steps to be done in this blog, you need to provision the infrastructure and you can choose any cloud provider for doing that. To perform all the next steps you would need to provision a total of 9 Virtual machines(3 for controlplane, 3 for etcd, 1 for Ha Proxy and 2 worker nodes).
Ksctl is a cloud agnostic infrastructure management tool and currently the POC for HA cluster in in progress.
https://github.com/ksctl/enhancements/tree/main/poc/etcd#readme
 
Note we will be doing Etcd TLS configuration
client connection is self-signed tls certificate
peer connection is auto-tls certificate
  (If you want this as well with self-managed tls you can generate it, for this blog will go with auto-tls for peer conn)
Make Sure you are a Root User when executing script
Network Map
Hostname Role Private IP public IP
lb-0 LoadBalancer 192.168.1.8 74.220.22.92
- - - -
db-0 Etcd-0 192.168.1.2 -
db-1 Etcd-1 192.168.1.3 -
db-2 Etcd-2 192.168.1.4 -
- - - -
cp-0 Control-Plane-0 192.168.1.9 -
cp-1 Control-Plane-1 192.168.1.10 -
cp-2 Control-Plane-2 192.168.1.11 -
- - - -
wp-0 Worker-Plane-0 192.168.1.12 -
wp-1 Worker-Plane-1 192.168.1.13 -
Above are the set of VM's we have provisioned.
Step 1: Install tools according to the role of the VM
Run this on all Etcd VMs
#!/bin/bashset -xeETCD_VER=v3.5.10# choose either URLGOOGLE_URL=https://storage.googleapis.com/etcdGITHUB_URL=https://github.com/etcd-io/etcd/releases/downloadDOWNLOAD_URL=${GOOGLE_URL}rm -f /tmp/etcd-${ETCD_VER}-linux-amd64.tar.gzrm -rf /tmp/etcd-download-test && mkdir -p /tmp/etcd-download-testcurl -L ${DOWNLOAD_URL}/${ETCD_VER}/etcd-${ETCD_VER}-linux-amd64.tar.gz -o /tmp/etcd-${ETCD_VER}-linux-amd64.tar.gztar xzvf /tmp/etcd-${ETCD_VER}-linux-amd64.tar.gz -C /tmp/etcd-download-test --strip-components=1rm -f /tmp/etcd-${ETCD_VER}-linux-amd64.tar.gzmv -v /tmp/etcd-download-test/etcd /usr/local/binmv -v /tmp/etcd-download-test/etcdctl /usr/local/binmv -v /tmp/etcd-download-test/etcdutl /usr/local/binrm -rf /tmp/etcd-download-testetcd --versionetcdctl versionetcdutl version
Creating the directory to hold Etcd certificates.
mkdir -p /var/lib/etcd
Reference
https://github.com/etcd-io/etcd/releases/tag/v3.5.10
 
Run this on all Control-plane and worker VMs
Below is the installation of the tools required for bootstrapping a Kubernetes cluster including kubelet, kubeadm and kubectl.
Note: In releases older than Debian 12 and Ubuntu 22.04, folder /etc/apt/keyrings does not exist by default, and it should be created before the curl command.
Reference
https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/install-kubeadm/
https://kubernetes.io/docs/setup/production-environment/container-runtimes/
#!/bin/bashset -xe############# NOTE: script for K8s v1.28 #############echo "memory swapoff"sudo sed -i '/ swap / s/^\(.*\)$/#\1/g' /etc/fstabsudo swapoff -acat <# sysctl params required by setup, params persist across rebootscat <# Apply sysctl params without rebootsudo sysctl --system# Status checklsmod | grep br_netfilterlsmod | grep overlaysysctl net.bridge.bridge-nf-call-iptables net.bridge.bridge-nf-call-ip6tables net.ipv4.ip_forward######### CONTAINER-D ###########sudo apt-get updatesudo apt-get install ca-certificates curl gnupgsudo install -m 0755 -d /etc/apt/keyringscurl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo gpg --dearmor -o /etc/apt/keyrings/docker.gpgsudo chmod a+r /etc/apt/keyrings/docker.gpgecho \  "deb [arch="$(dpkg --print-architecture)" signed-by=/etc/apt/keyrings/docker.gpg] https://download.docker.com/linux/ubuntu \  "$(. /etc/os-release && echo "$VERSION_CODENAME")" stable" | \  sudo tee /etc/apt/sources.list.d/docker.list > /dev/nullsudo apt-get updatesudo apt-get install containerd.io -ymkdir -p /etc/containerdcontainerd config default > /etc/containerd/config.tomlsudo systemctl restart containerdsudo systemctl enable containerdsudo sed -i 's/SystemdCgroup \= false/SystemdCgroup \= true/g' /etc/containerd/config.tomlsudo systemctl restart containerd################# Kubernetes Install #################sudo apt-get update -ysudo apt-get install -y apt-transport-https ca-certificates curl gpgcurl -fsSL https://pkgs.k8s.io/core:/stable:/v1.28/deb/Release.key | sudo gpg --dearmor -o /etc/apt/keyrings/kubernetes-apt-keyring.gpgecho 'deb [signed-by=/etc/apt/keyrings/kubernetes-apt-keyring.gpg] https://pkgs.k8s.io/core:/stable:/v1.28/deb/ /' | sudo tee /etc/apt/sources.list.d/kubernetes.listsudo apt-get updatesudo apt-get install -y kubelet kubeadm kubectlsudo apt-mark hold kubelet kubeadm kubectlsudo systemctl enable kubelet
Next up creating of directory to hold Etcd certificates.
mkdir -vp /etcd/kubernetes/pki/etcd/
Step 2: Configure Loadbalancer VM
Below is the general Script where you can replace the IP's according to your infrastructure setup.
#!/bin/bashset -xesudo apt updatesudo apt install haproxy -ysleep 2ssudo systemctl start haproxy && sudo systemctl enable haproxycat < haproxy.cfgfrontend kubernetes-frontend  bind *:6443  mode tcp  option tcplog  timeout client 10s  default_backend kubernetes-backendbackend kubernetes-backend  timeout connect 10s  timeout server 10s  mode tcp  option tcp-check  balance roundrobin  server k3sserver-0 ${Controlplane_Private_IP_0}:6443 check  server k3sserver-1 ${Controlplane_Private_IP_1}:6443 check  server k3sserver-2 ${Controlplane_Private_IP_2}:6443 checkfrontend nodeport-frontend  bind *:30000-35000  mode tcp  option tcplog  timeout client 10s  default_backend nodeport-backendbackend nodeport-backend  mode tcp  timeout connect 10s  timeout server 10s  balance roundrobin  server nodeport-0 ${Controlplane_Private_IP_0}  server nodeport-1 ${Controlplane_Private_IP_1}  server nodeport-2 ${Controlplane_Private_IP_2}EOFsudo mv haproxy.cfg /etc/haproxy/haproxy.cfgsudo systemctl restart haproxy
You can validate your haproxy configuration using below command.
haproxy -f /etc/haproxy/haproxy.cfg -c
Script according to our list of VM's:
#!/bin/bashset -xesudo apt updatesudo apt install haproxy -ysleep 2ssudo systemctl start haproxy && sudo systemctl enable haproxycat < haproxy.cfgfrontend kubernetes-frontend  bind *:6443  mode tcp  option tcplog  timeout client 10s  default_backend kubernetes-backendbackend kubernetes-backend  timeout connect 10s  timeout server 10s  mode tcp  option tcp-check  balance roundrobin  server k3sserver-0 192.168.1.9:6443 check  server k3sserver-1 192.168.1.10:6443 check  server k3sserver-2 192.168.1.11:6443 checkfrontend nodeport-frontend  bind *:30000-35000  mode tcp  option tcplog  timeout client 10s  default_backend nodeport-backendbackend nodeport-backend  mode tcp  timeout connect 10s  timeout server 10s  balance roundrobin  server nodeport-0 192.168.1.9  server nodeport-1 192.168.1.10  server nodeport-2 192.168.1.11EOFsudo mv haproxy.cfg /etc/haproxy/haproxy.cfgsudo systemctl restart haproxy
Step 3: Generate the self-signed certificate for client-server Etcd connection
Generation of certificates
the below steps generate
etcd-key.pem -> Client key
etcd.pem -> Client certificate
ca.pem -> CA certificate
There are 2 methods you can create them:
Manual way
Run this on your local system (MANUAL STEP TO GENERATE TLS CERTS)
cd opensslopenssl genrsa -out ca-key.pem 2048openssl req -new -key ca-key.pem -out ca-csr.pem -subj "/CN=etcd cluster"openssl x509 -req -in ca-csr.pem -out ca.pem -days 3650 -signkey ca-key.pem -sha256openssl genrsa -out etcd-key.pem 2048openssl req -new -key etcd-key.pem -out etcd-csr.pem -subj "/CN=etcd"echo subjectAltName = DNS:localhost,IP:192.168.1.2,IP:192.168.1.3,IP:192.168.1.4,IP:127.0.0.1 > extfile.cnfopenssl x509 -req -in etcd-csr.pem -CA ca.pem -CAkey ca-key.pem -CAcreateserial -days 3650 -out etcd.pem -sha256 -extfile extfile.cnf
Automated way
Here is the gist to do the certificates in an automated way.
https://gist.github.com/dipankardas011/e425e4d1573a6f5f1f73dcaf53226ed7
 
You can copy the code and modify it according to your configuration.
It creates a ca.pem root certificate which will be used to sign other certificates, and then create etcd.pem client certificate and etcd-key.pem key which is signed by this root certificate
go run . 192.168.1.2 192.168.1.3 192.168.1.4 # provide the private IP of the etcd VMs to make ca only valid for SAN on them
now you need to move these files to all etcd and controlplane VMs and the below steps will help
Move the certificates to Etcd VMs
Note make sure the directory already exists before copying the certificates in our case we have already created /var/lib/etcd
Generic command:
scp -i ${ssh-private-key} ca.pem etcd.pem etcd-key.pem ${username-vm}@${public-ip-etcd or via using baston host}:/var/lib/etcd
Our script:
scp -i ksctl-key ca.pem etcd.pem etcd-key.pem root@74.220.16.178:/var/lib/etcdscp -i ksctl-key ca.pem etcd.pem etcd-key.pem root@74.220.19.12:/var/lib/etcdscp -i ksctl-key ca.pem etcd.pem etcd-key.pem root@74.220.21.101:/var/lib/etcd
Move the certificates Control plane VMs
Note make sure the directory already exists before copying the certificate in my case I have already created /etcd/kubernetes/pki/etcd/
Generic command:
scp -i ${ssh-private-key} ca.pem etcd.pem etcd-key.pem ${username-vm}@${public-ip-controlplane-vm or via using baston host}:/etcd/kubernetes/pki/etcd/
Our script:
scp -i ksctl-key ca.pem etcd.pem etcd-key.pem root@74.220.23.131:/etcd/kubernetes/pki/etcd/scp -i ksctl-key ca.pem etcd.pem etcd-key.pem root@74.220.19.191:/etcd/kubernetes/pki/etcd/scp -i ksctl-key ca.pem etcd.pem etcd-key.pem root@74.220.22.42:/etcd/kubernetes/pki/etcd/
Step 4: Configure Etcd VMs
Now it's time to do some cool stuff:
Generic template that you can modify according to your configuration.
#!/bin/bashset -xecat < /etc/systemd/system/etcd.service[Unit]Description=etcd[Service]ExecStart=/usr/local/bin/etcd \\  --name infra0 \\  --initial-advertise-peer-urls https://${current-vm-private-ip}:2380 \  --listen-peer-urls https://${current-vm-private-ip}:2380 \\  --listen-client-urls https://${current-vm-private-ip}:2379,https://127.0.0.1:2379 \\  --advertise-client-urls https://${current-vm-private-ip}:2379 \\  --initial-cluster-token etcd-cluster-1 \\  --initial-cluster infra0=https://${current-vm-private-ip}:2380,infra1=https://${other-vms-private-ip}:2380,infra2=https://${other-vms-private-ip}:2380 \\  --log-outputs=/var/lib/etcd/etcd.log \\  --initial-cluster-state new \\  --peer-auto-tls \\  --snapshot-count '10000' \\  --wal-dir=/var/lib/etcd/wal \\  --client-cert-auth \\  --trusted-ca-file=/var/lib/etcd/ca.pem \\  --cert-file=/var/lib/etcd/etcd.pem \\  --key-file=/var/lib/etcd/etcd-key.pem \\  --data-dir=/var/lib/etcd/dataRestart=on-failureRestartSec=5[Install]WantedBy=multi-user.targetEOFsudo systemctl daemon-reloadsudo systemctl enable etcd
Lets discuss about various configuration settings
General:
ExecStart=/usr/local/bin/etcd: Specifies the executable path to start the etcd process.
Restart=on-failure: Instructs the system to automatically restart etcd if it fails.
RestartSec=5: Sets a 5-second delay before attempting a restart.
Cluster Configuration:
--name infra0: Assigns the name "infra0" to this member of the etcd cluster.
--initial-advertise-peer-urls: Advertises this member's peer URL to other members for cluster communication.
--listen-peer-urls: Listens for peer connections on this URL.
--initial-cluster-token etcd-cluster-1: Defines a shared token ensuring all members belong to the same cluster.
--initial-cluster: Lists initial cluster members and their peer URLs.
--initial-cluster-state new: Instructs etcd to create a new cluster, not join an existing one.
Client Communication:
--listen-client-urls: Listens for client connections on these URLs.
--advertise-client-urls: Advertises the client URL for this member to other members.
Logging and Data Storage:
--log-outputs=/var/lib/etcd/etcd.log: Logs etcd output to this file.
--data-dir=/var/lib/etcd/data: Stores etcd data in this directory.
--wal-dir=/var/lib/etcd/wal: Stores the write-ahead log (WAL) in this directory for data durability.
Security:
--peer-auto-tls: Automatically generates and manages TLS certificates for peer communication.
--client-cert-auth: Requires clients to authenticate with TLS certificates.
--trusted-ca-file=/var/lib/etcd/ca.pem: Specifies the trusted certificate authority (CA) file for client certificates.
--cert-file=/var/lib/etcd/etcd.pem: Specifies the certificate file for this etcd member.
--key-file=/var/lib/etcd/etcd-key.pem: Specifies the private key file for this etcd member.
Snapshots:
--snapshot-count '10000': Triggers a snapshot of the data every 10,000 transactions for backup and recovery.
Etcd-0
#!/bin/bashset -xecat < /etc/systemd/system/etcd.service[Unit]Description=etcd[Service]ExecStart=/usr/local/bin/etcd \\  --name infra0 \\  --initial-advertise-peer-urls https://192.168.1.2:2380 \  --listen-peer-urls https://192.168.1.2:2380 \\  --listen-client-urls https://192.168.1.2:2379,https://127.0.0.1:2379 \\  --advertise-client-urls https://192.168.1.2:2379 \\  --initial-cluster-token etcd-cluster-1 \\  --initial-cluster infra0=https://192.168.1.2:2380,infra1=https://192.168.1.3:2380,infra2=https://192.168.1.4:2380 \\  --log-outputs=/var/lib/etcd/etcd.log \\  --initial-cluster-state new \\  --peer-auto-tls \\  --snapshot-count '10000' \\  --wal-dir=/var/lib/etcd/wal \\  --client-cert-auth \\  --trusted-ca-file=/var/lib/etcd/ca.pem \\  --cert-file=/var/lib/etcd/etcd.pem \\  --key-file=/var/lib/etcd/etcd-key.pem \\  --data-dir=/var/lib/etcd/dataRestart=on-failureRestartSec=5[Install]WantedBy=multi-user.targetEOFsudo systemctl daemon-reloadsudo systemctl enable etcd
Etcd-1
#!/bin/bashset -xecat < /etc/systemd/system/etcd.service[Unit]Description=etcd[Service]ExecStart=/usr/local/bin/etcd \\  --name infra1 \\  --initial-advertise-peer-urls https://192.168.1.3:2380 \  --listen-peer-urls https://192.168.1.3:2380 \\  --listen-client-urls https://192.168.1.3:2379,https://127.0.0.1:2379 \\  --advertise-client-urls https://192.168.1.3:2379 \\  --initial-cluster-token etcd-cluster-1 \\  --initial-cluster infra0=https://192.168.1.2:2380,infra1=https://192.168.1.3:2380,infra2=https://192.168.1.4:2380 \\  --log-outputs=/var/lib/etcd/etcd.log \\  --initial-cluster-state new \\  --peer-auto-tls \\  --wal-dir=/var/lib/etcd/wal \\  --client-cert-auth \\  --trusted-ca-file=/var/lib/etcd/ca.pem \\  --cert-file=/var/lib/etcd/etcd.pem \\  --key-file=/var/lib/etcd/etcd-key.pem \\  --snapshot-count '10000' \\  --data-dir=/var/lib/etcd/dataRestart=on-failureRestartSec=5[Install]WantedBy=multi-user.targetEOFsudo systemctl daemon-reloadsudo systemctl enable etcd
Etcd-2
#!/bin/bashset -xecat < /etc/systemd/system/etcd.service[Unit]Description=etcd[Service]ExecStart=/usr/local/bin/etcd \\  --name infra2 \\  --initial-advertise-peer-urls https://192.168.1.4:2380 \  --listen-peer-urls https://192.168.1.4:2380 \\  --listen-client-urls https://192.168.1.4:2379,https://127.0.0.1:2379 \\  --advertise-client-urls https://192.168.1.4:2379 \\  --initial-cluster-token etcd-cluster-1 \\  --initial-cluster infra0=https://192.168.1.2:2380,infra1=https://192.168.1.3:2380,infra2=https://192.168.1.4:2380 \\  --log-outputs=/var/lib/etcd/etcd.log \\  --initial-cluster-state new \\  --peer-auto-tls \\  --snapshot-count '10000' \\  --client-cert-auth \\  --trusted-ca-file=/var/lib/etcd/ca.pem \\  --cert-file=/var/lib/etcd/etcd.pem \\  --key-file=/var/lib/etcd/etcd-key.pem \\  --wal-dir=/var/lib/etcd/wal \\  --data-dir=/var/lib/etcd/dataRestart=on-failureRestartSec=5[Install]WantedBy=multi-user.targetEOFsudo systemctl daemon-reloadsudo systemctl enable etcd
For all Etcd VMs
sudo systemctl start etcd
to test whether you can access etcd server via the etcdctl
below are some example commands to test whether all the etcd members are working as expected
etcdctl \  --cacert=/var/lib/etcd/ca.pem \  --cert=/var/lib/etcd/etcd.pem \  --key=/var/lib/etcd/etcd-key.pem \  endpoint health \  -w=table \  --clusteretcdctl \  --cacert=/var/lib/etcd/ca.pem \  --cert=/var/lib/etcd/etcd.pem \  --key=/var/lib/etcd/etcd-key.pem \  endpoint status \  -w=table \  --clusteretcdctl \  --cacert=/var/lib/etcd/ca.pem \  --cert=/var/lib/etcd/etcd.pem \  --key=/var/lib/etcd/etcd-key.pem \  member list \  -w=tableetcdctl \  --cacert=/var/lib/etcd/ca.pem \  --cert=/var/lib/etcd/etcd.pem \  --key=/var/lib/etcd/etcd-key.pem \  get / --prefix --keys-only
Step 5: Run the Kubeadm init command on the control plane node
In a High Availability (HA) cluster setup, where you have multiple control plane nodes, the localAPIEndpoint in the InitConfiguration is typically not used. The localAPIEndpoint specifies the endpoint that the control plane components advertise to other nodes in the cluster. In a HA setup, the API server is typically load-balanced, and each control plane node advertises itself at the load balancer's address.
Single Control Plane Node Setup: In a single control plane node setup, you might specify the IP address and port of the single control plane node in localAPIEndpoint.
High Availability (HA) Control Plane Setup: In an HA setup, you generally set up a load balancer in front of multiple control plane nodes. The load balancer has a single IP address and distributes incoming requests among the control plane nodes. Each control plane node does not advertise itself directly; instead, they are behind the load balancer. The controlPlaneEndpoint in the ClusterConfiguration is typically used to specify the address and port of the load balancer. In summary, for HA setups, you often configure the controlPlaneEndpoint in the ClusterConfiguration to point to the load balancer's address, and you may not need to explicitly configure localAPIEndpoint in the InitConfiguration. The load balancer handles directing traffic to the active control plane node.
generate certificate key  (CERT_KEY)
kubeadm certs certificate-key # copy the output IMPORTANT
In our case, below is the output:
8b80729b738b2eef8dc2dbec17e927aa2fd03d43b7f0f4925c7e47bf9ae1e561
now let's create the kubeadm init configuration
Generic configuration that you can edit based on your requirements.
cat < kubeadm-config.ymlapiVersion: kubeadm.k8s.io/v1beta3kind: InitConfigurationbootstrapTokens:- groups:  - system:bootstrappers:kubeadm:default-node-token  token: ${some random string}  # important thing to set as it will be used when joining nodes to the k8s cluster.  ttl: 24h0m0s  usages:  - signing  - authenticationcertificateKey: ${get it from the output of kubeadm certs certificate-key command}nodeRegistration:  criSocket: unix:///var/run/containerd/containerd.sock  imagePullPolicy: IfNotPresent  taints: null---apiVersion: kubeadm.k8s.io/v1beta3kind: ClusterConfigurationapiServer:  timeoutForControlPlane: 4m0s  certSANs:    - "${public ip of loadbalancer}" #     - "127.0.0.1"certificatesDir: /etc/kubernetes/pkiclusterName: kubernetescontrollerManager: {}dns: {}etcd:  external:    endpoints:    - "https://${private ip of the etcd 0}:2379"    - "https://${private ip of the etcd 1}:2379"    - "https://${private ip of the etcd 2}:2379"    caFile: "/etcd/kubernetes/pki/etcd/ca.pem"    certFile: "/etcd/kubernetes/pki/etcd/etcd.pem"    keyFile: "/etcd/kubernetes/pki/etcd/etcd-key.pem"imageRepository: registry.k8s.iokubernetesVersion: 1.28.0controlPlaneEndpoint: "${public ip of loadbalancer}:6443"networking:  dnsDomain: cluster.local  serviceSubnet: 10.96.0.0/12scheduler: {}EOF
Our script
cat < kubeadm-config.ymlapiVersion: kubeadm.k8s.io/v1beta3kind: InitConfigurationbootstrapTokens:- groups:  - system:bootstrappers:kubeadm:default-node-token  token: abcdef.0123456789abcdef  # important thing to set as it will be used when joining nodes to the k8s cluster. any random string is allowed for more info can refer to the docs  ttl: 24h0m0s  usages:  - signing  - authenticationcertificateKey: 8b80729b738b2eef8dc2dbec17e927aa2fd03d43b7f0f4925c7e47bf9ae1e561 # get it from the output of kubeadm certs certificate-key commandnodeRegistration:  criSocket: unix:///var/run/containerd/containerd.sock  imagePullPolicy: IfNotPresent  taints: null---apiVersion: kubeadm.k8s.io/v1beta3kind: ClusterConfigurationapiServer:  timeoutForControlPlane: 4m0s  certSANs:    - "74.220.22.92" #     - "127.0.0.1"certificatesDir: /etc/kubernetes/pkiclusterName: kubernetescontrollerManager: {}dns: {}etcd:  external:    endpoints:    - "https://192.168.1.2:2379"    - "https://192.168.1.3:2379"    - "https://192.168.1.4:2379"    caFile: "/etcd/kubernetes/pki/etcd/ca.pem"    certFile: "/etcd/kubernetes/pki/etcd/etcd.pem"    keyFile: "/etcd/kubernetes/pki/etcd/etcd-key.pem"imageRepository: registry.k8s.iokubernetesVersion: 1.28.0controlPlaneEndpoint: "74.220.22.92:6443"networking:  dnsDomain: cluster.local  serviceSubnet: 10.96.0.0/12scheduler: {}EOF
Control-plane-0
Generic command where you can provide your config file:
kubeadm init --config ${cluster-config-file-defined-above} --upload-certs
Our command to create the HA Kubernetes cluster:
kubeadm init --config kubeadm-config.yml --upload-certs
the output will generate join commands
You can now join any number of the control-plane node running the following command on each as root:  kubeadm join 74.220.22.92:6443 --token abcdef.0123456789abcdef \        --discovery-token-ca-cert-hash sha256:a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2 \        --control-plane --certificate-key 8b80729b738b2eef8dc2dbec17e927aa2fd03d43b7f0f4925c7e47bf9ae1e561Please note that the certificate-key gives access to cluster sensitive data, keep it secret!As a safeguard, uploaded-certs will be deleted in two hours; If necessary, you can use"kubeadm init phase upload-certs --upload-certs" to reload certs afterward.Then you can join any number of worker nodes by running the following on each as root:kubeadm join 74.220.22.92:6443 --token abcdef.0123456789abcdef \        --discovery-token-ca-cert-hash sha256:a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2
When you do kubeadm init, you will get the join command in the output
If you carefully see it has the token and discovery-tokn-ca-cert-hash.
token: we defined in the configuration file already so we know the value.
discovery-token-ca-cert-hash: copy from below command
openssl x509 -in /etc/kubernetes/pki/ca.crt -noout -pubkey | openssl rsa -pubin -outform DER 2>/dev/null | sha256sum | cut -d' ' -f1
The output from this command can be used for automation purposes.
a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2
token is already available from the clusterConfiguration  abcdef.0123456789abcdef(TOKEN)
Let's see how to automate the generation of these scripts
So Here is one automation hack
As you got the (TOKEN) , (CA_CERT_HASH), (CERT_KEY)
you can construct the both the join commands so you dont have to rely on the output
package mainimport (    "fmt")func main() {    token := "abcdef.0123456789abcdef"    caCertSHA := "a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2"    certKey := "8b80729b738b2eef8dc2dbec17e927aa2fd03d43b7f0f4925c7e47bf9ae1e561"    publicIP := "74.220.22.92"    expected1 := "kubeadm join 74.220.22.92:6443 --token abcdef.0123456789abcdef --discovery-token-ca-cert-hash sha256:a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2 --control-plane --certificate-key 8b80729b738b2eef8dc2dbec17e927aa2fd03d43b7f0f4925c7e47bf9ae1e561"    expected2 := "kubeadm join 74.220.22.92:6443 --token abcdef.0123456789abcdef --discovery-token-ca-cert-hash sha256:a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2"    if g1, g2 := generate(publicIP, token, caCertSHA, certKey); g1 != expected1 || g2 != expected2 {        fmt.Println("Missmatch")        return    }    fmt.Println("Matched!")}func generate(pubIPLb, token, cacertSHA, certKey string) (string, string) {    controlplane := fmt.Sprintf(`kubeadm join %s:6443 --token %s --discovery-token-ca-cert-hash sha256:%s --control-plane --certificate-key %s`, pubIPLb, token, cacertSHA, certKey)    workernodes := fmt.Sprintf(`kubeadm join %s:6443 --token %s --discovery-token-ca-cert-hash sha256:%s`, pubIPLb, token, cacertSHA)    return controlplane, workernodes}
below is the output:
Control-plane-(N)
Run the controlplane join command on remaining controlplane nodes:
# Templatekubeadm join ${Loadbalancer_Public_IP}:6443 --token ${TOKEN} --discovery-token-ca-cert-hash sha256:${CA_CERT_HASH} --control-plane --certificate-key ${CERT_KEY}
# my codekubeadm join 74.220.22.92:6443 --token abcdef.0123456789abcdef --discovery-token-ca-cert-hash sha256:a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2 --control-plane --certificate-key 8b80729b738b2eef8dc2dbec17e927aa2fd03d43b7f0f4925c7e47bf9ae1e561
mkdir -p $HOME/.kubesudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/configsudo chown $(id -u):$(id -g) $HOME/.kube/config
Step 6: Run the join command on all the worker nodes
kubeadm join ${Loadbalancer_Public_IP}:6443 --token ${TOKEN} --discovery-token-ca-cert-hash sha256:${CA_CERT_HASH}
kubeadm join 74.220.22.92:6443 --token abcdef.0123456789abcdef --discovery-token-ca-cert-hash sha256:a633923134ac00a1e938dde1a28033d2a7d5bc3fb325e7280d000148fef854e2
Let's copy the kubeconfig from the controlplane-0 to our host system
# copied the kubeconfig from the known location # make sure you have moved the kubeconfig in the controlplane node to this location the steps are shown in the kubeadm init outputscp -i ksctl-key root@${public ip of the controlplane}:/root/.kube/config kubeconfig
Once you have the Kubeconfig file you can export the KUBECONFIG variable to interact with the cluster.
export KUBECONFIG=kubeconfig
Now you can run the cluster and you will see all the nodes in the NotReady state, this is becasue kubeadm doesn't provide CNI installed.
Step 7: Install CNI
We will use Cilium which is a CNCF graduated project for our CNI layer.
helm repo add cilium https://helm.cilium.io/helm repo updatehelm install cilium cilium/cilium --version 1.14.6
Now when you test, you will see all the nodes in Ready state.
Step 8: Nginx Test
kubectl run nginx-pod --image=nginxkubectl expose pod nginx-pod --port=80 --name=nginx-service --type=NodePort
Using load balancer public IP:NodePort we can reach the app
Testing the High availability
Let's try the High availability, on one screen we will do kubectl get nodes -w and in other tab we will shutdown one controlplane node so that it becomes not ready and then we will try to again create nginx pod and test if that is working.
Below is everything before creating that chaos:
kubectl get no,po,svc -A# looks like
Now, according to raft consensus distributed model FaultTolerant=(n1)/2 are needed for majority vote if the available nodes become less than this, the system will be unreachable.
In our case we are dealing with 3 controlplane and 3 etcd nodes. So at minimum 2 cp and 2 etcd as it can handle only 1 node failure so ideally you will choose 5 which gives us some room for one node down for maintenance and another failure scenarios.
Shutdown the controlplane and after a few minutes when the node heartbeats are unreachable by k8s it will mark the node as not ready.
But you can see that the nginx workload is still running.
Success: still able to reach the kube-api server and the workload is still running
Let's delete one of the etcd node
Success: still able to reach the kube-api server and the workload is still running
Let's stop one etcd service and again restart it
Success: workload is running
Failure: unable to reach the kube-api server
Lets restart the etcd service
Success: Recovered the kube-api server
Also the same would happen if one more controlplane node gets down
Finally lets remove all data in etcd server and see what happens to the cluster
etcdctl \  --cacert=/var/lib/etcd/ca.pem \  --cert=/var/lib/etcd/etcd.pem \  --key=/var/lib/etcd/etcd-key.pem \  get / --prefix --keys-only | wc -letcdctl \  --cacert=/var/lib/etcd/ca.pem \  --cert=/var/lib/etcd/etcd.pem \  --key=/var/lib/etcd/etcd-key.pem \  del / --prefix
cluster is completely dead: as all cluster data is gone
thats why its important to backup the etcd data!!!! 💀 (Let's talk about Backup and recover some other time).
Conclusion
Meticulously configuring your etcd cluster establishes a fault-tolerant foundation for Kubernetes, ensuring uninterrupted application uptime in various scenarios. This robust setup, suitable for microservices or containerized workloads, is scalable and reliable. While we covered core etcd configuration, consult Kubernetes and etcd documentation for customization and troubleshooting. Implement monitoring tools for cluster health and performance awareness.
Finally, this blog post is possible just because of ksctl project. As you can see these are quite a lot of manual steps so ksctl helps you overcome these challenges with more customizability options with multiple cloud providers. Do try out the ksctl project and leave a star if you like the project.
https://github.com/kubesimplify/ksctl
 
Also, there is a good blog post on ksctl introduction
https://blog.kubesimplify.com/ksctl-making-kubernetes-easy-across-clouds
 
References
k3s-datastore
k3s-external-db
etcd-self-signed-tls
etcd-auto-tls
automate-tls-certs-in-go
Refer-kubeadm-config.v1beta3
Follow Kubesimplify on Hashnode,Twitter and LinkedInJoin our Discord server to learn with us.
Check out our Recent course on WebAssembly
https://www.youtube.com/watch?v=eYekV2Do0YU


Ksctl: Making Kubernetes Easy Across Clouds
Shaik Ahmad Nawaz — Tue, 23 Jan 2024 05:23:05 GMT
In the complex world of managing Kubernetes, a tool like Ksctl emerges as a game-changer. Ksctl, short for Kubernetes Simplify Control, is a tool designed to simplify how we handle Kubernetes clusters, especially across different cloud services. This blog will walk you through what Ksctl is, the problems it tackles, its standout features, and a step-by-step guide on creating clusters effortlessly.
What is Ksctl?
Ksctl is a handy tool that makes managing Kubernetes clusters easier. It's like a one-stop-shop for dealing with Kubernetes across various cloud services. Created by the Kubesimplify team, Ksctl is all about simplifying the future of Kubernetes management.
Problems Ksctl Solves:
Before Ksctl, using Kubernetes posed some challenges:
Too Many Tools: People were using too many different tools, causing confusion.
Tool Complications: Some tools relied on others, making things complicated.
No Consistency: The Kubernetes world lacked consistency, making it hard for things to work together smoothly.
Tricky Configurations: Creating custom setups and managing applications had its share of difficulties.
Hard to Learn: Managing Kubernetes, especially for special cases, was tough to learn.
Solution: Meet Ksctl
Ksctl tackles these issues with some cool features:
1. Manage Anywhere:
Ksctl lets you manage your Kubernetes clusters across different cloud services without switching tools. It's like having a universal remote for your clusters.
2. Pick Your Cluster Flavor:
Choose between having someone else manage your cluster or doing it yourself. Ksctl gives you options for High Availability (HA) clusters, making it flexible.
3. Easy App Deployments:
Ksctl makes deploying applications a breeze by having the necessary plugins ready to go. It's like having your favorite apps pre-installed on your new phone. (Currently a beta feature)
4. Light and Simple:
Ksctl is lightweight, meaning it won't slow you down. You can install it quickly without needing extra stuff.
5. Your Way:
From managing everything to having Ksctl handle it, and from deciding which plugins to use to choosing pre-installed apps, Ksctl lets you do things your way.
Key Features:
Easy Cluster Setup: Make a new cluster with just one simple command.
No Fuss Installation: Installing Ksctl is a breeze  no complicated steps.
Make it Yours: Customize your cluster based on what you need, from configurations to apps.
Faster Creation & small binary size: It creates clusters in ~5-6 minutes and the ksctl CLI binary is <50 MB.
How to Use Ksctl:
Creating a cluster with Ksctl is easy:
Install Ksctl: Follow the easy steps to get Ksctl CLI on your system.
For Linux:
bash <(curl -s https://raw.githubusercontent.com/kubesimplify/ksctl-cli/main/scripts/install.sh)
For MacOS:
zsh <(curl -s https://raw.githubusercontent.com/kubesimplify/ksctl-cli/main/scripts/install.sh)
For Windows:
iwr -useb https://raw.githubusercontent.com/kubesimplify/ksctl-cli/main/install.ps1 | iex
Whats next:
Github Repo to look for
kubesimplify/ksctl: Cloud Agnostic Kubernetes Management (Core)
kubesimplify/ksctl-cli: Cloud Agnostic Kubernetes Management (CLI)
kubesimplify/ksctl-docs: Cloud Agnostic Kubernetes Management (Docs)
Conclusion:
Ksctl is your go-to tool for managing Kubernetes without the headache. As more folks dive into Kubernetes, tools like Ksctl make things simpler. With its focus on working across different clouds, offering cluster choices, and making app deployments hassle-free, Ksctl stands out. Embrace the future of Kubernetes management with Ksctl, and make your cluster adventures smoother than ever.
In the future more blogs will come regarding its usage. so stay tuned
For more details and updates, head over to kubesimplify's home on GitHub. And if you want to join the conversation or report an issue. Happy clustering!


Pure Cilium : A Guide for Local Load Balancing and BGP
Swapnasagar Pradhan — Mon, 22 Jan 2024 11:23:53 GMT
In this guide, we'll walk through the steps to build a multi-node Kubernetes cluster on your local workstation or MacBook (M1, M2, or M3) using K3s and Cilium. We will also demonstrate using Cilium's powerful Load Balancer (L.B.) Use the IPAM feature to expose your service as a built-in load balancer in your K8s cluster.
Pre-Requisites
  Before you begin, make sure you have set up your cluster with Cilium CNI. You can follow the instructions in this GitHub repository to get your cluster up and running in under 3 minutes.
  Once Cilium is up and running, the other pods in the cluster should transition into the Running state. You should see one Cilium pod on each node and the Cilium operator.
  
Cilium components
  Here is a high-level description of Cilium components:
The Cilium agent
The cilium-agent component runs on every node in the cluster. It accepts the configuration via APIs or Kubernetes, which describes requirements for networking, network policies, load balancing, visibility, and monitoring.
The agent waits for events from the orchestration system (i.e., Kubernetes) to indicate when workloads or containers start or stop. It manages eBPF programs that allow the Linux kernel to control network access in and out of the containers.
Cilium CLI
The CLI client is a command-line tool installed alongside the Cilium agent on the same node, interacting with the agents REST API. The CLI enables the inspection of the local agents state and status. It also offers tools to access and validate the state of eBPF maps directly.
The Cilium operator
The operator handles tasks that require one-time handling for the whole cluster instead of for every node. The Cilium operator is not critical for making network policy decisions or forwarding  clusters can generally function when the operator becomes unavailable.
The CNI plugin
Kubernetes invokes the cilium-cni plugin when it schedules or terminates a pod on the node. The plugin interacts with the nodes Cilium API to trigger the right datapath configurations for the pods networking, policy, and load balancing needs.
Cilium BGP
  Picture source: https://cilium.io
  
  What is a BGP? BGP is an internet routing protocol that enables the exchange of routing information between autonomous systems (ASes), allowing networks to learn and advertise routes to reach different destinations over public and private networks.
  For more information on BGP, look at RFC 4271  BGP.
  Enabling BGP
  From the official Cilium BGP control plane documentation, you will see that currently, a single flag in the Cilium agent exists to turn on the BGP Control Plane feature set.
  There are different ways to enable this flag, however we will continue using the cilium cli (Helm requires a different approach, so check the official documentation if you are using Helm).
  Before we change the BGP flag, lets check the current configuration.
  P.S. You can enable BGP when you install Cilium, but we want to show you the underlying steps.
As you can see, the BGP Control Plane feature is turned off by default. Lets enable it!
The READY state for our Cilium Agents is 0/1, which means theres a problem. Lets read the logs to see why the Cilium Agents are no longer READY.
kubectl logs -n kube-system cilium-gdnd7 | tail -1Defaulted container "cilium-agent" out of: cilium-agent, config (init), mount-cgroup (init), apply-sysctl-overwrites (init), mount-bpf-fs (init), clean-cilium-state (init), install-cni-binaries (init)level=error msg=k8sError error="github.com/cilium/cilium/pkg/k8s/resource/resource.go:183: Failed to watch *v2alpha1.CiliumBGPPeeringPolicy: failed to list *v2alpha1.CiliumBGPPeeringPolicy: the server could not find the requested resource (get ciliumbgppeeringpolicies.cilium.io)" subsys=k8s
. failed to watch *v2alpha1.CiliumBGPPeeringPolicy.
Setting enable-bgp-control-plane true causes the Cilium Agents to look for the Cilium BGP Peering Policy, which does not yet exist, mainly because the Cilium Operator did not create the CiliumBGPPeeringPolicy CRD. After all, we were not using that feature at installation time.
Let's check the resource types defined in our cluster with the api-resources command:
kubectl api-resources | grep -i cilium
There you go: no BGP policy. Also, we can notice the cilium agents were redeployed but not the operator, which is still running.
let's redeploy it by deleting it and then check the logs of the operator pod, and you can see something
Creating CRD (CustomResourceDefinition)..." name=CiliumBGPPeeringPolicy/v2alpha1
Details are in the screenshot
Now Recheck the api-resources to see the new CRD:
Cilium BGP Peering Policy
Now that we have a CiliumBGPPeeringPolicy type (CRD), we can create an object of that type to define our Cilium BGP peering policy.
Here is the yaml file which we will use to create it.
cat cilium-bgp-policy.yaml
apiVersion: "cilium.io/v2alpha1"kind: CiliumBGPPeeringPolicymetadata: name: 01-bgp-peering-policyspec: nodeSelector:   matchLabels:     bgp-policy: a virtualRouters: - localASN: 64512   exportPodCIDR: true   neighbors:    - peerAddress: '192.168.1.1/32'      peerASN: 64512   serviceSelector:     matchExpressions:       - {key: somekey, operator: NotIn, values: ['never-used-value']}"cilium.io/v2alpha1" kind: CiliumBGPPeeringPolicy metadata: name: 01-bgp-peering-policy spec: nodeSelector: matchLabels: bgp-policy: a virtualRouters: - localASN: 64512 exportPodCIDR: true neighbors: - peerAddress: '192.168.1.1/32' peerASN: 64512 serviceSelector: matchExpressions: - {key: somekey, operator: NotIn, values: ['never-used-value']}
Specification (Spec)
spec: This section defines the behavior of the resource.
nodeSelector: Specifies which nodes this policy applies to.
matchLabels: The policy applies to nodes with the label bgp-policy: a.
virtualRouters: Configures one or more virtual routers for BGP.
- localASN: 64512: Defines the Autonomous System Number (ASN) for the local node.
exportPodCIDR: true: This flag indicates that the pod CIDR should be advertised to BGP peers.
neighbors: Defines BGP neighbors.
- peerAddress: '192.168.1.1/32': Specifies the address of a BGP neighbor.
peerASN: 64512: The ASN of the BGP neighbor.
serviceSelector: Specifies which services this policy applies to.
matchExpressions: A list of criteria for selecting services.
- {key: somekey, operator: NotIn, values: ['never-used-value']}: Selects services that do not have a label with the key somekey and value never-used-value.
Documentation can be found here.
  Now that we have an understanding of the policy, lets apply it to the cluster:
Kubernetes nodes label
  We need to label the nodes we want the BGP policy to apply. In our case, we will label the follower nodes, leaving out the control-plane node. Our CiliumBGPPeeringPolicy node selector expects the bgp-policy=a label.
  kubectl  apply -f cilium-bgp-policy.yaml   ciliumbgppeeringpolicy.cilium.io/01-bgp-peering-policy created  -----  kubectl label nodes k3s-follower bgp-policy=a  ------  ubuntu@k3s:~$ kubectl get nodes -l bgp-policy=a  NAME         STATUS   ROLES    AGE    VERSION  k3s-follower   Ready       2d2h   v1.28.5+k3s1
LB IPAM
  When you create a Load Balancer Service in a Kubernetes cluster, the cluster itself does not assign the Service a Load Balancer I.P. (aka External I.P.); we need a plugin to do that. If you create a Load Balancer Service without a Load Balancer plugin, the External I.P. address will show Pending indefinitely.
  The Cilium LoadBalancer IP Address Management (LB IPAM) feature can be used to provision I.P. addresses for our Load Balancer Services.
  Here is what the official doc says about it:
LB IPAM is a feature that allows Cilium to assign IP addresses to Services of type LoadBalancer. This functionality is usually left up to a cloud provider, however, when deploying in a private cloud environment, these facilities are not always available.
This section must understand that LB IPAM is always enabled but dormant. The controller is awoken when the first IP Pool is added to the cluster.
  Lets create our cilium LoadBalancer IP pool.
  To create a pool, we name it and give a CIDR range. Well use 172.198.1.0/24 as our CIDR range; this range mustn't overlap with other networks in use with your cluster.
  # cat cilium-ippool.yaml  apiVersion: "cilium.io/v2alpha1"  kind: CiliumLoadBalancerIPPool  metadata:    name: "lb-pool"  spec:    cidrs:    - cidr: "172.198.1.0/24"apirsion: "cilium.io/v2alpha1" kind: CiliumLoadBalancerIPPool metadata: name: "lb-pool" spec: cidrs: - cidr: "172.198.1.0/24"  -------  Kubectl create -f cilium-ippool.yaml  ciliumloadbalancerippool.cilium.io/lb-pool created
  Cilium service LoadBalancer
  Now, lets create a pod with a service type LoadBalancer and test it.
  We will make a simple nginx pod and a simple service exposing port 8080, with type LoadBalancer.
  This should cause Cilium to provision an external I.P. for our logical load balancer and then advertise the route through BGP.
  cat pod.yaml service.yaml
  # pod.yaml  apiVersion: v1  kind: Pod  metadata:    name: simple-pod    labels:      app: simple-pod  spec:    containers:    - name: my-app-container      image: nginx:latest      ports:      - containerPort: 80  # service.yaml  apiVersion: v1  kind: Service  metadata:    name: my-service  spec:    selector:      app: simple-pod  # Make sure this matches the label of the Pod    ports:    - protocol: TCP      port: 8080      targetPort: 80    type: LoadBalancer
  Lets create it:
  # kubectl apply -f pod.yaml  pod/simple-pod created  # kubectl apply -f service.yaml  service/my-service created
  From the output below, we know that we have a running pod with the name simple-pod and a service with the name my-service, but the most crucial part is that we have a service TYPE LoadBalancer with EXTERNAL-IP from our ip-pool, which we created earlier, and we get 172.198.1.167
  
Validate LoadBalancer External I.P.
  
  This post guides you through setting up Cilium-based Load Balancer Services in a K3s Kubernetes cluster, detailing the network operations involved and providing a foundation for further experimentation.
Thank you for reading!
  Inspiration:
  https://docs.cilium.io/en/latest/network/lb-ipam/
  https://cilium.io/blog/2020/04/29/cilium-with-rancher-labs-k3s/
Follow Kubesimplify on YouTube, Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.


The WebAssembly Course
Saiyam Pathak — Mon, 15 Jan 2024 12:35:49 GMT
By now, the term 'Wasm' is known to everyone as it has shown its success in the browsers but also its marked as the next big thing in the cloud-native ecosystem or serverless ecosystem or when we talk about wasm on the server side . Almost everyone included Wasm in their 2023 predictions, and it's clear that Wasm is onto something. 2024 is no different, and we will see even more production usage of WebAssembly.
I am really excited to share that Rishit Dagli and I have launched a Complete WebAssembly Course - from Beginners to Advanced for you on Kubesimplify. This course will help you start your WebAssembly journey and enable you to create cool stuff in WebAssembly.
https://www.youtube.com/watch?v=eYekV2Do0YU
 
This course was developed as part of our efforts to bridge the education gap in learning WebAssembly. Rishit and I attended various conferences last year focused on WebAssembly, including WasmI/O and Cloud Native Wasm Day. We spoke to many people, and the first major hurdle in entering the field of WebAssembly was finding the right education. That's when we started planning a course to be created, covering all the basic constructs needed to create great applications.
What can you expect from the course?
In this course, you will learn all the basic constructs of WebAssembly, including:
Introduction to WebAssembly: We discuss the history of Wasm, its success in browsers, and its rising popularity on the server side.
CNCF Wasm Landscape and Bytecode Alliance: This section explains the CNCF Landscape and the significance of the Bytecode Alliance.
Memory Management and Sandboxing: An in-depth look at Wasm's core feature of sandboxing and memory management.
Wasm Module: Exploration of the Wasm module, including binary representation, different sections, and mapping from wat files to binary.
Networking Capabilities in Wasm: We delve into the Wasm networking proposal with a practical example.
Tooling around Wasm and Runtimes: Discussion of Wasm specifications, implementations, and additional capabilities provided by various tools.
Run Wasm Everywhere with WASI: Exploring the WebAssembly System Interface (WASI) for server-side applications, including demos.
WASIX and WASI Preview 2: Insights into the future of WASI and the innovations like WASIX.
Wasm in the Cloud Native Landscape: Analysis of Wasm's role in cloud-native technologies, including integration with Docker and Kubernetes.
Wasm Component Model: A deep dive into the component model, including demonstrations using tools like Spin.
KV Store in Wasm: Demonstrating a key-value store example using Spin.
Running ML Models in Wasm: Covering machine learning inferencing in Wasm with practical demos.
Observability for Wasm Modules: Focusing on the importance of observability for enterprise adoption of WebAssembly.
Final Thoughts about Wasm: Summarizing our views on Wasm in 2024.
This course provides you with enough knowledge about WebAssembly to kickstart your own projects using the discussed tools and frameworks and to deploy your WebAssembly workloads.
We hope you enjoy the learning experience. We've created a handy course website and GitHub repository for easy navigation, which includes all the slides and demo files used in the course.
Please make sure to share the course within your network if you find it helpful, and don't forget to subscribe to Kubesimplify."


Tutorial: Build a Cloud Cost Monitoring System with Terraform, Ansible and Komiser
Kunal Verma — Fri, 03 Nov 2023 05:39:27 GMT
In today's cloud-centric world, businesses rely on cloud services to power their applications and infrastructure. While the scalability and flexibility of the cloud are undeniable advantages, it's crucial to keep a close eye on the costs associated with these services.
Introducing Komiser, an open-source cloud-agnostic resource manager that offers a powerful solution to address this challenge by seamlessly collecting comprehensive resource data across your organization's Cloud accounts. With Komiser, you can gain deeper insights into resource consumption and expenditure across different cloud environments, empowering you to make informed decisions and optimize your cloud infrastructure efficiently, and that is what we cover in this article.
In this tutorial, we will walk through the step-by-step process of building a Cloud Cost Monitoring System using Komiser which will enable us to access and aggregate resource data from a cloud infrastructure, provisioned on AWS.
Whether you're a DevOps engineer responsible for managing AWS resources or a cloud architect looking to optimize costs across your cloud infrastructure, this tutorial will demonstrate a practical use case, illustrating how you can leverage Komiser to make informed decisions and drive cost savings in your cloud infrastructure.
Project Architecture
Let us understand the overall architecture and the key components involved in this project.
In this tutorial, we are using a simple Django todo list application that relies on a bunch of AWS services:
IAM - We created a new IAM user to grant necessary permissions to our AWS account.
EC2 Instance - The application container is hosted in an Ubuntu-based remote server.
VPC - In this demo, we are using the default VPC for our AWS region.
Elastic Load Balancer - To manage the incoming traffic to our Django app.
The entire AWS infrastructure is provisioned and managed using Terraform (an Infrastructure as Code tool) and Ansible (a configuration management tool).
And finally, we deploy and authenticate Komiser to monitor the cloud resources associated with our Django application.
Sounds interesting, right? I hope this gave you a gist of what we'll be building together and let's move on to the prerequisites section.
Prerequisites
Before you begin this tutorial, you'll need the following to get started:
Basic knowledge of AWS Cloud.
An AWS Free Tier account.
Basic knowledge of containers and Docker.
The following tools are installed and configured on your system:
Terraform
Ansible
Komiser CLI
Step 1: Initial App Configuration
There are mainly two parts to this particular section. Let us start with testing our Django application locally.
Local App Setup
I always recommend first running and testing the application locally, before any further integrations or provisioning of the actual cloud infrastructure.
Follow the detailed steps mentioned in the documentation to first clone and quickly test the Django Todo list application.
Containerising our App
To easily run our Django application on the remote EC2 instance, well be containerizing the application.
Create a new Dockerfile and use the following code to create a new docker image:
# pull the official base imageFROM python:3.8.3-alpine# set work directoryWORKDIR /app# set environment variablesENV PYTHONDONTWRITEBYTECODE 1ENV PYTHONUNBUFFERED 1# install dependenciesRUN pip install --upgrade pip COPY ./requirements.txt /appRUN pip install -r requirements.txt# copy projectCOPY . /app# expose port 8000EXPOSE 8000
In this case, we are also using docker-compose to further simplify the process of running our container. Create a docker-compose.yaml file with the following code snippet:
version: '3'services:   web:       build: .       command: python manage.py runserver 0.0.0.0:8000       ports:           - 8000:8000
Essentially, this will help us in spinning up a container, running and being exposed at port 8000, using just a single command of:
docker-compose up -d
Interestingly, well be further automating the process of container creation using Ansible playbooks in a separate section ahead!
Congratulations on completing the initial app configuration 🎉
Let us move ahead with provisioning our cloud infrastructure, using Terraform!
Step 2: Cloud Infrastructure Configuration
In this particular section, well first be provisioning our cloud infrastructure on AWS using Terraform and then automating the deployment process for our Django application container on the remote EC2 instance, using the Ansible playbook.
Let us first provision our AWS infrastructure and get that up and running!
Infrastructure Provisioning Using Terraform
As mentioned previously, for this particular project we have the following AWS services that need to be provisioned:
IAM
EC2 Instance
VPC (this is not newly created per se. We'll be using the default VPC for our AWS region)
Elastic Load Balancer
Let us see their configurations one by one, starting with creating an IAM user.
1. Creating an IAM user
It is always recommended to create a new IAM user associated with your AWS account and attach granular permissions according to the use case.
Use the following code snippet to define a new IAM user named komiser-aws-user:
resource "aws_iam_user" "komiser_iam" {  name = "komiser-aws-user"  tags = {    Name = "komiser-django-app"  }}# resource for UI loginresource "aws_iam_user_login_profile" "komiser_iam_login" { user    = aws_iam_user.komiser_iam.name}# for access key & secret access key:resource "aws_iam_access_key" "komiser_iam" {  user    = aws_iam_user.komiser_iam.name}# Output the IAM user access id, secret id and password:output "id"{  value = aws_iam_access_key.komiser_iam.id}output "secret"{  value = aws_iam_access_key.komiser_iam.secret  sensitive = true}output "iam_password" { value = aws_iam_user_login_profile.komiser_iam_login.password sensitive = true}
Explanation:
aws_iam_user - To create a new IAM user named: komiser-aws-user.
aws_iam_user_login_profile - To enable AWS Management console login for the new user.
aws_iam_access_key - To create access and secret access keys for the new user.
After the user has been created, there are three output values defined:
Access ID
Secret Access ID
Login password for AWS Management console
The second part of creating an IAM user is attaching an appropriate policy for granting it the necessary permissions to access AWS resources.
Use the following policy.json file that defines the permissions well give to our new IAM user:
{    "Version": "2012-10-17",    "Statement": [        {            "Sid": "1",            "Effect": "Allow",            "Action": [                "ec2:*",                "s3:*",                "iam:*",                "elasticloadbalancing:*",                "route53:*",                "tag:Get*",                "pricing:*"            ],            "Resource": "*"        }    ]}
You can certainly define more granular permissions here, within each resource group. Refer to the Komiser policy to learn more.
Let us create a new IAM policy using the definition above and attach that to the user:
resource "aws_iam_policy" "komiser_policy" {   name        = "komiser_iam_policy"  description = "This is the policy for komiser user"  policy = file("policy.json") tags = {    Name = "komiser-django-app"  }}# Policy Attachment with the user:resource "aws_iam_user_policy_attachment" "komiser_policy_attachment" { user       = aws_iam_user.komiser_iam.name policy_arn = aws_iam_policy.komiser_policy.arn}
Explanation:
aws_iam_policy - creates a new IAM policy named komiser_iam_policy.
aws_iam_user_policy_attachment - attaches the komiser_iam_policy with our IAM user i.e. komiser-aws-user.
2. Creating an EC2 instance
As we are provisioning our infrastructure using Terraform, there are a few different parts we need to define to successfully provision an EC2 instance.
Let us have a look at each of them, in detail.
Defining the Terraform EC2 resource
Use the following code to define a new Ubuntu EC2 instance of type t2.micro:
# EC2 instance resource:resource "aws_instance" "komiser_instance" {  ami           = "ami-053b0d53c279acc90"  instance_type = "t2.micro"  key_name      = aws_key_pair.ssh_key.key_name  vpc_security_group_ids = [aws_security_group.allow_tls_1.id]  depends_on = [aws_security_group.allow_tls_1]  user_data  = "${file("install.sh")}"  tags = {    Name = "komiser-django-app"  }}
Note:
The AMI ID specified above is specific to the AWS region chosen. In my case, the default region is us-east-1.
Be sure to change the AMI ID value according to the region you choose to provision the resources in!
To install the necessary dependencies on our remote instance after being provisioned, we are using Terraforms user_data type to attach the bash script given below:
#!/bin/bash# Install docker:sudo apt updatesudo apt install -y apt-transport-https ca-certificates curl software-properties-commoncurl -fsSL  | sudo gpg --dearmor -o /usr/share/keyrings/docker-archive-keyring.gpgecho "deb [arch=$(dpkg --print-architecture) signed-by=/usr/share/keyrings/docker-archive-keyring.gpg]  $(lsb_release -cs) stable" | sudo tee /etc/apt/sources.list.d/docker.list > /dev/nullsudo apt updateapt-cache policy docker-cesudo apt install -y docker-ce# Install docker compose:sudo mkdir -p ~/.docker/cli-plugins/sudo curl -SL  -o ~/.docker/cli-plugins/docker-composesudo chmod +x ~/.docker/cli-plugins/docker-compose# Clone the Git repo:git clone https://github.com/kubesimplify/cloudnative-lab.git
Essentially, this particular script will install the following tools on our Ubuntu instance:
Docker Engine
Docker Compose
Clone the GitHub repo, to access the Dockerfile (created above)
You may replace the GitHub repository URL with your projects repository URL here!
Defining the security group for our Instance
 There are mainly two things we need to define in our security group:
Ingress - Allowing incoming traffic at ports:
22 - to enable remote access using SSH
8000 - to expose our Django application
Egress - Allowing external traffic from anywhere on the internet
    Use the code below to create a new security group, associated with our EC2 instance:
    resource "aws_security_group" "komiser_sg" {      name        = "komiser_sg"      description = "Security Group for Komiser Instance"      vpc_id      = "VPC_ID"      ingress {            description = "For ssh"            from_port   = 22            to_port     = 22            protocol    = "tcp"            cidr_blocks = ["0.0.0.0/0"]      }      ingress {            description = "For Django app"            from_port   = 8000            to_port     = 8000            protocol    = "tcp"            cidr_blocks = ["0.0.0.0/0"]      }      egress {        from_port   = 0        to_port     = 0        protocol    = "-1"        cidr_blocks = ["0.0.0.0/0"]      }      lifecycle {        create_before_destroy = true      }      tags = {        Name = "komiser-django-app"      }    }
Creating a new SSH key pair
 Use the following code to create a new SSH key pair in AWS called komiser_ssh_key, that we can use to securely connect with our remote instance:
 resource "aws_key_pair" "ssh_key" {   key_name   = "komiser_ssh_key"   public_key = file("~/.ssh/komiser-aws.pub") # location of public SSH key   tags = {     Name = "komiser-django-app"   } }
Note:
You will need to create a new SSH key pair for this purpose. Use the following command to generate a new SSH key pair on your local machine:
ssh-keygen -t rsa -b 4096
An example SSH key pair creation process has been shown below:
Creating an Elastic IP for our Instance
 By default, the IP address assigned to an EC2 instance changes on reboot and this may sometimes complicate things. To avoid this, I recommend creating an Elastic IP address (which remains constant) and associating that with the instance.
 Use the following Terraform resource types to create and associate an Elastic IP with our instance:
 # Elastic IP resource resource "aws_eip" "koimser_instance_ip" {   instance = aws_instance.komiser_instance.id   depends_on = [aws_instance.komiser_instance]   tags = {     Name = "komiser-django-app"   } } # Elastic IP association: resource "aws_eip_association" "eip_association" {   instance_id   = "${aws_instance.komiser_instance.id}"   allocation_id = "${aws_eip.koimser_instance_ip.id}" } # Output the instance IP: output "ec2_ip" {   value = aws_eip.koimser_instance_ip.public_ip }
Overall, the entire configuration for provisioning our EC2 instance would look like this:
# EC2 instance resource:resource "aws_instance" "komiser_instance" {  ami           = "AMI_ID"  instance_type = "t2.micro"  key_name      = aws_key_pair.ssh_key.key_name  vpc_security_group_ids = [aws_security_group.allow_tls_1.id]  depends_on = [aws_security_group.allow_tls_1]  user_data  = "${file("install.sh")}"  tags = {    Name = "komiser-django-app"  }}# SSH key pairresource "aws_key_pair" "ssh_key" {  key_name   = "komiser_ssh_key"  public_key = file("~/.ssh/komiser-aws.pub")  tags = {    Name = "komiser-django-app"  }}# Security group resource:resource "aws_security_group" "allow_tls_1" {  name        = "allow_tls_1"  description = "Allow TLS inbound traffic"  vpc_id      = "vpc-0c09e12657a2cf8fc"  ingress {        description = "For ssh"        from_port   = 22        to_port     = 22        protocol    = "tcp"        cidr_blocks = ["0.0.0.0/0"]  }  ingress {        description = "For Django app"        from_port   = 8000        to_port     = 8000        protocol    = "tcp"        cidr_blocks = ["0.0.0.0/0"]  }  egress {    from_port   = 0    to_port     = 0    protocol    = "-1"    cidr_blocks = ["0.0.0.0/0"]  }  lifecycle {    create_before_destroy = true  }  tags = {    Name = "komiser-django-app"  }}# Elastic IP resourceresource "aws_eip" "koimser_instance_ip" {  instance = aws_instance.komiser_instance.id  depends_on = [aws_instance.komiser_instance]  tags = {    Name = "komiser-django-app"  }}# Elastic IP association:resource "aws_eip_association" "eip_association" {  instance_id   = "${aws_instance.komiser_instance.id}"  allocation_id = "${aws_eip.koimser_instance_ip.id}"}# Output the instance IP:output "ec2_ip" {  value = aws_eip.koimser_instance_ip.public_ip}
3. Creating an Elastic Load Balancer
For properly configuring an Elastic Load Balancer, there are mainly two parts we need to define:
Security Group for our ELB
 Use the following code to define the security group for our load balancer:
 # ELB security group: resource "aws_security_group" "komiser_elb_sg" {   name        = "komiser_elb"   description = "Komiser ELB Security Group"   ingress {     from_port = 80     to_port = 80     protocol = "tcp"     cidr_blocks = ["0.0.0.0/0"]   }   egress {     from_port        = 0     to_port          = 0     protocol         = "-1"     cidr_blocks      = ["0.0.0.0/0"]     ipv6_cidr_blocks = ["::/0"]   }   tags = {     Name = "komiser-django-app"   } }
 Explanation:
Ingress Rules: This security group allows incoming traffic on port 80 via the TCP protocol from any IP address (0.0.0.0/0), essentially enabling access to the port 80.
Egress Rules: Egress rules allow all outbound traffic (0.0.0.0/0 for IPv4 and ::/0 for IPv6).
Terraform ELB resource
 Use the following code to create a new Elastic Load Balancer:
 # Create a new Elastic load balancer: resource "aws_elb" "komiser_elb" {   name               = "komiser-elb"   availability_zones = ["us-east-1a", "us-east-1b", "us-east-1c", "us-east-1d"]   security_groups = [aws_security_group.komiser_elb_sg.id]   instances = [aws_instance.komiser_instance.id]   access_logs {     bucket        = "komiser-elb-logs"     interval      = 5   }   listener {     instance_port     = 8000     instance_protocol = "http"     lb_port           = 80     lb_protocol       = "http"   }   health_check {     healthy_threshold   = 2     unhealthy_threshold = 2     timeout             = 3     target              = "TCP:8000"     interval            = 30   }   cross_zone_load_balancing   = true   idle_timeout                = 400   connection_draining         = true   connection_draining_timeout = 400   tags = {     Name = "komiser-django-app"   } } # Output the ELB Domain name: output "komiser_elb_dns" {   value = aws_elb.komiser_elb.dns_name   depends_on = [aws_elb.komiser_elb] }
 Explanation:
instances - specifying the EC2 instance ID to which this ELB will be associated with.
access_logs - storing the ELB logs in an S3 bucket named as komiser-elb-logs (this bucket has to be first created separately on AWS)
listener - defines a listener that maps incoming requests on port 80 (configured in our ELB security group) to the Django application container, which will be running on port 8000 in our EC2 instance.
Before applying the above changes and provisioning the infrastructure, make sure to define the terraform AWS provider and specify the correct AWS profile to use:
terraform {  required_providers {    aws = {      source = "hashicorp/aws"      version = "5.8.0"    }  }}provider "aws" {  region = "us-east-1"  profile = "Komiser-User"}
Finally, you can now use the following commands to provision the entire infrastructure on AWS:
terraform initterrafor apply
Congratulations 🎉 we have successfully provisioned our cloud resources on AWS using Terraform!
Step 3: Deploying our App Using Ansible Playbook
Giving a little bit of background, the Ansible playbook is used for automating tasks and managing server configurations, making it easier to maintain and deploy applications on multiple remote servers.
And thats exactly our use case today!
In this particular section, well be using an Ansible playbook to first connect to our remote EC2 instance and then automate the process of deploying our Django Todo application container.
Talking about configuring the application deployment process using Ansible, there are mainly two parts we need to cater to here.
Let us discuss both of them in detail.
1. Building the Ansible Inventory
Essentially, an Ansible inventory file is a simple list of hostnames or IP addresses that Ansible uses to manage and execute tasks on the remote server. In this case, our target remote server is the EC2 instance we provisioned earlier.
Create a new inventory.yaml file in your working directory and use the following configuration:
virtualmachines:  hosts:    vm01:      ansible_host: INSTANCE_IP_ADDRESS       ansible_ssh_user: ubuntu      ansible_ssh_private_key_file: "PRIVATE_SSH_KEY"
Explanation:
virtualmachines - represents a group of virtual machines to be managed by Ansible.
hosts - lists down the information for individual hosts (remote servers) within a group.
Here, we are representing our remote EC2 instance with vm01 and providing the following configuration:
  vm01:      ansible_host: INSTANCE_IP_ADDRESS       ansible_ssh_user: ubuntu      ansible_ssh_private_key_file: "PRIVATE_SSH_KEY"
Make sure to replace INSTANCE_IP_ADDRESS with the Elastic IP address of the EC2 instance and PRIVATE_SSH_KEY with the location of the private SSH key file in your local system.
To test the connection, use the following command to ping the EC2 instance:
ansible virtualmachines -m ping -i inventory.yaml
If the connection is successful, youll receive the following output:
2. Defining the Ansible Playbook
Now that we successfully established a secure connection between Ansible and our EC2 instance, let us define the tasks we need Ansible to perform on our remote server.
Create a new playbook.yaml file and use the following code to create a playbook:
- name: AWS <> Komiser Playbook  hosts: vm01  tasks:  - name: Check if Docker is running    ansible.builtin.systemd:      name: docker.service      state: started      enabled: true  - name: Run Docker Compose    ansible.builtin.command:     args:      # change the current dir      chdir: /cloudnative-lab/projects/ep-cloud-cost-monitoring/project_files      # run docker compose      cmd: sudo docker compose -f docker-compose.yml up -d
Explanation:
hosts: vm01 - the name of the target host server where the tasks will be executed (defined above).
There are two main tasks defined to be executed on our instance:
Checking if the Docker engine is running on the EC2 instance.
 Here, we are using Ansibles built-in systemd module - ansible.builtin.systemd which will essentially execute the following command in the background to check the status of the docker engine:
 systemctl status docker.service
Running Docker Compose to start the application container.
 Here, we are executing some shell commands using the built-in ansible.builtin.command module:
Changing the current directory to where the Dockerfile and docker-compose.yaml are located.
Executing the following command to start our Django app container:
  sudo docker compose -f docker-compose.yml up -d
Now, you can use the following command to execute the playbook:
ansible-playbook -i inventory.yaml playbook.yaml
As the tasks are being executed, you can view the terminal output which may look something like the below:
If everything goes well as planned, our application has been deployed on our EC2 instance and youll be able to access the web browser using either of these two methods:
http://INSTANCE_IP:8000
Elastic Load Balancer Domain (which we already provisioned above)
Note:
In case you face any issues/errors in this step, you can access the application container logs, by connecting to the EC2 instance using ssh:
ssh -i private_ssh_file ubuntu@IP_ADDRESS
Congratulations 🎉 you have successfully deployed the Django application on the remote server, using Ansible!
A Spotlight on Komiser
If we take a look at the definition from the official documentation:
Komiser is an open-source cloud-agnostic resource manager, that integrates with multiple cloud providers (including AWS, Azure, GCP, Civo, Digital Ocean, Kubernetes, OCI, Linode, Tencent and Scaleway), builds a cloud asset inventory, and helps you break down your cost at the resource level.
In simple terms, it's a tool that keeps an eagle eye on your cloud resources, helping you understand and optimize your overall cloud costs.
Highlighting some Key Features
Komiser brings a set of powerful features to the table:
Multi-Cloud Support: It works seamlessly with various cloud providers like AWS, Azure, Google Cloud and more. You can manage your resources across different clouds from a single place.
Comprehensive Resource Data: Komiser collects detailed information about your cloud resources. You can track everything from virtual machines and databases to storage and networking.
Cost Transparency: It provides deep insights into resource consumption and expenditure. You'll know exactly where your money is going and where you can save.
Customizable Dashboards: One of my favorites is the fact that Komiser offers a customizable dashboard, so you can tailor the information you want to see. It's like having your own cloud financial control center!
How Komiser is Solving the Cloud Cost Problem?
Here's the magic of using a tool like Komiser!
Komiser will help you monitor and manage your cloud resources efficiently. Whether you're running a small application (such as a Django ToDo list in our case) or a complex infrastructure, Komiser will simplify the process of controlling your cloud spending.
It's not just about cost-cutting, but about being cost-intelligent. By understanding your cloud costs better, you can make smarter decisions to maximize your cloud investment!
Now that we have an idea of what is Komiser, let us see how we can implement this to monitor our cloud resources, provisioned on AWS.
Step 4: Configuring Komiser
To use Komiser with our provisioned cloud infrastructure on AWS, we first need to authenticate our AWS account with Komiser.
For this purpose, Komiser uses a config.toml file where well provide the necessary cloud provider account configuration, in this case for AWS.
Create a new config.toml file in your working directory and use the following code snippet:
[[aws]]name="Django-Komiser Project"source="CREDENTIALS_FILE"path="./path/to/credentials/file"profile="Admin-User"[sqlite]file = "komiser.db"
Explanation:
name - a custom name we wish to give for the account
source - defines the type of authentication method we wish to choose. There are mainly two methods to feed cloud provider credentials to Komiser:
Using environment variables:
 source="ENVIRONMENT_VARIABLES"
Using a credentials file:
 source="CREDENTIALS_FILE"
    Here, we are using a credentials file.
path - specifying the path to the AWS credentials file.
profile - specifying the AWS account profile to use with Komiser.
For persisting the AWS account data, we are using a simple SQLite file called komiser.db, which is one of the two methods to persist data in Komiser.
Note:
It is not recommended to add your AWS Access and Secret Access keys in the credentials file when working in a production environment. The most secure way of authentication is by using temporary credentials through IAM roles.
To learn more, refer to the documentation.
Now that we have configured our AWS account credential with Komiser, we can start the local Komiser instance using the following command:
komiser start --config config.toml
As we execute this, Komiser engine will start generating the following output continuously:
You'll now be able to access the Komiser dashboard at http://localhost:3000
Congratulations 🎉 you have successfully configured and authenticated Komiser to monitor the resources in your AWS account!
Step 5: Fine-Tuning Resource Monitoring with Tags
You can now have a detailed view of all the active AWS resources in your account by heading over to the Inventory section, as shown below:
This is where things get a little bit more interesting!
While provisioning the cloud infrastructure using Terraform, we tagged all the resources associated with our Django application as shown below:
tags = {    Name = "komiser-django-app"  }
Now, we can filter out the specific cloud resources/services associated with our Django application using this tag name in Komiser 😮
Go ahead and add a new filter in the Inventory section using the following configuration:
Key name - Name
Key value - komiser-django-app
When applied, this will filter out and display only the cloud resources associated with our Django application, as shown below:
So, now we know the exact number of cloud resources our Django application depends upon which is 11 in this case and the cloud costs for each resource are being constantly monitored by Komiser!
With this, we have successfully built a Cloud Cost Monitoring system for our simple Django ToDo list application using Komiser. Congratulations on making it this far 🎉
Conclusion
In this article, we embarked on a journey to explore the realm of cloud cost management with Komiser.
We began by laying the foundation, understanding project architecture, and covering essential prerequisites. Configuring our simple Django ToDo list application from testing to containerization, we dove into provisioning our cloud infrastructure using Terraform and then, remotely deploying our application with the Ansible playbook.
As we shed some light on Komiser, we learned about its unique features and how it effectively addresses the cloud cost challenges. At last, we configured Komiser to continuously monitor our AWS resources, associated with our application.
This tutorial equips you with the knowledge and tools to make informed decisions while driving cost savings in your cloud infrastructure.
Remember, Komiser is your trusted companion in maintaining cost efficiency across your cloud environment. The possibilities are endless, and the savings are real  thanks to the power of Komiser.
Additional Resources
Here are a list of some additional resources for your reference:
Source Code for the project.
Monitoring a Next.js Application with Komiser
How to practice FinOps with Komiser
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Get Ready for Wasm Day at Kubecon NA 2023!
Chad M. Crowell — Sat, 28 Oct 2023 12:30:09 GMT
We're all gearing up for Wasm Day at Kubecon NA in Chicago, IL on November 6th, 2023! Wasm Day is a full day dedicated to all things WebAssembly, where like-minded wasm wizards gather to discuss, explore, and push the boundaries of this amazing technology. Whether you're a seasoned wasm expert or just dipping your toes into the wasm waters, this article aims to prepare and excite you for Wasm Day!
Not only will there be leading experts and enthusiasts from around the globe to share knowledge with, but also you'll be able to try out the latest and greatest wasm demos, build your own Wasm projects, and learn how to use the new Web Assembly Studio tooling. Check out the full Wasm Day schedule here.
https://events.linuxfoundation.org/kubecon-cloudnativecon-north-america/co-located-events/cloud-native-wasm-day/
 
What is Wasm?
This versatile technology is revolutionizing the way we write and run code, offering lightning-fast performance, cross-language compatibility, and so much more! At its core, wasm is a low-level bytecode format that serves as a universal binary instruction set for web browsers and cloud-native web apps. In simple terms, it's a way to write high-performance code in programming languages like C++, Rust, or even Python, and run it directly in the browser or runtime environment at near-native speeds. This means that you can write complex applications with low overhead, fast load times, and high performance.
Wasm is a compact binary instruction format for compiling code to a portable target. This compiled binary, called a "module", can run in any place that includes a Wasm runtime. By design, these modules run in a sandboxed environment. They only have access to its memory and the resources granted by the runtime.
In the context of Kubernetes, Wasm is particularly useful for running serverless applications. Serverless applications are event-driven and automatically scale in response to demand. Traditionally, serverless applications have been written using runtime-specific frameworks or languages, which limit portability and require developers to learn multiple programming environments. Wasm solves this problem by providing a platform-agnostic runtime environment for serverless applications, enabling developers to write code in any language and execute it within a Wasm runtime. Before you come to Wasm day, I would check out this article that provides more detail on how Kubernetes and Wasm are better together.
To run Wasm applications in Kubernetes, several components are needed, so I highly encourage you to check out WebAssembly on Kubernetes: everything you need to know by Nigel Poulton.
https://nigelpoulton.com/webassembly-on-kubernetes-everything-you-need-to-know/
 
WasmCon 2023
Did you miss out on the latest WasmCon event in September? No worries, because I've got all the highlights right here for you! Developers at the event talked about the many benefits of using WebAssembly, such as improved performance, portability, and security. They also shared exciting updates on projects like Wasi, which is helping to make it easier to run WebAssembly code outside of the browser.
Christoph Voigt gave an excellent racap of the event, including his list of top talks from the event. Check it out here.
Additionally, at WasmCon, Angel M De Miguel Meana of VMware and Justin Cormack of Docker gave a great talk called "Develop Wasm Applications with Docker". In this talk, they explore how containers and WebAssembly can work together to unlock the potential of both technologies. By watching this, you will learn how to mix and match containers and Wasm modules and the benefits of doing so. You will learn how to run your first projects using your favorite language with WebAssembly. Watch this talk here.
https://youtu.be/xPO3-TOZxW0?si=qoIcleGwhq94LeQH
 
By far, the most viewed talk at WasmCon was by Luke Wagner, Distinguished Engineer at Fastly called "What is a Component (and Why?)". In this talk, Luke defines what a component is, how it relates to other things we're familiar with, and what sorts of new powers components unlock for us in the future. Watch it here. Luke's talk also includes a live demo of a component-based web app that uses React and Redux.
https://youtu.be/tAACYA1Mwv4?si=z3ROX0uIB2wgkTno
 
Top 10 Wasm Resources
In rapid-fire fashion, I will now present the top 10 resources to help you become a wasm wizard in no time, and get hands-on and involved with the WebAssembly community:
MDN Web Docs 📚: A comprehensive guide from the Mozilla Developer Network, covering everything wasm-related. This is your go-to resource for getting started.
 %[https://developer.mozilla.org/en-US/docs/WebAssembly] 
WebAssembly Weekly 📰: Stay up to date with the latest news, tutorials, and projects surrounding wasm with this awesome newsletter.
 %[https://wasmweekly.news] 
WebAssembly.org 🏗: Find resources to contribute and collaborate with others. Find the official newsletter or join the Discord!
 webassembly.org
Rust and WebAssembly 🦀🕸: Dive into the world of wasm with Rust, a modern systems programming language known for its low-level control and safety features. This book by Mozilla will have you feeling like a wasm pro in no time.
 The Rust and WebAssembly Book
A WebAssembly Collection 🚂: Resources from the Chrome team, to help you on your WebAssembly journey, including WebAssembly in practice and case studies!
 WebAssembly Resources from the Chrome Team
WebAssembly by Example : Learn by doing! This collection of hands-on projects, complete with code samples, will help you master the art of wasm through practical application.
 Wasm By Example
WebAssembly in Action 🎬: This book is like a blockbuster movie for developers, blending theory with real-world examples to deepen your understanding of wasm.
 WebAssembly in Action Book
WebAssembly Awesome List 📜: A curated list of awesome resources, libraries, and frameworks for wasm. If you're looking for inspiration or specific tools, this is the place to be.
 Awesome Wasm
Made by WebAssembly 🥳: Want to see what projects are made with wasm? Explore a collection of wasm-powered projects on this website and marvel at the possibilities!
 Made with WebAssembly
WebAssembly Design : contains documents describing the design and a high-level overview of WebAssembly. The documents and discussions in this repository are part of the WebAssembly Community Group.
https://github.com/WebAssembly/design
 
Bonus
Here are some bonus resources, that will help you understand Wasm use cases that are for cloud-native applications, versus in the browser (and knowing the difference).
Docker & Wasm - The Powerful Combo
Here's a brief introduction to WASM and a discussion about the latest Docker and WASM integration with the example from the docs on how you can run WASM modules using the Docker desktop:
https://youtu.be/9JVV2qrp080?si=wZyB-pK3DN_dvwzd
 
Let's Learn Cloud Native WebAssembly
Get to know the Suborbital projects, choose a WASM runtime, and view a demo of the "batteries included" framework for building web servers using WebAssembly:
https://www.youtube.com/live/4KP5_fXlqDE?si=ktIFg2CQ4LTGTSnd
 
Summary
Now that we know what the Wasm use cases are, have a recap of WasmCon, and have the top 10 list of resources to get involved and deep dive into learning, I know that you'll be prepared for Wasm Day at KubeCon 2023 NA in Chicago. I can't wait to see you there!
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.


The Complete Guide to the dd Command in Linux
sysxplore — Wed, 02 Aug 2023 05:58:28 GMT
Introduction
The dd command is an extremely powerful Linux utility. It is commonly referred to as the "disk destroyer", data definition, disk dump, or "disk duplicator" due to its immense power and ability to directly interact with block devices. In this beginner's guide, we will explore the dd command, its syntax, and various use cases, highlighting its role in file copying, disk partition backup, and restoration, and creating bootable USB drives.
dd Command Syntax
The syntax of the dd command is simple. It reads from standard input and writes to standard output by default. Here is the basic syntax of the dd command:
$ dd [OPTION]...
It's worth noting that dd deviates from the standard convention of using the -- or - syntax for options, distinguishing it from most Linux shell commands.
dd command common options
The dd command accepts several options to customize its behavior and achieve specific tasks. Here are some of the most commonly used options:
Option Description
if Specifies the input file (source).
of Specifies the output file (destination).
bs Defines the block size to read from the input file and write to the output file.
count Specifies the number of blocks to copy.
skip Skips a specific number of blocks or bytes while reading the input file.
seek Skips a specific number of blocks or bytes while writing to the output file.
status Shows the progress of the dd command.
conv Specifies conversion options for the input or output file.
13 Practical examples of the dd command
Now, let's explore some practical examples of using the dd command in various scenarios.
How to copy files in Linux
To make a simple copy of a file, you can use the dd command with the if and of options. For example, to copy a file named source.txt to a new file named destination.txt, run the following command:
$ dd if=source.txt of=destination.txt
This command reads the contents of source.txt and writes them to destination.txt.
Prevent overwriting destination file
When using the dd command, if there is already a file with the same name at the destination, it will be replaced by default. This means that the existing file will be overwritten. Use the conv=notrunc option to prevent overwriting an existing file while using the dd command. This option ensures that the destination file is not truncated during the write process. For example:
$ dd if=source.txt of=destination.txt conv=notrunc
Appending Data to a File
In addition to avoiding overwriting, you can also append data to an existing file by using the conv=append option. Let's consider an example where we want to append the contents of users.txt to newusers.txt:
$ dd if=users.txt of=newusers.txt conv=append
With this command, the dd command reads the data from users.txt and appends it to the end of newusers.txt.
Converting text case
The dd command can also be used to perform text conversions. For instance, to convert all the text in a file from lowercase to uppercase, use the conv option with the lcase and ucase conversion parameters. Consider the following example:
$ dd if=lowercase.txt of=uppercase.txt conv=ucase
This command reads the contents of lowercase.txt, converts all lowercase characters to uppercase, and saves the result in uppercase.txt.
Similarly, you can convert text from uppercase to lowercase using the conv option with the ucase and lcase conversion parameters. Here's an example:
$ dd if=uppercase.txt of=lowercase.txt conv=lcase
Creating a Backup of a Linux Disk Partition
One of the powerful use cases of the dd command is creating backups of disk partitions. This can be particularly useful for system administrators or users who want to preserve the state of their disk partitions. To back up a disk partition, you need to identify the block device associated with the partition, usually represented by a device file in the /dev directory.
For example, to back up the first partition of the disk located at /dev/sda, you would use the following command:
$ dd if=/dev/sda1 of=partition_backup.img
This command reads the content of /dev/sda1, the first partition of the disk, and saves it to a file named partition_backup.img.
Once you have a backup of a disk partition, you can use the dd command to restore it when needed. To restore a disk partition, you would reverse the input and output files in the command. Here's an example:
$ dd if=partition_backup.img of=/dev/sda1
This command reads the content from the partition_backup.img file and writes it to the /dev/sda1 partition, effectively restoring the partition to its previous state.
Creating a Backup of the Entire Linux Hard Drive
In addition to backing up individual partitions, you can use the dd command to create a backup of the entire Linux hard drive. This allows you to capture the complete state of the disk, including all partitions, boot records, and file systems.
To back up the entire hard drive, you would specify the hard drive's block device as the input file. For instance, to back up the hard drive located at /dev/sda, run the following command:
$ dd if=/dev/sda of=hard_drive_backup.img
This command reads the entire content of /dev/sda and saves it to a file named hard_drive_backup.img.
Similarly, you can use the dd command to restore a previously created backup of the entire Linux hard drive. This can be extremely useful in situations where you need to recover the whole system from a backup.
To restore the hard drive, you would reverse the input and output files in the command. For example:
$ dd if=hard_drive_backup.img of=/dev/sda
This command reads the content from the hard_drive_backup.img file and writes it back to the /dev/sda hard drive, effectively restoring the entire system.
Creating a Backup of the Master Boot Record
The Master Boot Record (MBR) is a crucial component of a disk that contains the boot loader and partition table. By using the dd command, you can create a backup of the MBR, ensuring that you can recover the boot sector if it gets corrupted or overwritten.
To back up the MBR, you can use the dd command to copy the first 512 bytes of the disk. Here's an example:
$ dd if=/dev/sda of=mbr_backup.img bs=512 count=1
In this command, if=/dev/sda specifies the disk from which to read the MBR, of=mbr_backup.img specifies the output file to save the backup, bs=512 sets the block size to 512 bytes (the size of the MBR), and count=1 specifies that only one block should be copied.
If you need to restore the MBR from a backup, you can use the dd command to write the contents of the backup file back to the disk. Here's an example:
$ dd if=mbr_backup.img of=/dev/sda bs=512 count=1
This command reads the content from the mbr_backup.img file and writes it back to the /dev/sda disk, effectively restoring the MBR.
Copying Content from a CD/DVD Drive
The dd command can also be used to create a bit-by-bit copy of a CD or DVD. This can be useful when you want to create an exact replica of the disc, including its file system and bootable properties.
To copy the contents of a CD/DVD drive, you would specify the CD/DVD drive as the input file (if) and an output file to save the copy. Here's an example:
$ dd if=/dev/cdrom of=disk_copy.iso
In this command, /dev/cdrom represents the CD/DVD drive, and disk_copy.iso is the output file where the copied data will be saved.
Compressing Data Read by dd
As mentioned earlier, one common use of the dd command is disk cloning. By copying block devices byte by byte, dd creates an exact replica of a disk. When cloning a disk to a file, we can enhance the result and reduce the file size by piping the data read by dd through compression utilities like gzip. For example, to create a clone of the entire /dev/sda block device, we can execute the following command:
$ sudo dd if=/dev/sda bs=1M | gzip -c -9 > sda.dd.gz
In this example, we specify that dd should read from the /dev/sda device and adjust the block size to 1M for improved performance. We then pipe the data to the gzip program, utilizing the -c option to output to stdout and the -9 option for maximum compression. Finally, we redirect the output to the "sda.dd.gz" file.
Skipping Bytes or Characters When Reading the Input File
The dd command provides the skip option, which allows you to skip a specific number of bytes or characters while reading the input file. This can be useful when you need to exclude certain parts of the file. Here's an example:
$ dd if=user.txt of=newusers.txt skip=100
In this command, the dd command skips the first 100 bytes of data in users.txt and writes the remaining content to newusers.txt.
Wiping a Block Device
Another valuable use case for dd is wiping a device. There are various situations where such an operation becomes necessary, such as preparing a disk for sale to ensure the previous data has been completely erased for privacy reasons or wiping data before setting up encryption. In the former case, overwriting the disk with zeros is sufficient:
$ sudo dd if=/dev/zero bs=1M of=/dev/sda
With this command, dd reads from the /dev/zero device, which provides null characters, and writes them to the target device until it is completely filled.
When setting up an encryption layer on our system, it is advisable to fill the disk with random data instead. This step renders the sectors that will contain data indistinguishable from the empty ones, thus preventing potential metadata leaks. In this scenario, we can read data from either the /dev/random or /dev/urandom devices:
$ sudo dd if=/dev/urandom bs=1M of=/dev/sda
Both commands will require a significant amount of time to complete, depending on the size and type of the block device and the source of random data used. It's worth noting that /dev/random is slower as it blocks until it gathers sufficient environmental noise, but it produces higher-quality random data compared to /dev/urandom.
Creating a Bootable USB Drive
The dd command is widely used for creating bootable USB drives from ISO images. This is particularly useful when installing or booting operating systems or live distributions from a USB device.
To create a bootable USB drive, you would specify the ISO file as the input file (if) and the USB drive as the output file (of). Here's an example:
$ dd if=linux_distro.iso of=/dev/sdX bs=4M status=progress
In this command, linux_distro.iso represents the ISO image of the Linux distribution, /dev/sdX is the USB drive (replace X with the appropriate drive letter), bs=4M sets the block size to 4 megabytes for faster copying, and status=progress displays the progress of the dd command.
Displaying the Progress Bar
By using the status=progress option with the dd command, you can display a progress bar that indicates the completion percentage of the ongoing operation. This can be helpful, especially when dealing with large files or lengthy processes.
For example, to copy a file and show the progress bar, use the following command:
$ dd if=source_file of=destination_file status=progress
This command reads the content from the source_file and writes it to the destination_file, while displaying a progress bar.
Conclusion
In this tutorial, we learned how to use the dd command. We also covered some practical use cases, such as creating backups and bootable USB sticks, Because dd is a very powerful utility, it must be used with extreme caution: simply switching the input and output targets can, in some cases, destroy data on a disk.
Remember to refer to the official documentation and additional resources to further expand your knowledge and explore advanced usage scenarios.
Be sure to follow us on Twitter and Instagram.
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.


Simplified Introduction To Bacalhau
Saloni Narang — Sun, 09 Jul 2023 19:27:17 GMT
Bacalhau is an open-source project wherein the existing workflows can be streamlined without rewriting by running Docker containers and Web Assembly (WASM) images. This architecture which is referred to as Compute Over Data (CoD) enables users to run compute jobs where the data is generated and stored. Bacalhau platform makes computation secure, fast, and cost-efficient.
The interesting fact is the name Bacalhau is derived from the Portuguese word for salted codfish which is a common dish in Portugal.
Features of Bacalhau:
Fast job processing: As Bacalhau supports Docker and WASM, jobs can be easily executed without complex configurations and making any changes to the base code. This helps in orchestrating and distributing workloads easily.
Fault Tolerance: If the node fails, Bacalhau automatically finds another node to run the job successfully. It ensures the job is completed even if there are network disruptions.
Low cost: Bacalhau utilizes idle computing capacity
Secure: Here instead of bringing data over compute, you bring compute over data. This can reduce the leaking of data and more granular permissions to our data.
Large-scale data: Bacalhau can run anywhere from low-power edge devices to the largest VMs. You can batch-process petabytes of data.
More info on the project can be found here: https://github.com/bacalhau-project/bacalhau#why-bacalhau
Architecture
Bacalhau allows decentralized communication as it is a peer-to-peer network of nodes.
Each node in the network has two components: a requester and compute component.
Requester Component:
The requester node plays a very crucial role in the Bacalhau network as it serves as the main point of contact for the users. It majorly handles requests from users using JSON over HTTP. When a job is submitted to a requester node,
The requester component takes the input of a request, now this request comes from the user via the CLI. Once the request for executing the jobs comes in, the requester node is responsible for broadcasting the job to be executed over the network where all the compute nodes are connected. It is also responsible for effective communication between the nodes on the network.
How this works is that once the job is broadcasted over the network, compute nodes will accept or reject that request. Now, there is only a single requester node for a particular job.
The accepted compute nodes will execute the job and produce a verification proposal, then these proposals will be accepted or rejected, and then the compute nodes will publish the raw results.
Compute Component:
Compute Node is responsible for executing the jobs and producing the results. Once the bid is made by compute node and accepted by the requester node, the compute node runs the job using executors, each of which has its collection of storage providers. Once the executor executes the job, compute node produces a verification proposal. These proposals will either be rejected or accepted, after which compute node produces its raw results via the publisher interface.
Interface:
The interface handles the distribution, execution, storage, verification, and publishing of jobs. The following are the interfaces:
a. Transport: The transport interface uses a protocol called bprotocol to distribute job messages efficiently to other nodes on the network. It is responsible for sending messages about the jobs that are created and executed to other compute nodes. It ensures that the messages are delivered to the correct node without causing network congestion.
b. Executor: The executor is mainly responsible for two actions that are running the job and the other one is to present the storage volumes in a format that is suitable for the executor.
When the job is completed, the executor will merge the stdout, stderr, and named output volumes into a results folder. This results folder is used to generate a verification proposal that is sent to the requester.
c. Storage Provider: There are multiple execution platforms like Docker and WASM and then the executor will select the appropriate storage provider based on this implementation. So, there will be multiple storage providers present in the network.
d. Verifier: It checks the results produced by the executor against results produced by other nodes and transports those results back to the requester node. Both compute and requester node have verifier component. The compute node verifier produces a verification proposal based on having run the job, while the requester node verifier collates the proposals from various compute nodes.
e. Publisher: It publishes the final result of a job to a public location where users can access them. The default publisher used is Estuary and if Estuary is used as the publisher, the results will also be stored on Filecoin. The published results are stored with a unique content identifier (CID).
Installation and Demo
Now that we know what Bacalhau is, and how it works, Let's try to install the CLIand in future blogs, we will start looking into more deep dive demos.
I am using play with docker to install bacalhau.
docker pull ghcr.io/bacalhau-project/bacalhau:latest
docker run ghcr.io/bacalhau-project/bacalhau:latest
To verify and check the version of Balcalhau:
docker run -it ghcr.io/bacalhau-project/bacalhau:latest version
You can also do a curl command to download the cli binary and use it.
curl -sL https://get.bacalhau.org/install.sh | bash
Running hello world(as usual, nothing is complete without a hello world)
docker run -t ghcr.io/bacalhau-project/bacalhau:latest docker run     --id-only     --wait     ubuntu:latest --         sh -c 'uname -a && echo "Hello from Saloni Bacalhau!"'095be3fd-095f-4c55-bacf-19578dc72580
As you can see above you have the job ID that will be used to fetch the status, this is actually hitting the public network to execute the job.
Export the Job ID
export JOB_ID=095be3fd$ docker run -t ghcr.io/bacalhau-project/bacalhau:latest  list --id-filter ${JOB_ID} CREATED   ID        JOB                      STATE      VERIFIED  PUBLISHED                18:59:25  095be3fd  Docker ubuntu:latest...  Completed            ipfs://Qmex4tjNsxY8J...
so our job has successfully been completed and we need to download the results which we can do by running the bacalhau get command.
$ bacalhau get $JOB_IDFetching results of job '095be3fd'...Computed default go-libp2p Resource Manager limits based on:    - 'Swarm.ResourceMgr.MaxMemory': "17 GB"    - 'Swarm.ResourceMgr.MaxFileDescriptors': 524288Theses can be inspected with 'ipfs swarm resources'.2023/07/09 19:17:00 failed to sufficiently increase receive buffer size (was: 208 kiB, wanted: 2048 kiB, got: 416 kiB). See https://github.com/quic-go/quic-go/wiki/UDP-Receive-Buffer-Size for details.Results for job '095be3fd' have been written to.../tmp/job-095be3fd2023/07/09 19:17:01 CleanupManager.fnsMutex violation CRITICAL section took 22.293641ms 22293641 (threshold 10ms)
The output has been written to the /tmp directory. Finally, it's time to verify the results.
cat /tmp/job-095be3fd/stdout Linux fadcacb48980 5.19.0-1026-gcp #28~22.04.1-Ubuntu SMP Tue Jun 6 07:24:26 UTC 2023 x86_64 x86_64 x86_64 GNU/LinuxHello from Saloni Bacalhau!
Conclusion
In this blog, we learned about the basics of Bacalhau, what the project is all about, and how it works using the decentralized network of compute nodes to execute the jobs and publish the results. In the next part, we will go through a deep dive demo where we will try to execute the Job.
Share the blog if you learned about Bacalhau and I will post the next deep dive version soon.
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Microservices
Naved Ahmad — Mon, 05 Jun 2023 12:30:42 GMT
Imagine a rapidly growing application that handles numerous user requests, processes data, and provides seamless functionality to users. In the traditional monolithic architecture, the entire application is bundled into a single unit, tightly coupled and dependent on a single technology stack. As the application scales, challenges arise: coordination between teams becomes complex, scaling specific services is inefficient, and the release process becomes time-consuming.
In this blog, we'll explore the transformative power of microservices architecture and how it overcomes the limitations of the monolithic approach. By breaking down the application into smaller, self-contained microservices, each responsible for specific business functionality, we gain unprecedented flexibility, scalability, and maintainability. But that's not all  we'll delve into the challenges of microservices and the various communication methods between them. Plus, we'll discuss different approaches for managing code in microservices, from monorepo to polyrepo.
Before microservices - Monolithic Architecture
Before microservices, the standard way of developing applications was with a monolithic architecture. This means the whole code was a part of a single unit. Everything was developed, deployed and scaled as one unit. This means that the application must be written in a single language with one tech stack with a single runtime. And if you have different teams working on different parts of the application, they will need to coordinate to make sure they don't affect each other's work. Also, if developers change code for a specific functionality, then the whole application needs to be built and deployed as one package because you can't just update and deploy only one specific functionality in a monolithic architecture. So, this was the standard way of developing applications. But as applications grew in size and complexity, various challenges occurred.
Challenges of Monolithic Architecture
Coordination between teams became difficult when applications became large and complex.
You cannot scale a specific service, instead, you would need to scale the entire application, which meant higher infrastructure costs.
The release process takes longer, because for changes in any part of the application in any feature, you need to test and build the whole application to deploy those changes.
A bug in any module can bring down the entire application.
The solution to all these problems was microservices.
Microservices - Introduction
With microservices, we break down the application into essentially multiple smaller applications, so we have several small or micro applications that make up this one big application.
Here, we face some challenges. Like:
When do we create a microservice architecture?
How to decide how to break down the application?
Which code goes where?
How do they communicate?
Firstly, the best practice is to break down the application into components or microservices based on the business functionalities and not technical functionalities.
In terms of size, each microservice must do just one isolated thing.
A very important characteristic of each microservice is that they should be self-contained and independent of each other. This means each service must be able to be developed, deployed and scaled separately without any type of dependencies on any other service, even though they are part of the same application. This is called loosely coupled.
Each microservice has its versions which are not dependent on others.
Communication between microservices
There are three ways by which microservices can talk to each other.
Communication via API calls: Each microservice has its API, and they can talk to each other by sending requests to the respective API endpoint. This is synchronous communication, where one service sends a request to another service and waits for the response.
Communication via Message Broker: This is asynchronous communication. Here, services will send messages first to the intermediary message service or a broker such as RabbitMQ and the message broker will forward that message to the respective service.
Communication via Service Mesh: This method is becoming pretty popular, especially in the field of Kubernetes. With service mesh, you have kind of a helper service that takes over the complete communication logic, so you don't have to code this logic into the microservices and have this communication logic kind of delegated to this external service.
Since the services are all isolated and talk to each other either with API calls or using additional services you can even develop each service with a different programming language, and you can have dedicated teams that can choose their technology stack and work on their service without affecting or being affected by other service teams. And this is the most important advantage of microservices architecture over monolithic architecture.
Disadvantages of microservices
While microservices made developing and deploying applications easier in many aspects, they also introduced some other challenges that were not there before.
Configuring the communication between microservices, because a microservice may be down or unhealthy and not responding yet, while another service starts sending requests to its API expecting a fulfilled response but end up getting unexpected result.
With microservices deployed and scaled separately, it may become difficult to keep an overview and find out when a microservice is down or which service is down when something in the application is not working properly.
Tools are being developed to tackle these challenges. The most popular one we all know is Kubernetes, which is a perfect platform for running large microservices applications.
How to manage the code
Now. you must be wondering that if these microservices get developed and deployed separately, then how do we manage the code?
For this, we have two ways: Monorepo and Polyrepo.
Monorepo
Monorepo means having one Git repository for all the services.
But how do we structure multiple applications in one repository? A common way is using folders, where we have folders for each service, and all the codes for those services are in those respective folders.
A Monorepo makes code management and development easier because you only have to clone and work with one repository. Changes can be tracked together, tested together and released together.
There are some challenges we face with Monorepo. Like:
We know that the biggest advantage of microservices is to be completely independent and loosely coupled, but in the case of mono repo, there is a tight coupling of services.
When the application becomes big, git interactions (cloning, fetching and pushing) become slow.
In terms of the CI/CD pipeline, in most of the CI/CD platforms like GitLab CI/CD or Jenkins, we can only create one pipeline for one project. Since we are building multiple services with a single project pipeline and that means we need to add additional logic in our pipeline code that makes sure to only build and deploy the service which has changed.
Since we have just one main branch because we have one repository, if developers of one of the services break the main branch, then other services and their pipelines will be blocked as well.
Polyrepo
In this, for each service, we create a separate git project.
However, there are separate application repositories, they are still part of this bigger application. So we want to have some connection between these repositories for easy management.
Here we have, a separate pipeline for each repository.
It also has some downsides. They are:
Having code in multiple repositories can make working on the project harder, especially if we need to change two or more services at once.
Searching, testing and debugging are more difficult.
So, that was a blog on Microservices.
Connect with me to get more content on DevOps, Open source and Cloud.
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.


The Ultimate Guide to Audit Logging in Kubernetes: From Setup to Analysis
Santoshdts — Mon, 15 May 2023 12:30:39 GMT
Kubernetes is a popular container orchestration tool that has revolutionized the way developers deploy and manage their applications. However, as with any complex system, it's important to have visibility into what's going on under the hood. That's where auditing comes in. Kubernetes auditing entails keeping a record and evaluating all actions that occur within the cluster, including requests made to the API server and the creation or removal of pods. This data can be utilized to identify security breaches, solve problems, and ensure adherence to regulatory standards. In this article, we'll take a closer look at Kubernetes auditing and how it can benefit your organization.
Auditing assists in maintaining compliance. It helps by providing the ability to retrieve certain sequences of events a user has initiated. This ability to retrieve the historical records of changes made to the cluster provides deep insights into strengthening the regulatory framework in the organization.
What are the stages during Audit logging
The kube-apiserver allows us to capture the logs at various stages of a request sent to it. This includes the events at the metadata stage, request, and response bodies as well. Kubernetes allows us to define the stages which we intend to capture. The following are the allowed stages in the Kubernetes audit logging framework:
RequestReceived: As the name suggests, this stage captures the generated events as soon as the audit handler receives the request.
ResponseStarted: In this stage, collects the events once the response headers are sent, but just before the response body is sent.
ResponseComplete: This stage collects the events after the response body is sent completely.
Panic: Events collected whenever the apiserever panics.
There are lots of calls made to the API server, and we need a mechanism to filter out the events based on our requirements. Kubernetes auditing provides yet another feature for this very reason  the level field in the policy configuration.
What are the levels at which Auditing needs to happen
The level field in the rules list defines what properties of an event are recorded. An important aspect of audit logging in Kubernetes is, whenever an event is processed it is matched against the rules defined in the config file in order. The first rule sets the audit level of logging the event. Kubernetes provides the following audit levels while defining the audit configuration.
None: This disables logging of any event that matches the rule.
Metadata: Logs request metadata (requesting user/userGroup, timestamp, resource/subresource, verb, status, etc.) but not request or response bodies.
Request: This level records the event metadata and request body but does not log the response body.
RequestResponse: It is more verbose among all the levels as this level logs the Metadata, request, and response bodies.
Configuration
Auditing in Kubernetes is not enabled by default. We need to configure this feature by providing a set of rules defining the events we intend to keep track of, and the location where we intend to store the audit logs.
Let's first discuss the rules in the audit configuration.
Rules
The rules in the audit config mainly comprise of Level and Stages. Of course, there are other parameters as well, like resources, verbs, users/userGroups, etc. The meaty part of the rule file is in level and resources lists.
Following is an example Audit policy configuration:
apiVersion: audit.k8s.io/v1kind: Policyrules:  - level: Metadata namespaces: ["default"] verbs: ["get","list","delete"] resources:    - group: "" resources: ["pods"]  - level: RequestResponse omitStages: ["RequestReceived"] namespaces: ["default"] verbs: ["create","get","delete"] resources:    - group: "" resources: ["secrets","configmaps"]  - level: Metadata namespaces: ["default"] resources:    - group: "" # pods/exec is a subresource resources: ["pods/exec"]   - level: None verbs: ["watch"] namespaces: ["*"] resources:    - group: ""
In the above configuration file, the first three lines containing apiVersion, kind, and rules are required fields. Under rules list, all the parameters based on our logging requirements are defined.
In the first block, we are logging the information at Metadata level. The events we are logging for this block are for get, list, and delete operations on the resources defined in the resources list in the default namespace. In this case, whenever the kube-apiserver receives a request for the get or delete method on a pod object, the events for the same will be logged at the metadata level.
Similarly, in the second block, we are logging based on the RequestResponse level on the secret and configMap objects in the default namespace, whenever the API server receives a get or delete HTTP method. This rule will not log events generated during ResquestReceived stage defined in omitStages.
In the third rule, we are logging something interesting. this rule will log all the information whenever there is an exec into a pod in the default namespace. Finally, we have a rule defining all the irrelevant information we are not bothered to collect the logs for. As logging occupies lots of disk space, we need to precisely define what are the events that we are not interested in logging. Kubernetes provides a field for this called omitStages, which skips logging for that particular stage defined under omitStages. This rule will not log any information on all the objects in the core group on the watch verb.
Configuring the APIServer ad enabling Auditing in k8s
After creating the policy configuration, we must set up the kube-apiserver by providing it with the necessary information, such as the location of the configuration file, log file details, and log size. This can be achieved by inserting the specified lines into the command field of the kube-apiserver manifest file.
spec: containers:  - command:     - --audit-policy-file=/etc/kubernetes/audit-policy.yaml     - --audit-log-path=/var/log/audit/audit.log    - --audit-log-maxage=5 #No of days we want to retain the logs 
you can define a few more properties with respect to maintaining the logs. --audit-log-maxbackup for defining the maximum of audit log files to retain and --audit-log-maxsize sets the maximum size of the log files in megabytes before rotating.
As these files need to be accessed by the kube-apiserver pod. We need to make it available within the pod by mounting the hostPath to the location of the policy and log file. This makes the audit records persistent.
 volumes:  - name: audit hostPath: path: /etc/kubernetes/audit-policy.yaml type: File  - name: audit-log hostPath: path: /var/log/audit/audit.log type: FileOrCreate
 volumeMounts:    - mountPath: /etc/kubernetes/audit-policy.yaml name: audit readOnly: true    - mountPath: /var/log/audit/audit.log name: audit-log readOnly: false
This step completes the configuration of the audit log in the kube-apiserver and sets a log location for our audit logs. This step of modification will restart the kube-apiserver and we should already be receiving some logs in the log location we had defined.
In case the kube-apiserver doesn't come up online. We need to look at the pod logs at /var/log/pods/kube-system_kube-master_xxx/kube-apiserver/x.log location for any misconfiguration and errors.
Now, if we perform any action defined in the audit log policy configuration. We should expect the audit logs to appear in the log file, if we inspect the same we should view the logs in a somewhat similar fashion:
{ "kind": "Event", "apiVersion": "audit.k8s.io/v1", "level": "Metadata", "auditID": "c762ad6d-9994-4b03-8e6b-eee5e19d3d98", "stage": "ResponseComplete", "requestURI": "/api/v1/namespaces/default/pods/test", "verb": "get", "user": { "username": "kubernetes-admin", "groups": [ "system:masters", "system:authenticated"    ]  }, "sourceIPs": [ "192.168.56.11"  ], "userAgent": "kubectl/v1.26.0 (linux/amd64) kubernetes/b46a3f8", "objectRef": { "resource": "pods", "namespace": "default", "name": "test", "apiVersion": "v1"  }, "responseStatus": { "metadata": {}, "code": 200  }, "requestReceivedTimestamp": "2023-03-29T15:33:20.131662Z", "stageTimestamp": "2023-03-29T15:33:20.133724Z", "annotations": { "authorization.k8s.io/decision": "allow", "authorization.k8s.io/reason": ""  }}
From the above json blob, we can identify some of the important fields which might give us some insights into what happened inside the cluster. Some of the fields of interest are requestURI, verb, sourceIP, user, and the objectRef objects. Mind you these are just a few fields that are captured in Metadata level, we might get a lot more information if we change the level to RequestResponse type.
Viewing all the relevant information from such json blobs may become daunting. To make our life easy, there is yet another tool  jq which helps us in parsing such large amounts of data contained in JSON structure.
Leveraging jq to view the audit logs
jq, as aptly defined in its [official documentation](https://stedolan.github.io/jq/) is a lightweight and flexible command-line JSON processor. We can leverage the power of jq and target specific fields in the Kubernetes audit logs.
For example, if we are interested to track all the activity related to the HTTP methods sent to API server that are defined in our rules. We may simply use the command tail -f /var/log/audit/audit.log | jq .verb. Or we can track which resources the API requests are made to, by using the following command tail -f /var/log/audit/audit.log | jq .objectRef.resource
We may go a step ahead and use more advanced features of jq such as filters.
For example, in our audit policy, we have defined to log all actions if we perform a create, get or a delete operation on secrets. In order to pretty-print the audit logs, we can use the jq filter as shown in the image below.
For detecting a shell being spawned inside a pod:
The above example is for filtering each resource based on some selection criteria. We can further leverage the power of jq by saving all our filters on various resources in a single file and supplying that file as a command line argument to jq. This command will spit out the log in a human-readable format immediately whenever it detects an event matching the rules define in our audit policy configuration file.
For example, whenever a configMap object is created, which contains some sensitive information ( username or password as field names in this example). A specific jq filter will kick in and trigger a human-readable output from the audit log.
The example filter used for this demo can be found on my GitHub gist.
The above examples are for learning and experimenting purposes. In the production cluster, it is advisable to use a centralized logging solution like the ELK stack to collect, process, and analyze the logs. Where, Logstash can be used as a log collector, which can ingest logs from Kubernetes audit logs, and then Elasticsearch can be used to store the logs. Finally, Kibana can be used to visualize and analyze the logs.
Conclusion
In conclusion, Kubernetes auditing is a crucial tool for maintaining visibility and control over the activity within a cluster. Logging is a critical aspect of security in production clusters, and it's essential to have a robust method to audit logging. By recording and analyzing all events, it can help organizations track down security breaches, troubleshoot issues, and ensure compliance with regulatory requirements. With the flexibility provided by the ability to define stages and levels of logging, organizations can tailor their auditing to their specific needs. However, it is important to remember that auditing can generate a lot of data, so it's essential to configure the policy and the storage location carefully to ensure effective and efficient use of resources. Overall, Kubernetes auditing is an essential part of any organization's strategy for managing and securing their applications in a Kubernetes environment.
By enabling audit logging, choosing the right audit policy, using a centralized logging system, and monitoring audit logs, you can ensure the security of your production cluster and quickly detect and respond to any security threats.
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Docker Networking Demystified
Saurav Rana — Wed, 10 May 2023 12:30:39 GMT
Introduction
Docker has transformed the way developers build, ship, and run applications. Its networking capabilities are a key feature that allows containers to communicate with one another and the outside world. However, Docker networking can be complex and challenging to comprehend, particularly for those new to the technology. In this article, we will demystify Docker networking by exploring how containers communicate with each other and the concept of network namespaces. Read on to gain a better understanding of Docker networking.
Network Namespaces
On Linux, a network namespace is a technique used to isolate network resources, such as network interfaces, routing tables, and firewall rules, from the rest of the system. Each network namespace provides an independent network stack that can be used by processes running within that namespace. This allows multiple applications to run on the same host while using different network configurations.
Docker leverages network namespaces to provide container-level networking isolation. When a Docker container is started, it is assigned its own network namespace. This namespace provides a virtual network interface, IP address, routing table, and firewall rules for the container. The Docker daemon also creates a bridge network, which acts as a virtual switch connecting all the containers in the same network. Containers can communicate with each other through this bridge network or with the outside world through the host's network interface.
Docker also supports other network drivers, such as overlay networks, which allow containers to communicate across multiple hosts, and host networks, which allow containers to use the host's network interface directly. Each network driver uses a different network namespace to provide network isolation and different features for container networking.
Overall, network namespaces provide a powerful tool for isolating and managing network resources on Linux, and Docker leverages this technology to provide flexible and secure networking for containerized applications.
How it works
To recreate docker containers at the most basic level we can create two network namespaces and then create a virtual Ethernet pair (veth pair) and plug the two ends into each of the network namespaces now this creates our simple container network in the most basic terms.
Below is the script that does the same. So let's understand it line by line.
https://gist.github.com/sauravrana646/f68f9cd8acf05e7994c72c475feca66f
 
set -o pipefail
This sets the Bash option "pipefail", which causes a pipeline (a series of commands separated by pipes) to fail if any command in the pipeline fails. This ensures that errors in the pipeline are detected, and the script exits with a non-zero status.
string="$1"
This assigns the first command-line argument to the variable "string".
if [ "$string" = "up" ]; then
This checks if the value of "string" is equal to "up". If it is, the following commands will be executed.
ip netns add net1ip netns add net2
These commands create two network namespaces called "net1" and "net2" using the "ip netns add" command.
ip link add veth1 netns net1 type veth peer name veth2 netns net2
This creates a pair of virtual network interfaces called "veth1" and "veth2", with "veth1" connected to "net1" and "veth2" connected to "net2", using the "ip link add" command.
ip netns exec net1 ip addr add 10.100.0.1/16 dev veth1ip netns exec net2 ip addr add 10.100.0.2/16 dev veth2
These commands assign IP addresses to both virtual interfaces using the "ip addr add" command.
ip netns exec net1 ip link set lo upip netns exec net2 ip link set lo up
These commands bring up the loopback interface in both namespaces using the "ip link set" command.
ip netns exec net2 ip link set veth1 upip netns exec net2 ip link set veth2 up
These commands bring up the virtual network interfaces in "net2" using the "ip link set" command.
ip netns exec net1 ping -c 3 10.100.0.2
This command tests connectivity from "net1" to "net2" by pinging the IP address of "veth2" in "net2" using the "ping" command.
ip netns exec net2 ping -c 3 10.100.0.1
This command tests connectivity from "net2" to "net1" by pinging the IP address of "veth1" in "net1" using the "ping" command.
if [ "$1" == "down" ]; then
This checks if the value of the first command-line argument is equal to "down". If it is, the following commands will be executed.
ip netns delete net1ip netns delete net2
These commands delete the network namespaces "net1" and "net2" using the "ip netns delete" command.
Let's run it
To run this bash script, we can make an executable using chmod +x just run it with the bash -c parameter.
As we can see, our script ran, and we can ping namespaces from one another. We can also get inside these namespaces to see their details.
sudo ip netns exec net1 ip a
This command shows us the interfaces inside the net1 namespace, and we can see that net1 has veth1 interface which is assigned the same IP as we did in the script.
This marks the completion of our first part but there's some problem with this approach that we are going to discuss now.
The Problem
We have just begun to scratch the surface of Linux and virtual networks. As we learned how, two containers can talk to each other through virtual Ethernet pairs, but there is a problem. Can you guess what may be the issue?
Yes, the issue is, that it is not scalable if we have a large no. of containers because we have to create an Ethernet pair and attach it to all the containers so that all can communicate with each other.
For example, if we have four containers, we would need to create N*(N-1)/2 containers, i.e. we would be needing 6 veth pairs to connect all of them.
Thus, to solve this problem, the concept of bridges was introduced.
Linux Bridge
A Linux bridge is a software bridge that connects multiple network interfaces together into a single network segment. It is a layer 2 devices that operates at the data link layer of the OSI model, allowing it to connect network segments together by forwarding traffic between interfaces.
In the context of Docker, a Linux bridge is a common networking solution used to connect containers running on a host system to the external network. When a container is created, Docker creates a virtual Ethernet (veth) pair, which consists of two virtual network interfaces: one end is attached to the container's namespace, while the other end is attached to the host system's bridge interface.
The Linux bridge is created on the host system, and all containers are connected to the bridge via their virtual interfaces. This allows them to communicate with each other and with the external network.
One of the primary benefits of using a Linux bridge with Docker is that it simplifies the process of managing container networking. With a bridge, Docker containers can be easily connected to a single network segment, eliminating the need to manually configure network interfaces for each container.
Let's create a bridge
Here's the modified bash script to include a bridge as a mode of communication between the namespaces.
https://gist.github.com/sauravrana646/56d4efb50b4dbd4ebbe077652591c511
 
Let's go through it line by line one more time.
ip netns add net1ip netns add net2
These commands create two new network namespaces, named net1 and net2, respectively.
ip link add veth1 type veth peer name vethpeer1ip link add veth2 type veth peer name vethpeer2
These commands create two virtual Ethernet devices named "veth1" and "veth2", each with a peer device named "vethpeer1" and "vethpeer2", respectively.
ip link set veth1 upip link set veth2 up
These commands bring the veth1 and veth2 interfaces up.
ip link set vethpeer1 netns net1ip link set vethpeer2 netns net2
These commands move the peer devices "vethpeer1" and "vethpeer2" into the network namespaces "net1" and "net2", respectively.
ip netns exec net1 ip link set lo upip netns exec net2 ip link set lo up
These commands bring the loopback interfaces up in the net1 and net2 network namespaces, respectively.
ip netns exec net1 ip link set vethpeer1 upip netns exec net2 ip link set vethpeer2 up
These commands bring up the peer devices "vethpeer1" and "vethpeer2" in their respective network namespaces.
ip netns exec net1 ip addr add 10.100.0.10/16 dev vethpeer1ip netns exec net2 ip addr add 10.100.0.20/16 dev vethpeer2
These commands assign IP addresses to the vethpeer1 and vethpeer2 interfaces, giving them IP addresses 10.100.0.10 and 10.100.0.20, respectively.
ip link add br00 type bridgeip link set br00 up
These commands create a new bridge interface named br00 and bring it up.
ip link set veth1 master br00ip link set veth2 master br00
These commands add the veth1 and veth2 interfaces to the br00 bridge.
ip addr add 10.100.0.1/16 dev br00
This command assigns an IP address to the br00 interface, giving it IP address 10.100.0.1.
ip netns exec net1 ip route add default via 10.100.0.1ip netns exec net2 ip route add default via 10.100.0.1
These commands add default routes to the net1 and net2 network namespaces, directing traffic to the br00 interface.
ip netns exec net1 ping -c 3 10.100.0.20ip netns exec net2 ping -c 3 10.100.0.10
These commands test the connectivity between the two network namespaces by running ping commands from one namespace to the other.
bash -c 'echo 1 > /proc/sys/net/ipv4/ip_forward'
This command enables IP forwarding by setting the value of the ip_forward sysctl parameter to 1.
iptables -t nat -A POSTROUTING -s 10.100.0.1/16 ! -o br00 -j MASQUERADE
This command sets up NAT (Network Address Translation) using iptables. Specifically, it adds a rule to the nat table (-t nat) that applies to packets with a source IP address of 10.100.0.1/16 (-s 10.100.0.1/16) that are not going out through the br00 interface (! -o br00). The rule action is to apply MASQUERADE (-j MASQUERADE), which modifies the source IP address of the packet to the IP address of the outgoing interface (in this case, br00).
ip netns exec net1 ping -c 3 8.8.8.8
This command tests connectivity to the Google DNS server (8.8.8.8) from the net1 namespace by sending three ICMP echo request packets.
Let's bridge it
Run the bash script using sudo ./bridge.sh up
As we can see that our namespaces are able to communicate with each other through the bridge and after some additional configuration we can also ping the internet through our namespaces.
We can also check our bridge through brctl and ifconfig command like this.
Summary
In this part, we learned how containers talk to each other through virtual ethernet pairs and what is the problem with that approach, and how bridges solve the problem of connecting multiple containers effectively. We understood why a bridge is required, how it works and helps to manage container networking.
But there is something that might be confusing here, why we had to use ip_forwarding and add iptable rule?
This we will explore in the next part of the series so stay tuned for the next part and until then keep growing, keep grinding, and keep learning .... !!
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.


Four Pillars Of Observability in Kubernetes
Rakshit Gondwal — Mon, 01 May 2023 12:30:39 GMT
Kubernetes is a complex system due to its distributed and dynamic nature, managing a large number of containers and services across multiple nodes, which makes it difficult to manually monitor and manage. This is where Observability comes in by providing insight into the health, performance, and behavior of applications and clusters. Before we discuss the Four Pillars of Observability, let's understand what observability means.
Understanding Observability
Observability refers to the ability to gain insight into the state and behavior of the application by collecting and analyzing various types of data. If this sounds overwhelming, let me readily explain you.
Suppose you own a big garage with a lot of cars. Now, you need to observe which car needs repairing or which car has a broken part. This is what observability does in the case of Kubernetes. Consider Kubernetes as a big garage with many nodes, clusters, services, CRD, etc which needs to be observed continuously for better the working of the application. By using special tools, we can check if things are running smoothly or if there are any problems that need fixing. This helps to make sure that everything is working the way it should be, just like how you want all of your cars to work properly.
Observability vs Monitoring
Although observability and monitoring are often mentioned together, they serve different purposes.
Monitoring involves collecting and analyzing data and metrics from the Kubernetes cluster to ensure it's performing as expected. This can include things like checking CPU usage, memory usage, and network traffic of the cluster and its components, such as pods and nodes.
For observability, we can say that it is built on top of monitoring. Observability includes monitoring but goes beyond it to analyze logs, traces, and other techniques that can provide insights into the behavior of the application as a whole.
For example, in Kubernetes, monitoring might involve checking the CPU usage of a pod to ensure it's not running out of resources. Observability, on the other hand, might involve collecting and analyzing logs and metrics from multiple pods, services, and nodes in the cluster to gain insights into the overall health and performance of the entire system.
Four Pillars of Observability
If you know about observability, you might've heard about the Three Pillars of Observability and might be wondering what's the fourth pillar. Well, here are the four pillars of observability:
Metrics
Tracing
Logging
Profiling
Metrics
The definition refers to metrics as the data which is collected from different components of the cluster to monitor and measure the health and performance of the system. Metrics can provide insights into key aspects of the cluster such as CPU usage, memory usage, network traffic, and other performance-related data.
Suppose you want to know the performance of your car or you want to know how far it can travel, that's like a metric. In the same way, metrics will help you to analyze the performance of your cluster or your application.
Tools such as Prometheus, Dynatrace, and Datadag help you to fetch metrics.
Grafana provides a better view of these metrics using dashboards.
Tracing
The definition refers to tracing as the practice of collecting and analyzing data about the requests that flow through your system, in order to identify and diagnose issues with application performance and behavior.
Suppose you are playing a game of tag and someone tags you. Now you need to figure out who tagged you by tracing back to where they came from. That's like tracing in Kubernetes, it helps you figure out what's happening by following the path of something.
Logging
The definition refers to logging as the practice of collecting and storing data about the events and activities that occur within your cluster, in order to monitor and diagnose issues with your applications and infrastructure.
Imagine you have a diary where you write down everything you do each day. That's like logging, it helps you keep track of what you did and when you did it. In Kubernetes, we use logging to keep track of what's happening in the cluster or in our application. For example, if there's an error, we can look at the logs to figure out what went wrong.
Profiling
Profiling has been the latest addition to the pillars of observability. It refers to the practice of analyzing the performance of your applications and cluster, in order to identify areas of inefficiency that may be impacting their performance. Profiling can help you understand how your applications are using resources like CPU and memory, and identify areas where optimization may be needed to improve their performance.
Imagine you have a friend who's really good at drawing, and you want to learn how they do it. You might watch them draw and try to copy their technique. That's like profiling, it helps us understand how something is done by watching it in action. Just like in Kubernetes, we use profiling to figure out how our cluster or our application is working and where we can make it work better.
The above image represents a Flame Graph. A FlameGraph is a way to visualize the profile of an application allowing it to instantly detect the most frequent code path. They can be particularly useful in a Kubernetes environment, where multiple containers are running on a cluster, and you need to identify performance bottlenecks.
Wrapping up
In this blog post, I briefed you about observability, monitoring vs observability, and the four pillars of observability.
In the next blog posts, we will try to dive deep into the four pillars of observability separately with a demo.
You can reach out to me on Twitter or LinkedIn if you would like to have a chat about Cloud Native, Observability, or DevOps.
Thank you for reading :)
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.


Understanding How Containers Work Behind the Scenes
Anuj Chourasia — Sat, 29 Apr 2023 12:30:41 GMT
Containers provide a convenient way to deploy and run applications within their own isolated environment, eliminating the need to create separate virtual machines. However, have you ever questioned the underlying mechanisms that make containers operate seamlessly?
https://giphy.com/gifs/kQOxxwjjuTB7O
 
Containers leverage two key features of the Linux kernel which enables better isolation between the processes:
Namespaces.
Control groups (cgroups).
When a container is launched, Docker generates a unique set of namespaces and cgroups which are allocated specifically to that container.
"Containers are like Russian dolls - they have layers upon layers, and when you open them up, you realize they all contain the same thing, just in different sizes!" ~chatgpt
Let's take a closer look at what these namespaces and cgroups are and how can you create one.
Namespaces and cgroups
Namespaces
Namespace is a feature in Linux that lets you see a specific part of the system, meaning allocating resources in an isolated environment. Namespace allows you to create that isolated environment where the container only knows what it can see because it's only in a certain namespace.
When you initialize a container, docker generates a set of namespaces for the container and every container has its own unique set of namespaces.
There are various types of namespaces with different properties:
User/UIDs namespaces - a container running within the user namespace is isolated from the User IDs and Group IDs of other containers, making them unaware of each other's existence.
UTS namespaces - isolate hostname and domain name information.
IPC namespaces - isolate IPC method.
Net namespaces - isolate network interfaces.
PID namespaces - isolate process IDs.
Mount namespaces - isolate mount points.
Let's create a new namespace using the unshare command:
The unshare command is used to run a program with certain namespaces unshared from the parent process.
$ unshare --mount
This command creates a new mount namespace, which means that the program that is run after the command will have its own view of the file system. The mounts made within the new namespace won't be visible in the parent namespace.
For example, if we mount a file system inside the new namespace, it will not be visible outside of the namespace:
$ unshare --mount /bin/bash# mkdir /mnt/test# mount -t tmpfs none /mnt/test# ls /mnt/test# exit$ ls /mnt/testls: cannot access '/mnt/test': No such file or directory
To start with, we create a new mount namespace by executing the unshare --mount command in this example. We then start a new shell inside the namespace using /bin/bash. Inside the shell, we create a new directory /mnt/test and mount a tmpfs file system on it. To verify the mounting was successful, we list the contents of the directory.
When we exit the shell, we're back in the parent namespace. If we try to access the /mnt/test directory, we get an error because the directory was only visible inside the child namespace.
This is just an example of how you can create a mount namespace, you can also create other namespaces using the unshare command, depending on your need.
Control groups (cgroups)
Cgroups help to limit the use of resources so that a single container is not utilizing all the resources available. It allows managing various system resources such as:
CPU - limit CPU utilization.
Memory - limit memory usage.
Disk I/O - limit disk I/O.
Network - limit network bandwidth.
With the help of cgroups docker engine helps to share available hardware resources with the container and puts a limit on how much resources the container can use.
Let's create a new cgroup named "mygroup" to manage CPU resources using cgcreate command:
1. First, create a new directory for the cgroup:
$ sudo mkdir /sys/fs/cgroup/cpu/mygroup
A folder named "mygroup" is created on the above path.
All these different files define the limit on the CPU.
2. Then, assign the "cpu" subsystem to the new cgroup:
$ sudo sh -c "echo $$ > /sys/fs/cgroup/cpu/mygroup/cgroup.procs"
This command adds the current shell process ID to the "cgroup.procs" file of the "mygroup" cgroup, indicating that it belongs to this cgroup.
3. Next, set the CPU usage limit for the cgroup:
$ sudo cgset -r cpu.cfs_quota_us=50000 mygroup
This command sets the maximum CPU usage to 50% of one CPU core for the "mygroup" cgroup.
Now, any processes that are added to the "mygroup" cgroup will have their CPU usage limited to the specified value. For example, using the cgexec command, you can start a new process in the cgroup.
$ sudo cgexec -g cpu:mygroup /bin/bash
This command starts a new shell process in the "mygroup" cgroup with CPU usage limited to the value set earlier.
The cgcreate, cgset, and cgexec commands are part of the libcgroup package, which you may need to install before using them.
Here's an example of how to run a Docker container with a CPU limit of 50% of one CPU core:
$ docker run --cpus 0.5 
This command starts a new container with the specified image and limits its CPU usage to 50% of one CPU core.
Conclusion
In this article we saw what namespaces and cgroups are and by using these features how can docker create a unique set of namespaces and cgroups for each container, making it possible to run multiple containers on a single host machine without any problems.
I hope you understood how docker works behind the scenes 😁.
Also, check out this wonderful hands-on workshop by Chad to learn more about Linux and Docker :)
https://www.youtube.com/watch?v=EUu1E_YKGTw&t=12948s


Implementing Kubernetes Network Policies: A Comprehensive Guide
Srinivas Karnati — Fri, 07 Apr 2023 12:30:39 GMT
Network policies are networking rules in Kubernetes that will allow you to specify how the pod can communicate with other objects.
Network policies are not mandatory to establish communication with pods. However, network policies will add one more layer of network security and help control traffic at Layers 3 and 4 (IP and ports).
The idea of network policy is born in the networking SIG group of Kubernetes in 2015. It was included in the alpha release of v1.2 (2016) and moved to beta in v1.3(July 2016). Below is the proposal for the Network policy by Casey Davenport.
https://github.com/kubernetes/kubernetes/pull/24154
 
It is released stable as part of release v1.7 (June 2017).
What are the things that you can control using Network policies?
Pods that are allowed to communicate with other pods.
Namespaces that are allowed.
IP Blocks for connection.
Policies are applied based on Selectors and CIDR range:
When dealing with pods or namespaces, we use selectors to identify the resources for which our policy needs to be applied.
When we want to create policies that need IP range restrictions, we use CIDR ranges.
CNIthe mandatory element
Network policies need a network plugin. You must need a network plugin that supports Network Policy. Without a suitable plugin, all your network policies are of no use. Even if you apply a network policy without a supported CNI configured, those network policies didnt affect any traffic. Some of the CNI plugins that support network policies are:
Weave
Calico
Cilium
Romana
Some points to note:
Ingress and Egress:
Ingress means the traffic that is entering the Pod. Similarly, egress is the traffic that is leaving the pod.
Default = Allow All:
By default, the pod allows all ingress and egress. It means it has no restrictions for both inbound and outbound traffic.
No Deny Rules:
There are no denied rules in Network policies. You can only specify traffic to be allowed and the rest is denied (you cant write what to deny). You cannot get in if traffic is not allowed on the policy.
empty selector:
An empty selector means everything. If PodSelector:{} is mentioned, it will select all the pods in the namespace.
Null selector:
If the policy contains a Null selector [], means it is not selecting anything (so all traffic is blocked).
Policies are 'OR'ed:
Network policies are additive. If multiple policies are applied to a single pod, all the policies are ORed.
Network policies do not conflict; they are additive. If any policy or policies apply to a given pod for a given direction, the connections allowed in that direction from that pod are the union of what the applicable policies allow. Thus, the order of evaluation does not affect the policy result.
Network policy is namespace scoped:
Network Policies are scoped to the namespace, which means it will affect the traffic of the pods in the namespace at which the policy is applied.
Network Policy Resource definition:
A sample NetworkPolicy look like this:
apiVersion: networking.k8s.io/v1kind: NetworkPolicymetadata:  name: default-deny-allspec:  podSelector: {}  policyTypes:  - Ingress  - Egress  ingress: {}  egress: {}
Just like any other Kubernetes resource, A NetPol (simplification of Network Policy) needs apiVersion, kind, and metadata fields. The spec section is also like any other k8s resource, mentions all the information needed for the Network policy.
podSelector:
Every Network Policy definition contains the podSelector field which defines which pods need to select. In the above policy definition, we used an empty selector {} means the policy will select all the pods in the namespace.
If we have to select the pods with specific labels, we use matchLabels to mention the labels.
......podSelector:    matchLabels:       role: db #Selects the pods with labels "role = db......
policyTypes:
This field indicates the traffic flow direction in which the policy will be applied. It can be Ingress, Egress, or both. If no policyTypes are specified on a NetworkPolicy then by default Ingress will always be set and Egress will be set if the NetworkPolicy has any egress rules.
Actual rules:
Each network policy has a rules section named ingress and egress based on the policy type you mentioned. These sections define the actual rules that need to be satisfied before the traffic is allowed to your pod.
ingress: This section contains the ingress-allowed rules. It has sections from and ports which defines from which pod/namespace/ipBlock traffic is allowed at which port. In the above sample policy, we have used an empty selector, so it will allow all ingress.
egress: Just like ingress, this section contains the egress-allowed rules. It has sections to and ports which defines to which pod/namespace/ipBlock traffic is allowed at which port.
Now that we know the basic concept of NetworkPolicy, let us create some scenarios to learn a bit more about them.
Scenario 1:
Use a network policy to restrict ingress traffic for pods with labels "type: critical" only to allow traffic from pods with labels "access: approved".
We have already created the pods that we will be using for this scenario.
By default, if no Network Policy is applied all the pods can access other pods using IP or Service DNS name.
We don't want this to happen, so let's create a Network Policy that will limit the ingress traffic to pods with labels "type: critical" to only allow traffic from pods with "access: approved" labels.
From the scenario statement, we see that the policy needs to apply to pods with labels type: critical (It is the podSelector section).
Also, we can see that it needs to ingress traffic from pods with labels access: approved (so we clearly have an ingress rule here).
Combining the above two points, we can have the following manifest.
apiVersion: networking.k8s.io/v1kind: NetworkPolicymetadata:  name: restrict-critical-ingressspec:  podSelector:    matchLabels:      type: critical  ingress:  - from:    - podSelector:        matchLabels:          access: approved
But we are still not sure if it works as we intended or not. Let's apply it and test it out.
Tried to access the pod with type: critical from pod with label access: approved. It can access the pod.
  
You probably wondering what's so interesting here !!! It has access earlier too. Yes, it has access earlier too, but earlier the test-pod also has access to the critical-pod. But let us try it now.
  
As you can see, the test-pod can't access critical-pod now. But why? It is happening because we did not mention any ingress rule for test-pod access critical-pod. So it failed to access.
Our Network Policy works 🥳. This way you can use Network policies to limit the traffic to/from the pods in a namespace.
Scenario 2:
Consider a situation where you deployed pods in several namespaces, but you don't want all of them to have access to each other and you have several restrictions on which pods and which ports need to have access. Will Network Policies help in such cases?
Let's find it out.
For this scenario, We have deployed pods in alpha,beta and gamma namespaces. We want our pods in alpha namespace with labels type: kubesimplify needs to be accessed from all pods from the namespace gamma and also pods with labels access: community. We don't want any Egress connectivity to the pod except at port 53.
  Sounds complex? It is not. Let us break down the entire problem statement into simple rules.
Namespace in which the policy needs to be applied is the namespace in which the pod is deployed = alpha
Policy needs to be enforced on pods with labels type: kubesimplify . So podSelctor sections use these labels.
Pods in the namespace gamma need to have access and also pods with labels access: community should have access. So ingress from rules would be as follows.
  ...  ingress:      - from:         - podSelector:             matchLabels:               access: community           namespaceSelector: {} #These pods can be in any namespace      - from:         - namespaceSelector:             matchLabels:               name: gamma  ...
All egress connectivity is restricted except at port 53. So our egress rule can be written as follows:
  egress:   -  ports:        - protocol: UDP          port: 53        - protocol: UDP          port: 53
    Combining all the above rules into a single manifest will look as follows.
    apiVersion: networking.k8s.io/v1    kind: NetworkPolicy    metadata:      name: kubesimplify-access      namespace: alpha    spec:      podSelector:        matchLabels:          type: kubesimplify      ingress:      - from:        - namespaceSelector: {}          podSelector:            matchLabels:              access: community        - namespaceSelector:            matchLabels:              kubernetes.io/metadata.name: gamma      egress:      - ports:          - protocol: TCP            port: 53          - protocol: UDP            port: 53      policyTypes:      - Ingress      - Egress
    I have applied the above Network Policy. To see more information about policy details, use kubectl describe netpol  .
Let us perform a connectivity test to check if the policy is working as expected or not.
Pods in gamma the namespace should have access to kubesimplify pod which has the label type=kubesimplify.
 
Pods with the label access=community should have access to kubesimplify pod.
 
All the traffic which is not part of the policy should be denied.
Egress connectivity of pod with label type=kubesimplify should be restricted except on port 53 (nslookup uses port 53).
 
Our policy works as expected.
These scenarios only explain the basic use cases of Network Policies. We can also implement network policies to allow traffic from a particular range of IPs and ports.
Things you can't do:
Although Network Polices helps to control the traffic that is accessing the pods, it still cannot perform several things (not yet) such as:
It cannot do anything related to TLS, you might need to use additional resources for that.
You cannot target the services using their name.
You cannot create default policies that will be applied to all namespaces and pods.
You cannot log the network security events such as which requests are blocked, allowed, etc.
You cannot directly write the deny rule in the policy.
To know more restrictions visit Kubernetes docs
Additional Resources:
Kubernetes docs
https://editor.networkpolicy.io/ - To create network policies using the visual editor.
https://github.com/ahmetb/kubernetes-network-policy-recipes
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.


How get started with Hashicorp Vault🛡️
Dipankar Das — Wed, 05 Apr 2023 12:30:39 GMT
Introduction
It is a tool for managing secrets and sensitive data in modern computing environments. Especially in a dynamic environment. Which can be accessed using API, CLI as well as web app also if you don't want to manage the infrastructure on your own then you can use the SaaS offering by HashiCorp
Why?
In earlier app development we have to manually update the Secret / API Tokens which means human intervention for Key Rotation and updating the App Source Code's configurations. But, using the vault the app reaches out to the vault for fetching the token by providing valid authentication, and then the token will be used for authorization purposes, getting Required Secret Keys, and more. Tokens are frequency rotated based on max TTL(Time To Live) (i.e. Dynamic secrets 😉 ) hence taking out the logic for managing the key rotation out of apps hence better security management and lowering the development overhead for adding security functionality
https://media.giphy.com/media/v1.Y2lkPTc5MGI3NjExZjhjMGVlMDQwMTVlMDQ3ZWViZDgwMDllZjZkYTg2MmEyNzIzN2IyYyZjdD1n/Tk7IIQTldNwllIRId2/giphy.gif
 
How to Install
There are various methods:
binary
package manager
If you have brew
brew tap hashicorp/tapbrew install hashicorp/tap/vault
👉 Download Link
How to start the vault in dev mode 
Running development/testing mode
vault server -dev
then you need to set the required export for vault_addr and login as a root user using ROOT token. and check the status of the vault cluster which says the following:
the Cluster is unsealed (seal -> cluster is running it requires the keys to decrypt the master key which in itself decrypts the backend storage)
backend storage type is inmem which is in-memory
seal type is shamir
Note: ROOT TOKEN must be saved in highly secured environment
Now let's do it in prod environment 🏭
vault server -config=config.hcl# here the config.hcl will contain the required config for vault to run in production mode
Vault config (Example)📃
# for vault config -> /etc/vault.d/vault.hclstorage "consul" {  address = "127.0.0.1:8500"  path    = "vault/"  token   = "XXXXXX-yyyy-zzz-aaaa-BBBBBB"}listener "tcp" { address = "0.0.0.0:8200" cluster_address = "0.0.0.0:8201" tls_disable = 0 tls_cert_file = "/etc/vault.d/client.pem" tls_key_file = "/etc/vault.d/cert.key" tls_disable_client_certs = "true"}seal "awskms" {  region = ""  kms_key_id = "",  endpoint = "example.kms..vpce.amazonaws.com"}api_addr = "https://vault-us-east-1.example.com:8200"cluster_addr = " https://node-a-us-east-1.example.com:8201"cluster_name = "vault-prod-us-east-1"ui = truelog_level = "INFO"license_path = "/opt/vault/vault.hcl"disable_mlock=true
Key points
Backend storage - consul which is another storage which is a product of Hashicorp, and various field can be checked out 👉 Docs link
Listener method - tcp means tcp connection as Transport Layer protocol 👉 Docs link
Seal - we are using awskms for automatic unseal of the vault cluster 👉 Docs link
UI - to set whether we want UI interface for the cluster
and more option 👉 Docs link
What is AutoSeal, Why do we need and how to config? 📝
Vault unseal operation requires a quorum of existing unseal keys split by Shamir's Secret sharing algorithm. This is done so that the "keys to the kingdom" won't fall into one person's hand.
As, this process is manual and can become painful when you have many Vault clusters as there are now many different key holders with many different keys.
Instead of using shared keys are replaced by the vault transit key in a ault secrets engineor instead of creating another vault cluster for maintaining this process we can make some trusted cloud environment handle it for us like awskms
for instance, when awskms is configured, the vault cluster during init stage creates and transfers data between kms and the cluster to store the vault keys. After any further shutdown or restart of cluster, it will reach out the awskms for unseal of cluster. It is an automatic process
seal "awskms" {  #......}# Add this block to /etc/vault.d/vault.hcl
For more info on this topic 👉 Refer Docs link
What is Backend, Why do we need and how to config? 📝
In vault language storage backend is untrusted storage where only encrypted data is stored and all the logic of encrypt and decrypt are inside a confined layer inside Vault cluster, whenever the data goes out of cluster it's always encrypted. Logging is given more priority than Performace as it's more important to log the event for a later security audit than to log after the data is processed. If the event is unable to be logged, vault does not process the event and throws an error
So in a nutshell, the storage backend is the location for the durable storage of Vault's information. When selecting which backend to use, remember each backend has pros, cons, advantages, and trade-offs.
For more info on this topic 👉 Refer Docs link
Let's play around and learn 🏏
Let's get started with configuring the vault server
Prerequisites
Vault binary
root access to the server
AWS account
server is authenticated to AWS cloud
Step 1: Setup AWS Credentials 🛶
Install the credentials using AWS CLI or directly create a file
cat < ~/.aws/credentials[default]aws_access_key_id = AABBCCXyzaws_secret_access_key = Abdcdcdiweif43323
Now the AWS credential part is ready!
Step 2: Make the installed vault package to start automatically by systemd 🚤
let's move forward and add systemd file to make automatic startup of vault binary on startup
sudo sucat << EOF > /usr/lib/systemd/system/vault.service[Unit]Description="HashiCorp Vault - A tool for managing secrets"Documentation=https://www.vaultproject.io/docs/Requires=network-online.targetAfter=network-online.targetConditionFileNotEmpty=/etc/vault.d/vault.hcl  # importantStartLimitIntervalSec=60StartLimitBurst=3[Service]User=root # importantGroup=root # importantProtectSystem=fullProtectHome=read-onlyPrivateTmp=yesPrivateDevices=yesSecureBits=keep-capsAmbientCapabilities=CAP_IPC_LOCKCapabilities=CAP_IPC_LOCK+epCapabilityBoundingSet=CAP_SYSLOG CAP_IPC_LOCKNoNewPrivileges=yesExecStart=/usr/bin/vault server -config=/etc/vault.d/vault.hcl # importantExecReload=/bin/kill --signal HUP $MAINPIDKillMode=processKillSignal=SIGINTRestart=on-failureRestartSec=5TimeoutStopSec=30StartLimitInterval=60StartLimitIntervalSec=60StartLimitBurst=3LimitNOFILE=65536LimitMEMLOCK=infinity # important[Install]WantedBy=multi-user.targetEOFsystemctl daemon-reload
As now the startup script is ready, we can now config the vault server settings
Step 3: Create AWS S3 bucket for storage of the vault 🛥
We are using cloud-based storage as it is highly available. No data loss
with the name "" or make it something unique as it will be our storage with default settings
Make sure you allow all public access
NOTE: only for demonstration purpose
  
Link for how to create 👉 S3 bucket
Once created the bucket it will be empty
copy the bucket name
Step 4: Create a key in AWS KMS for AutoSeal 
It will help a lot when you vault server/cluster restarts, or you start it after shutdown it will automatically unseal it
Go to AWS KMS dashboard
Click on the customer-managed key
Select the checkbox to enable a specific IAM user to have permission to delete the KMS key
Save the KMS ID and region where its created
Step 5: Create an Endpoint in VPC (Regional based service) to access the key(s) 🚢
As the endpoint is created we can now be able to use the KMS service using that VPC
Note: Make sure KMS and endpoint are in the same region
Step 6: vault configuration 🛳
cat < /etc/vault.d/vault.hcl# Full configuration options can be found at https://www.vaultproject.io/docs/configurationui = true#mlock = true#disable_mlock = truedisable_mlock = true#storage "file" {#  path = "/home/dipankar/vault/data"#}storage "s3" {  bucket = ""}# HTTP listener#listener "tcp" {#  address = "127.0.0.1:8200"#  tls_disable = 1#}listener "tcp" {  address       = "0.0.0.0:8200"  tls_disable = 1  # no TLS i.e. no HTTPS  tls_cert_file = "/etc/vault.d/vault.crt"  tls_key_file  = "/etc/vault.d/vault.key"  tls_disable_client_certs = "true"}seal "awskms" {  region = ""  kms_key_id = ""  endpoint = "kms..amazonaws.com"}EOF
Let's bootup the vault 🤞
sudo systemctl start vaultsudo systemctl enable vaultsudo systemctl stop vault # to stop the vault server# vault is uninitialized and sealedvault status# lets initialize the vault servervault operator init # it will output the root login token(KEEP IT SAVED)# so now the vault must be initialized and unsealedvault status# loginvault login # In order to use login method other than token and it is# userpass then we first need to enable the auth methodvault auth enable userpass# now we need to specify the nwe username and passwordvault write auth/userpass/users/ password=# Lets add a new secretvault secrets enable kv# In next section we will enable specific auth method and policies
login into specific userpass user
Created the secret key and how to access them
Turn on Specific auth method and policies
# for restrictive access for the user to kv store onlycat < user-policy.hclpath "kv/*" {  capabilities = ["list", "read", "update", "delete"]}path "kv" {  capabilities = ["list", "delete"]}EOFvault policy write user-dipankar user-policy.hclvault write auth/userpass/users/dipankar policy=user-dipankar password=1234vault secrets enable kv
the above policy user-dipankar will allow user who has this policy attached to read, update, list, and delete permissions in the path kv/......
but you may ask them to then allow kv path its because to query the secret the user must have permission to access kv path i.e. able to see kv option in UI and then can navigate to other subdirectories
For more info on CLI commands, do refer 👉 CLI Docs link
Update for HTTPS 🛡 (self-signed cert)
#!/bin/bashcd /etc/vault.dopenssl req -new -newkey rsa:4096 -x509 -sha256 -days 365 -nodes -out vault.crt -keyout vault.key# it will create the vault.crt and vault.key# now make changes to the vault.hcllistener "tcp" {  address = "0.0.0.0:8200"  tls_cert_file = "/etc/vault.d/vault.crt"  tls_key_file  = "/etc/vault.d/vault.key"  tls_disable_client_certs = "true"}
if you want to use that commonName you can edit the /etc/hosts file
End remarks 🪂
I hope you liked this blog. If any mistake, do comment down below 🚀.
Do like and share this blog
https://media.giphy.com/media/RlrcXMffVZaouUVPGD/giphy.gif
 
Here are my socials:
Twitter https://twitter.com/DipankarDas011
Linkedin https://www.linkedin.com/in/dipankar-das-1324b6206/
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Iptables Firewall Demo: Learn How to Secure Your Network
Arnav Barman — Wed, 29 Mar 2023 12:30:39 GMT
Introduction
This hands-on tutorial on creating your own firewall rules is in continuation of my previous blog. So if you haven't already read that, here's the link:
https://blog.kubesimplify.com/firewall-a-networks-gatekeeper
 
💻IPTABLES firewall in Linux
The Linux kernel firewall can be configured by system administrators using iptables, a user-space utility program. It is a crucial tool for Linux users as it offers a robust firewall to safeguard against malicious attacks. It can be used for managing network traffic efficiently.
Netfilter modules are used to implement the firewall, which contains filters organized into various tables. These tables contain chains of rules for handling network traffic packets. Establishing these rules for incoming and outgoing network traffic gives you complete control over the packets allowed through or blocked from entering or leaving your system based on source and destination addresses, ports, and other criteria. Iptables even permit the creation of custom rules for specific services or applications, granting you greater flexibility in managing your network. Additionally, it helps ensure that only authorized users can access specific parts of your network.
A comprehensive understanding of iptables and their proper use can significantly enhance your system's security against potential attackers. As a Linux system administrator, IPTABLES is an essential part of your toolkit for providing secure access control to your network.
Traversing chains and Rule matching
Now that you've got a basic understanding of iptables, it's time to dive deeper into the world of network security and explore the intricacies of traversing chains and rule matching. These concepts are at the heart of how iptables filters and manages network traffic, allowing you to specify precisely which packets should be allowed or blocked based on a set of rules. By understanding how chains and rule matching work in iptables, you can gain greater control over your network traffic and ensure that only authorized traffic is allowed through.
In iptables, a chain is a collection of rules applied to incoming or outgoing network traffic. When a packet arrives on a network interface, iptables checks the packet against the rules in the appropriate chain and either accepts, rejects, or modifies the packets based on the rules.
There are three main types of chains in iptables: the INPUT chain, the OUTPUT chain, and the FORWARD chain. The INPUT chain is used to filter incoming traffic to the local system, while the OUTPUT chain filters outgoing traffic from the local system. The FORWARD chain is used to filter traffic that is being routed through the system, such as traffic between two different networks. In addition to these three main chains, several predefined chains are used for specific purposes, such as the PREROUTING and POSTROUTING chains, which are used for network address translation (NAT), and the mangle chain, which is used for packet modification. Finally, you can also create your own custom chains, which can organize your rules into logical groups, simplify your rule sets, and make your iptables configuration more manageable. By understanding the different types of chains in iptables, you can gain greater control over your network traffic and create a more secure and stable network environment.
Now, Rule matching is a crucial aspect of iptables that enables you to define precisely which packets should be accepted or blocked based on a set of criteria. When a packet arrives on a network interface, iptables checks the packet against a series of rules to determine what actions should be taken. Each rule specifies a set of conditions that must be met for the rule to match, such as the source or destination IP address, the protocol, the port number, or other attributes of the network traffic. If a packet matches a rule, iptables will apply the action specified in the rule, such as accepting, rejecting, or modifying the packet.
Rule matching in iptables is incredibly flexible and powerful, allowing you to define complex rules that filter traffic based on multiple criteria. You can also use various operators and modifiers to refine your rules further, such as negating a condition, using regular expressions, or applying a rule only to certain network interfaces. By understanding how rule matching works in iptables, you can create a more effective and efficient firewall configuration that provides the level of network security that you require. Whether you're a Linux administrator, a network engineer, or just a curious user, iptables' rule-matching capabilities offer a valuable tool for managing your network traffic and keeping your systems secure.
Extension Modules
Iptables offers more advanced functionality to your firewall configuration with its extension modules. These modules act as plugins that enhance the built-in chains and rule-matching capabilities of iptables. By using these extension modules, you can add more features and options to your firewall configuration beyond what the base system provides.
A wide range of extension modules is available for iptables, each of which provides a different set of features and capabilities. Some examples of commonly used extension modules include the conntrack module, which enables iptables to track and manage network connections, and the limit module, which allows you to limit the rate of incoming or outgoing network traffic.
By leveraging extension modules, you can extend the functionality of iptables to meet your specific network security needs and gain greater control over your network traffic. Whether you're a system administrator, a developer, or just a curious user, iptables' extension modules offer a powerful tool for customizing your firewall configuration and creating a more secure and reliable network environment.
Some rules to make you familiar with Iptables.
Installing iptables.
 sudo apt install -y iptables // The command installs the iptables package on the system without requiring any confirmation.
Delete all existing rules.
 iptables -F  //-F: a parameter that stands for "flush". It deletes all the rules from the selected chain (or all chains if none is given).
Show the current rules.
 iptables -L  //-L: a parameter that stands for "list".  //The command lists all the rules in the current iptables configuration. It shows the table name, chain name, target name, and any additional parameters for each rule.
 
Set default chain policies to DROP (drops all packets).
 iptables -P INPUT DROP iptables -P FORWARD DROP iptables -P OUTPUT DROP //-P: a parameter that stands for "policy". It sets the default policy for the selected chain to the specified target. //INPUT, FORWARD, OUTPUT: Chain names. //DROP: Command to drop all traffic matching the given rule.
Block a specific IP address.
 BLOCK_THIS_IP="x.x.x.x" //variable, it could be a single IP address (e.g., 192.168.1.2) or a network address (e.g., 192.168.1.0/24). iptables -A INPUT -s "$BLOCK_THIS_IP" -j DROP //-A: a parameter that stands for "append". It adds a new rule at the end of the selected chain. //-s: a parameter that stands for "source". It specifies the source IP address or network of the traffic to be matched. //-j: a parameter that stands for "jump". It specifies the target action to take when the rule is matched.
MultiPorts (Eg. To allow incoming SSH, HTTP, and HTTPS).
 iptables -A INPUT -i eth0 -p tcp -m multiport --dports 22,80,443 -m state --state NEW,ESTABLISHED -j ACCEPT iptables -A OUTPUT -o eth0 -p tcp -m multiport --sports 22,80,443 -m state --state ESTABLISHED -j ACCEPT //i: a parameter that stands for "input interface". It specifies the network interface through which the incoming traffic is expected to arrive. Similarly 'o' for "output interface". //eth0: the name of the network interface through which the incoming traffic is expected to arrive. //tcp: the transport protocol used by the traffic. //-m: a parameter that stands for "match". It specifies additional criteria to be matched by the traffic. //multiport: a match extension that allows matching multiple destination ports. //--dports: a parameter that stands for "destination ports". It specifies the destination port or ports to which the traffic is expected to arrive. Similarly '--sports' for "source ports". //--state: a match extension that allows matching the connection state of the traffic. //ACCEPT: Command to accept all traffic matching the given rule.
Allow outgoing SSH only to a specific network.
 iptables -A OUTPUT -o eth0 -p tcp -d 192.168.1.0/24 --dport 22 -m state --state NEW,ESTABLISHED -j ACCEPT iptables -A INPUT -i eth0 -p tcp --sport 22 -m state --state ESTABLISHED -j ACCEPT
Load balancing the incoming HTTPS traffic.
 iptables -A PREROUTING -i eth0 -p tcp --dport 443 -m state --state NEW -m nth --counter 0 --every 3 --packet 0 -j DNAT --to-destination 192.168.1.101:443 iptables -A PREROUTING -i eth0 -p tcp --dport 443 -m state --state NEW -m nth --counter 0 --every 3 --packet 1 -j DNAT --to-destination 192.168.1.102:443 iptables -A PREROUTING -i eth0 -p tcp --dport 443 -m state --state NEW -m nth --counter 0 --every 3 --packet 2 -j DNAT --to-destination 192.168.1.103:443 //-m nth: a match extension that allows matching packets based on their sequence number. //--counter 0: a parameter that specifies the starting sequence number for matching packets. //--every 3: a parameter that specifies that every third packet should match the rule. //--packet 0: a parameter that specifies that the first packet should match the rule (since we specified a starting counter of 0). //-j DNAT: a parameter that stands for "jump destination NAT". It specifies the target action to take when the rule is matched, which is to perform Destination Network Address Translation (DNAT). //--to-destination: a parameter that specifies the new destination IP address and port for the traffic. //the command adds a new rule to the end of the PREROUTING chain that redirects every third incoming TCP packet on port 443 (HTTPS) through the eth0 network interface to the IP address 192.168.1.101/102/103 respectively and port 443 using DNAT.
Ping from inside to outside.
 iptables -A OUTPUT -p icmp --icmp-type echo-request -j ACCEPT iptables -A INPUT -p icmp --icmp-type echo-reply -j ACCEPT //-p icmp --icmp-type echo-request: match the ICMP echo-request packets. //-p icmp --icmp-type echo-reply: match the ICMP echo-reply packets.
Ping from outside to inside.
iptables -A INPUT -p icmp --icmp-type echo-request -j ACCEPTiptables -A OUTPUT -p icmp --icmp-type echo-reply -j ACCEPT
Allow loopback access.
iptables -A INPUT -i lo -j ACCEPTiptables -A OUTPUT -o lo -j ACCEPT//-i lo: match traffic coming in from the loopback interface (i.e., lo).//-o lo: match traffic going out from the loopback interface (i.e., lo).//These commands allow loopback traffic (i.e., traffic between applications running on the same machine) to pass through the firewall, as loopback traffic doesn't need to be filtered.
Allow packets from the internal network to reach the external network.
iptables -A FORWARD -i eth0 -o eth1 -j ACCEPT//if eth1 is connected to external network (internet)//if eth0 is connected to internal network (192.168.1.x)
Allow outbound DNS.
iptables -A OUTPUT -p udp -o eth0 --dport 53 -j ACCEPT iptables -A INPUT -p udp -i eth0 --sport 53 -j ACCEPT//-p udp: match UDP traffic.//These commands allow DNS traffic to pass through the firewall, as DNS queries and responses use UDP port 53 by default. The first command allows DNS traffic to leave the machine through the eth0 interface, while the second command allows DNS traffic to enter the machine through the same interface.
Port forwarding from 443 to 80.
iptables -t nat -A PREROUTING -p tcp -d 192.168.1.2 --dport 443 -j DNAT --to 192.168.1.2:80//--to 192.168.1.2:80: change the destination port of the traffic to port 80 (HTTP) on the same IP address. This is useful when you want to redirect HTTPS traffic to HTTP for a web server.iptables -A INPUT -i eth0 -p tcp --dport 443 -m state --state NEW,ESTABLISHED -j ACCEPT//This command allows new or existing HTTPS traffic coming into the machine through the eth0 interface to pass through the firewall.iptables -A OUTPUT -o eth0 -p tcp --sport 443 -m state --state ESTABLISHED -j ACCEPT//This command allows existing HTTPS traffic going out of the machine through the eth0 interface to pass through the firewall.
Log the dropped packets.
iptables -N LOGGING//This command creates a new chain that we can use to log traffic that matches certain criteria.iptables -A INPUT -j LOGGING//This command adds a new rule to the end of the INPUT chain that will jump to the LOGGING chain for any traffic that doesn't match any of the previous rules.iptables -A LOGGING -m limit --limit 2/min -j LOG --log-prefix "IPTables Packet Dropped: " --log-level 7//-m limit --limit 2/min: limit the rate at which log messages are generated to 2 per minute.//-j LOG: log the traffic.//--log-prefix "IPTables Packet Dropped: ": prefix the log message with the specified string.//--log-level 7: set the log level to debug.iptables -A LOGGING -j DROP//This command adds a new rule to the end of the LOGGING chain that will drop any traffic that reaches it. This is useful for blocking traffic that doesn't match any of the other rules and for which you don't want to generate log messages.
Hands-on Experimenting with firewalls
🧱Stateless Firewall
In this hands-on demo, we will create a stateless firewall using iptables. A stateless firewall is a type of firewall that inspects each network packet independently without considering the state of the connection. This is in contrast to stateful firewalls that keep track of the state of network connections to determine which packets to allow or block. The advantage of a stateless firewall is its simplicity and low overhead.
Steps to be followed
Set default policies for INPUT, FORWARD, and OUTPUT chains to DROP
Allow traffic on the loopback interface
Allow traffic to/from specified ports or services
Block traffic to/from specified IP addresses or range
Code
// Set default policies for INPUT, FORWARD, and OUTPUT chains to DROPsudo iptables -P INPUT DROPsudo iptables -P FORWARD DROPsudo iptables -P OUTPUT DROP// Allow traffic on loopback interfacesudo iptables -A INPUT -i lo -j ACCEPTsudo iptables -A OUTPUT -o lo -j ACCEPT// Allow traffic to/from specified ports or servicessudo iptables -A INPUT -p tcp -m multiport --dports 22,23,53 -j ACCEPTsudo iptables -A OUTPUT -p tcp -m multiport --sports 22,23,53 -j ACCEPT// Block traffic to/from specified IP addresses or rangessudo iptables -A INPUT -s  -j DROPsudo iptables -A OUTPUT -d  -j DROP
Testing the Firewall
To test the firewall, we can use various tools like: ping and ssh to see if the traffic is allowed or blocked based on our rules. Here are some examples:
Ping loopback interface
 ping 127.0.0.1 //Expected result: Packets should be sent and received successfully.
 
Ping external IP address
 ping google.com //Expected result: Packets should be dropped due to the default DROP policy.
 
SSH into the machine
 ssh user@ //Expected result: Connection should be established successfully.
You can also check the status of the firewall by running the iptables -L command, which lists all the rules currently defined in the firewall. Note that this is not an exhaustive set of rules, and you may need to customize the rules based on your specific requirements.
Conclusion
This demonstration showcases the development of a stateless firewall with iptables. Our configuration includes rules that enable or restrict traffic based on criteria such as source and destination IP addresses, ports, and protocols. We have verified the effectiveness of our firewall by conducting tests with multiple tools.
🧱Stateful Firewalls
In this hands-on demo, we will create a stateful firewall using iptables. A stateful firewall is a type of firewall that keeps track of the state of network connections to determine which packets to allow or block. This allows the firewall to recognize legitimate traffic and prevent malicious traffic from entering the network.
Steps to be followed
Set default policies for INPUT, FORWARD, and OUTPUT chains to DROP
Allow traffic on the loopback interface
Allow traffic related to established connections
Allow traffic to/from specified ports or services
Block traffic to/from specified IP addresses or ranges
Allow traffic from established connections and their related traffic
Code
// Set default policies for INPUT, FORWARD, and OUTPUT chains to DROPsudo iptables -P INPUT DROPsudo iptables -P FORWARD DROPsudo iptables -P OUTPUT DROP// Allow traffic on loopback interfacesudo iptables -A INPUT -i lo -j ACCEPTsudo iptables -A OUTPUT -o lo -j ACCEPT// Allow traffic related to established connectionssudo iptables -A INPUT -m conntrack --ctstate RELATED,ESTABLISHED -j ACCEPTsudo iptables -A OUTPUT -m conntrack --ctstate RELATED,ESTABLISHED -j ACCEPT// Allow traffic to/from specified ports or servicessudo iptables -A INPUT -p tcp -m multiport --dports 22,80,443 -m conntrack --ctstate NEW,ESTABLISHED -j ACCEPTsudo iptables -A OUTPUT -p tcp -m multiport --sports 22,80,443 -m conntrack --ctstate ESTABLISHED -j ACCEPT// Block traffic to/from specified IP addresses or rangessudo iptables -A INPUT -s 192.168.1.100 -j DROPsudo iptables -A OUTPUT -d 192.168.1.100 -j DROP// Allow traffic from established connections and their related trafficsudo iptables -A INPUT -m conntrack --ctstate NEW,ESTABLISHED -j ACCEPTsudo iptables -A OUTPUT -m conntrack --ctstate NEW,ESTABLISHED -j ACCEPT
Testing
To test the firewall, we can use various tools like ping and telnet to see if the traffic is allowed or blocked based on our rules. Here are some examples:
Ping loopback interface
 ping 127.0.0.1 //Expected result: Packets should be sent and received successfully.
 
SSH into the machine
 ssh user@ //Expected result: Connection should be established successfully.
Make a request to a website.
 curl google.com //Expected result: Connection should be established, and the content of the page should be displayed.
 
Make a request to a blocked IP address
 curl 192.168.1.100 //Expected result: The connection should be dropped due to the block rule.
 
Conclusion
In this demo, we have created a stateful firewall using iptables. We have defined rules to allow or block traffic based on different criteria like source/destination IP addresses, ports or services, and connection states. Testing the firewall with different tools and scenarios verified that the rules were working as expected.
It is important to note that this is a very basic example, and many other rules and configurations can be added to create a more secure and customized firewall. Additionally, iptables can be complex and confusing for those who are not familiar with it, so it is important to thoroughly test and understand the rules before implementing them in a production environment.
🧱Application Firewall & Web Proxy
So, there's this thing called an application firewall that works at the app level. It either lets or stops traffic depending on the app-specific protocols or commands. Then there's a web proxy, which is like a middleman between clients and servers. It sends client requests to the right server and gives the server's response back to the client. Cool, right?
The optimal way to structure an application firewall and web proxy utilizing iptables is as follows:
Set up iptables rules to redirect traffic from the client to the proxy server. This can be done by configuring a DNAT rule for the port and protocol used by the application.
Install a web proxy server, such as Squid, on the proxy server to receive the redirected traffic.
Configure the web proxy server to listen for incoming traffic on the redirected port.
Set up iptables rules to redirect HTTP/HTTPS traffic to the web proxy server.
Configure the web proxy server to inspect the traffic for security threats or other issues. This can be done by enabling various security features such as SSL interception, IP whitelisting, and content filtering.
Set up iptables rules to block or allow traffic based on the web proxy server's security policies. For example, traffic from blacklisted IPs or containing malicious payloads can be blocked.
Configure the web proxy server to forward the traffic to the application server for processing. This can be done by configuring the web proxy server to use a load balancer or a round-robin algorithm to distribute traffic evenly among multiple servers.
Set up iptables rules to rewrite the source address of the forwarded traffic to that of the proxy server, so the application server can send the response back to the proxy server.
Configure the web proxy server to send the application server's response back to the client.
By simply following the steps above, you can set up an application firewall and web proxy using iptables, ensuring your web apps are secure and run smoothly on a remote server.
Rest assured that iptables is a powerful tool that can be complex, and mastering its configuration requires a solid understanding of networking and security concepts. To ensure optimal firewall implementation, I'd recommend seeking guidance from a certified network security professional prior to deploying in a production environment.
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.
Like the explanation? Want to connect? You can find me here! Till then, happy learning 



Enhancing Runtime-Security with Falco: My Hands-On Experience
Santoshdts — Mon, 27 Mar 2023 12:30:39 GMT
Containerization and microservices have revolutionized the way applications are developed and deployed. But, with these technological advancements come new challenges in securing the containerized environment. Security in a containerized environment requires a different approach than traditional security mechanisms. It involves the continuous monitoring of container activities, identifying security threats, and ensuring compliance.
In this blog, I share my hands-on experience with Falco, an open-source tool designed specifically for securing containerized environments. It can be easily categorized as the de facto threat detection engine for Kubernetes and for cloud infrastructure. Because of its advanced features and capabilities, Falco has become a popular choice among DevOps and security professionals alike.
I will discuss how Falco enhances security in containerized environments and how it can help in addressing security challenges. I will also share my experience with the tool and provide insights into its features and capabilities.
As we are aware, Linux systems come with isolated environments  userspace and kernel space. Falco operates at both the user space and kernel space. The system calls are intercepted by the kernel module by the executable code deployed inside the OS kernel or using eBPF probes, which allows running scripts safely and performing actions inside the OS. The syscalls are then analyzed using the libraries. Alerts are generated when there is a match in a rule defined in the rule engine and alerted to outputs that are configured as Syslog, files, Standard Output, and others.
Falco can be installed on any local machine, cloud, or Kubernetes cluster. It mainly uses different drivers for collecting system calls' activity on the host. In this post, I will be installing Falco's latest release v3.41as a Package on my Ubuntu 22.04 system, so that it can detect any malicious activity occurring in the runtime. The detailed installation instruction for installing Faclo can be found in the official docs. I have gone ahead and installed the Falco package using the pre-installed dialog package and chose the falco-modern-bpf driver for this demo.
Once the installation is complete, we can verify that the Falco package is installed correctly and running by using the systemctl status falco-modern-bpf.service command:
santosh@~:$ systemctl status falco-modern-bpf.service  falco-modern-bpf.service - Falco: Container Native Runtime Security with modern ebpf     Loaded: loaded (/lib/systemd/system/falco-modern-bpf.service; enabled; vendor preset: enabled)     Active: active (running) since Mon 2023-02-27 16:49:51 IST; 32s ago       Docs: https://falco.org/docs/   Main PID: 39540 (falco)      Tasks: 9 (limit: 14115)     Memory: 85.7M        CPU: 1.787s     CGroup: /system.slice/falco-modern-bpf.service             39540 /usr/bin/falco --pidfile=/var/run/falco.pid --modern-bpfFeb 27 16:49:51 santoshdts falco[39540]: Loading rules from file /etc/falco/falco_rules.yamlFeb 27 16:49:51 santoshdts falco[39540]: Loading rules from file /etc/falco/falco_rules.local.yamlFeb 27 16:49:51 santoshdts falco[39540]: The chosen syscall buffer dimension is: 8388608 bytes (8 MBs)Feb 27 16:49:51 santoshdts falco[39540]: Starting health webserver with threadiness 4, listening on port 8765Feb 27 16:49:51 santoshdts falco[39540]: Enabled event sources: syscallFeb 27 16:49:51 santoshdts falco[39540]: Opening capture with modern BPF probe.Feb 27 16:49:51 santoshdts falco[39540]: One ring buffer every '2' CPUs.Feb 27 16:50:02 santoshdts falco[39540]: 18:23:12.408382864: Error File below /etc opened for writing (user=root user_loginuid=-1 command=falcoc>Feb 27 16:50:15 santoshdts falco[39540]: 18:23:25.539821933: Error File below / or /root opened for writing (user=root user_loginuid=1000 comman>Feb 27 16:50:18 santoshdts falco[39540]: 18:23:28.101536918: Error File below /etc opened for writing (user=root user_loginuid=-1 command=falcoc>
As you can see from the above output the Falco service is up, and running, and is already collecting logs.
Since version 0.34, Falco supports other driver types apart from kernel modules. To see the list of drivers view the systemd unit-files. However, it is recommended not to run multiple units in parallel :
santosh@~:$ systemctl list-unit-files | grep falcofalco-bpf.service                                                             disabled        enabledfalco-custom.service                                                          disabled        enabledfalco-kmod-inject.service                                                     static          -falco-kmod.service                                                            disabled        enabledfalco-modern-bpf.service                                                      enabled         enabledfalcoctl-artifact-follow.service                                              disabled        enabled
Falco Rules
Also, you can see from the above system status, Falco by default loads rules from the default directory /etc/falco/falco_rules.yaml. The default Falco configuration file is also placed in the same directory as /etc/falco/falco.yaml. This config file contains all the details about the rule files, the log output methods, etc. At the very top of the Falco config file, the order in which the rules will be evaluated is defined:
The order in which the above rules files are defined is important. The defining order decides which rule file Falco will use to evaluate events generated by syscalls as default, which is the first one defined in the list /etc/falco/falco_rules.yaml.
It is advisable and recommended practice, not to overwrite the main rules file, i.e etc/falco/falco_rules.yaml file, and make any modifications or additions to the rules in the file located in the same directory by the name /etc/falco/falco_rules.local.yaml. You can see from the above image, this local rule is defined second in order of preference. We will see this in action later.
Falco ruleset basically is a yaml file consisting of five main components as a list.
rule: a rule in any ruleset defines the name of the rule being defined.
desc: A short description of the rule.
condition: a condition is the important part in a rule. A condition is a boolean expression, which is evaluated when an event is triggered and detected by Falco. The condition stanza contains various syscall event types, file descriptive combined with some boolean operators. You can also check all the supported fields on the command prompt by using falco --list=syscall command.
Output: Output is the field that formats the logs in a readable format. The output is formatted in   . The message can be broken down into two parts. The first is a human-readable message. The second includes some placeholders (ex: %user,name), that will be populated when outputted. The placeholders, start with a % symbol followed by one of the event's supported fields, as in the condition field.
Priority: This field indicates the severity of the rule being voilated. This is included in the output logs. Priority field can include values like EMERGENCY, ALERT, CRITICAL, ERROR, WARNING, NOTICE, INFORMATIONAL, DEBUG.
The above structure forms a rule in Falco. But, the rules can include some more fields like, list, macro, exceptions, tags, etc, to form advanced rules. You can read more on this in official docs.
Working with Falco Hands-on
So now, we have some understanding of the Falco rules and Falco configured, up and running. Let's put it to work.
To witness Faclo in action, let's try to open a sensitive file on a Linux system /etc/sudoers file which needs sudo access. Simultaneously, I will open another terminal to view systemd logs with the journalctl -fu falco-modern-bpf command:
Falco outputs the logs whenever a condition in a rule matches an event being triggered. You can see, the Falco tool detected that a sensitive file /etc/shadow was opened for reading with a priority of Warning. The output also provides some context to the event, like the user who tried to open the file, it was root in our case, the name of the file, etc.
Now, let us test it with Kubernetes pods:
First, in order to enable communication between our security tool Falco and Kubernetes Pods to fetch information from the containers. We need to install falco as a Daemonset on our Kubernetes cluster. This enables, falco to query the CRI  contained I'm my case for polling the events.
santosh@blogs:main$ kubectl get pods -n falcoNAME          READY   STATUS    RESTARTS      AGEfalco-gxxb6   2/2     Running   7 (34m ago)   3d15hfalco-rpz45   2/2     Running   7 (34m ago)   3d15h
Once, the Falco pods are up and running on both nodes (I've installed it on my two-node Kind cluster. Hence, two pods). We can move ahead and see Falco watching the processes triggered by Containers as well.
To see this in action, I've deployed a pod named privileged1, which is just a busybox container running with some serious security flaws in the configurations:
apiVersion: v1kind: Podmetadata: creationTimestamp: null labels: run: privileged1 name: privileged1spec: containers:  - args:    - sleep    - 1d image: nginx name: privilaged securityContext: privileged: true runAsUser: 1000 allowPrivilegeEscalation: true
In order to view the logs, we need to exec into the Kind node, where Falco is installed, and view the logs through journalctl the command line tool that lets you interact with the journal logs.
Now, let's deploy this pod and see if Falco catches the security misconfiguration in the workload.
As expected, Falco detected that a privileged pod was created and a volumeMount was executed inside the container. The severity of the output is set to Warning in this case. Suppose, we now try to spawn a shell inside the container. As spawning a shell inside a container is considered a security risk, Falco would again sense this and alert us about this.
Modifying a rule
Another point to notice from the above image is that the output in the logs is more concise, giving only the relevant information about the user and container. This is achieved by altering the default rules to provide the output. As we discussed earlier in the Rules section, the rules are evaluated based on their listing in the main Falco config file. The default rules faclo_rules.yaml file is one which comes pre-installed with all the rules formulated by the wonderful Falco community and might change during upgrades. Keeping this in mind, all the modification to the rules is made in a local rules file listed below the falco_rules.yaml file, namely falco_rules.local.yaml. Hence, while evaluating the rules, Falco looks first goes through the main rules file and then to the local file to evaluate the rules.
In order to make any changes to the existing rules, we need to add our custom rules in the falco_rules.local.yaml file.
santosh@blogs:main$ cat /etc/falco/falco_rules.local.yaml # Custom rules! - rule: Terminal shell in container    desc: Detects a shell being spawned in a Container    condition: container.id != host and proc.name in (linux_shells)    output: >      A shell was spawned in a container with an attached terminal (user=%user.name user_name=%user.loginname shell=%proc.name parent=%proc.pname cmdline=%proc.cmdline container_name=%container.name image=%container.image)    priority: WARNING  - list: linux_shells    items: [sh, bash, zsh]
There are various events that falco tracks for evaluating the process and generating readable output. We can learn more about all the event types and supported fields in the rules Condition field from Falco's official docs.
Once, the falco_rules.local.yaml file is altered, we need to restart the Falco service either by systemctl restart falco-modern-bpf.service or by hot reloading, this method does not restart the systemd service and restart the falco instance by identifying the pid of falco and sending a sighup signal by using this command: kill 1 $(cat pidof falco).
In our case, we were monitoring the logs from the journalctl utility as text files. But, Falco provides various ways to export the logs to different alert channels. We can export logs in json, to a specific file, a Program output for a Slack Incoming Webhook, even an HTTP/HTTPS endpoint to some URL, or via a gRPC API client to an external program.
Conclusion
In this post, we've just scratched the surface of the runtime security paradigm by working with Falco. Falco also supports Kubernetes Audit logs by ingesting the Kubernetes event as event source, among others. Falco's support for monitoring the Kubernetes Audit logs in real-time provides an additional layer of security for Kubernetes environments and helps organizations detect and respond to security threats quickly and effectively.
In the next post, we will be integrating Falco with Kubernetes, enabling robust runtime security for our Kubernetes cluster.
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.


Getting Started with KinD: Creating a Multi-node  Local Kubernetes Cluster
Chirag Varshney — Mon, 13 Mar 2023 12:30:39 GMT
Nowadays, Kubernetes is the most popular orchestration tool. So, have you ever wanted to become acquainted with its components, commands, or other related information?
Simply you just need a platform to play around with Kubernetes.
There are numerous platforms for playing around with Kubernetes clusters. Kubeadm, Kops (Kubernetes Operations), Minikube, and Killercoda are a few examples. However, as far as I can tell, those options have limitations. These clusters/environments are either temporary (killercoda) or you can only create a single control-plane node with a single etcd database running on it (kubeadm) or you get only a single node cluster (minikube) or you must pay for what you consume.
What if we could create a highly available k8s cluster locally for development and testing? Which is permanent and does not require payment. That's fantastic, isn't it? Furthermore, if the cluster configuration procedure is simple?
Yes, we are talking about KinD (Kubernetes in Docker). It is a tool for running local Kubernetes clusters using Docker container nodes. Kind was primarily designed for testing Kubernetes itself but may be used for local development or CI.
We discuss more about KinD in detail in this article, including how to use it to create single-node, multi-node, and multiple nodes clusters, as well as how to deploy an application to your kind cluster.
What is KinD?
KinD (Kubernetes in Docker) is a simple tool with several powerful and unique features that make it easier to run local Kubernetes clusters. Kind is a Kubernetes SIGs project that is quite distinct from minikube. It encapsulates the cluster in Docker containers. This results in a substantially faster starting time as compared to running a VM.
With Kind, it is easy to spin up a local Kubernetes cluster within Docker Desktop. The Kind runs as a container by itself.
Kind documentation is easy and straightforward to understand, for more details and understanding refer this.
Installation of KinD
Pre-Requisites
Install Docker - You must have docker installed and running in your system. If not, you can get it from here as per your OS.
https://docs.docker.com/engine/install/
 
Install Kubectl (optional) - kind does not require kubectl, but you will not be able to perform some of the examples without it.
  Follow this documentation to install and set up kubectl.
Install KinD
On macOS
Via Homebrew:
    brew install kind
Via MacPorts:
  sudo port selfupdate && sudo port install kind
From Release Binaries:
  # for Intel Macs  [ $(uname -m) = x86_64 ]&& curl -Lo ./kind https://kind.sigs.k8s.io/dl/v0.17.0/kind-darwin-amd64  # for M1 / ARM Macs  [ $(uname -m) = arm64 ] && curl -Lo ./kind https://kind.sigs.k8s.io/dl/v0.17.0/kind-darwin-arm64  chmod +x ./kind  mv ./kind /some-dir-in-your-PATH/kind
On Linux
 curl -Lo ./kind https://kind.sigs.k8s.io/dl/v0.17.0/kind-linux-amd64 chmod +x ./kind sudo mv ./kind /usr/local/bin/kind
On Windows
Via Chocolatey:
  choco install kind
From Release Binaries:
  curl.exe -Lo kind-windows-amd64.exe https://kind.sigs.k8s.io/dl/v0.17.0/kind-windows-amd64  Move-Item .\kind-windows-amd64.exe c:\some-dir-in-your-PATH\kind.exe
To see if KIND is installed on your system, you can use the command kind version to see what version of KIND is installed.
Creating Cluster
Creating Single-node Cluster
Create a single-node cluster without any config file. Just use the below command:
kind create cluster
This will use a pre-built node image to bootstrap a Kubernetes cluster. Prebuilt images are hosted atkindest/node.
You can use the command kubectl get nodes to vaidate that your single-node cluster is running correctly.
Cluster Configurations
By default, the cluster access configuration is stored in ${HOME}/.kube/config if $KUBECONFIG environment variable is not set.
You can access the config file by using the command less ~/.kube/config
To get the api server to which we are going to interact for connecting to our kubernetes cluster, we can use the command grep server ~/.kube/config
Changing node image:
To specify another image use the --image flag  kind create cluster --image=....
kind create cluster --image kindest/node:v1.24.7@sha256:577c630ce8e509131eab1aea12c022190978dd2f745aac5eb1fe65c0807eb315
Using a different image allows you to change the Kubernetes version of the created cluster. Every version of kind supports the specific list of versions of Kubernetes, you can see the list of supported versions of Kubernetes from the release page.
Changing cluster context name:
By default, the cluster will be given the name kind. Use the --name flag to assign the cluster a different context name.
kind create cluster --name sample
You can use the command kubectl config get-contexts to list the currently active clusters.
To switch between different cluster, you can use kubectl config use-context 
Interacting with the cluster:
After creating a cluster, you can use kubectl to interact with the cluster created by kind.
#for defaultkubectl cluster-info#for cluster having specified context namekubectl cluster-info --context 
You can list all the active clusters by using the command kind get clusters .
Creating Multi-node Cluster
To create a multi-node kind-cluster environment use the config file given below.
# this config file contains all config fields with comments# NOTE: this is not a particularly useful config filekind: ClusterapiVersion: kind.x-k8s.io/v1alpha4# patch the generated kubeadm config with some extra settingskubeadmConfigPatches:- |  apiVersion: kubelet.config.k8s.io/v1beta1  kind: KubeletConfiguration  evictionHard:    nodefs.available: "0%"# patch it further using a JSON 6902 patchkubeadmConfigPatchesJSON6902:- group: kubeadm.k8s.io  version: v1beta3  kind: ClusterConfiguration  patch: |    - op: add      path: /apiServer/certSANs/-      value: my-hostname# 2 control plane node and 2 workersnodes:# the control plane node config- role: control-plane- role: control-plane# the two workers- role: worker- role: worker
In this config file, we are creating a multi-node cluster with two control planes and 2 worker nodes, you can create more according to your requirements.
Save the above config file as example-config.yaml .
You can create a cluster using a pre-defined config file by using the command kind create cluster --config  .
kind create cluster --config example-config.yaml
You can validate the multi-node clusters created by running the command kubectl get nodes to ensure that all nodes are running correctly.
To list all the running containers you can run docker ps .
By running the command grep server ~/.kube/config, we can see that this server is the same as kind-external-load-balancer i.e. we are connecting with kind-external-load-balancer, which will directly communicate with other master nodes.
Deleting Cluster
You can delete the cluster by using the command given below:
#for default clusterskind delete cluster#for cluster having different context namekind delete cluster --name 
Dynamic Volume Provisioning
Dynamic Volume Provisioning in Kubernetes is a mechanism that allows storage volumes to be created on demand. To accomplish this, Kubernetes Cluster employs the Storage class concept, which abstracts the details of the underlying storage. Cluster administrators must manually call their cloud or storage provider and then create Persistent Volume objects in Kubernetes without dynamic provisioning.
Kind comes with a pre-configured default Storage Class when you create the kind cluster. To see the list of available storage classes, use the command kubectl get sc .
WaitforFirstConsumer indicates that pvc(persistent volume claim) will not be bound until it is attached to a pod.
Now, create a PVC file by using the given below code.
# local path provisioner only supports readwriteonceapiVersion: v1kind: PersistentVolumeClaimmetadata:  name: pvc-testspec:  storageClassName: standard  accessModes:    - ReadWriteOnce  resources:    requests:      storage: 500Mi
Save this file as pvc.yaml and run the following command to create a persistent volume claim from this pvc. yaml file:
kubectl create -f pvc.yaml
Create another yaml file for the busybox pod by using the given below code and save it as busybox.yaml.
apiVersion: v1kind: Podmetadata:  name: busyboxspec:  volumes:  - name: host-volume    persistentVolumeClaim:      claimName: pvc-test  containers:  - image: busybox    name: busybox    command: ["/bin/sh"]    args: ["-c", "sleep 600"]    volumeMounts:    - name: host-volume      mountPath: /mydata
Run the following command to create a pod:
kubectl create-f busybox.yaml
Run the following commands to validate the persistent volume or persistent volume claim created, and to check whether the pod is running or not:
kubectl get pv,pvckubectl get pods
Now we've actually built a multi-node pvc-backed cluster and mounted it on busybox.
You must expose your service after it has been deployed to Kubernetes so that your users can access it. The cluster can be accessed from outside in three ways: ingress, load balancer, and node port.
Exporting Cluster Logs
kind has the ability to export all kind related logs for you to explore.
#To export all logs from the default cluster (context name kind):kind export logs#Like all other commands, if you want to perform the action on a cluster with a different context name use the --name flag.kind export logs --name 
As you can see, kind put all of the cluster logs in a temporary directory. If you want to specify a location, simply follow the command with the path to the directory:
The structure of the logs will look more or less like this:
 docker-info.txt kind-version.txt kind-worker     containers     alternatives.log     containerd.log     images.log     serial.log     docker.log     inspect.json     journal.log     kubelet.log     kubernetes-version.txt     pods/ kind-worker2 kind-control-plane kind-control-plane2/     containers     alternatives.log     containerd.log     images.log     serial.log     docker.log     inspect.json     journal.log     kubelet.log     kubernetes-version.txt     pods/
Deploying an Application
You can use the kubectl command-line tool to deploy an application to your KinD cluster. Create a deployment definition file that contains the specifics of your application. An example deployment definition file for a simple Nginx web server is provided below:
apiVersion: apps/v1kind: Deploymentmetadata:  name: nginx-deploymentspec:  replicas: 3  selector:    matchLabels:      app: nginx  template:    metadata:      labels:        app: nginx    spec:      containers:      - name: nginx        image: nginx:1.19        ports:        - containerPort: 80
Save this file as nginx-deployment.yaml and then run the following command to create the deployment:
kubectl apply -f nginx-deployment.yaml
This command will create a deployment with three Nginx web server replicas.
To connect to the web server, you must first create a service that exposes the deployment. The following service definition file can be used to create a service:
apiVersion: v1kind: Servicemetadata:  name: nginx-servicespec:  selector:    app: nginx  ports:  - name: http    port: 80    targetPort: 80  type: ClusterIP
Save this file as nginx-service.yaml and then run the following command to create the service:
kubectl apply -f nginx-service.yaml
This command will create a ClusterIP service, which will expose the Nginx web server deployment.
To obtain the IP address of the service, use the kubectl get services command. You can access the Nginx web server once you have the IP address by opening a web browser and navigating to http://:80.
Conclusion
There we are at the end, this blog covered how to set up a multi-node KinD cluster, including how to install KinD, configure the cluster, and deploy an application to the cluster. KinD is a useful tool for testing and development because it allows you to set up and manage a local Kubernetes cluster. KinD allows you to experiment with various configurations and deploy a wide range of applications, making it a valuable tool for any Kubernetes developer.
Don't forget to like and share this blog if you liked it. Connect with me on Twitter for getting updates on more such blogs.
THANKS FOR READING !!😁
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.


Operating Systems 101: Essential Knowledge for  DevOps/SRE Engineers
Krishnamohan Yerrabilli — Thu, 09 Mar 2023 12:30:39 GMT
When it comes to DevOps, you may have come across some challenging concepts, such as Kubernetes, Docker, Helm, Prometheus, and others, which can be difficult to grasp without fundamental knowledge. That's why I'm starting a new blog series, called Building a Strong Foundation in DevOps/SRE.
It's crucial to start with the basics to become a good engineer, and I'm committed to providing you with a comprehensive understanding of these complex ideas. Ok, enough talk, let's get started.
Introduction
When it comes to your computer, the operating system plays a crucial role as the Head of everything. It acts as a manager, ensuring the smooth functioning of all the different parts of your computer, by keeping them in line. The operating system creates and runs the programs that make up your computer applications, serving as the backbone of your computer's functionality
So, in other words, the operating system acts as an intermediary between the hardware and the applications that run on the computer. It is responsible for managing and coordinating the use of the computer's hardware resources, such as memory, processing power, and input/output devices, as well as providing a common interface for applications to interact with the hardware
Through this, the applications can access the necessary resources they need to function properly, the OS makes sure everything is running smoothly and coordinating all the different parts of the computer in line with each other.
History of Operating Systems
It all begins a long way back from the early days of batch processing on centralized mainframes, which have evolved through time. This evolution started with the introduction of time-sharing systems, allowing multiple users to access the computer at once
To start with, MS-DOS was the first popular personal computer operating system. The Macintosh, introduced in 1984, revolutionized the game with the introduction of the graphical user interface (GUI)
In the 90s, Windows, introduced by Microsoft, combined features of the Macintosh with MS-DOS and became the most popular personal computer operating system. Today, OSes are designed particularly for different types of devices, such as smartphones, servers, and supercomputers.
Functions of an Operating System
Did you see, there are a lot of functions that an OS can handle, let's understand 5 of them
Memory Assignment
To guarantee an appropriate amount of memory for each program in operation and to resolve issues between programs, the system responsible takes care of the computer's memory management and assignment through proper handling.
Resource Allocation
Is responsible for coordinating and efficiently utilizing various computer resources, such as CPU time, memory, and storage, and allocates these resources with the goal is to achieve optimal performance by efficiently utilizing them.
Process Coordination
It is the central authority, to allows each program to run efficiently and without interfering with other programs and processes, and is accountable for orchestrating the execution of programs and processes in the computer system by using efficient resource utilization techniques.
Safety and Protection
To keep the computer and its information secure, the operating system comes with some safety features, like user authentication, access control, and data protection, all aimed at ensuring the safety and security of the computer and its data
File Organization
Users want to access their data easily, So file system management takes care of organizing and managing how the computer stores, gets, and changes files and information. This is done by properly organizing and managing the data on the computer to make it easier to store and retrieve files.
Types of Operating Systems
It comes in various shapes and sizes, each boasting its own unique set of features and abilities. Broadly speaking, the main categories of operating systems are single-tasking and multitasking
Single-tasking operating systems can often be found on older computers and, as the name says, they are only capable of running one program at a time. To launch a new program, the current one must first be closed. Despite the limitations of single-tasking systems, they tend to be more user-friendly and are less prone to crashes in comparison
Multitasking OS which quickly reacts to outside things is a real-time system. This kind of system is often used in things like flying, driving, and running factories, where fast reactions are very important. The real-time system can reply quickly and exactly, making it a good pick for things that need a real-time response.
Process Management
Process management is a crucial aspect of operating systems and regards the way programs and processes are managed on a computer. The goal is to use system resources. This is gonna achieved by creating, scheduling, executing, and monitoring processes in a coordinated manner.
There is a thing, called a process management system that takes care of managing the life cycle of each process, including allocating resources, scheduling, and synchronization, as you see above it is also responsible for the termination of processes.
Memory Management
It deals with the management and coordination of computer memory. This is done in a way that each process has enough memory to execute its tasks and the system remains stable. To achieve this goal, memory management involves allocating memory to processes, freeing up memory that is no longer needed, and ensuring that there are no conflicts between processes fighting for memory resources
This system keeps track of which parts of memory are being used and which parts are available for use. The goal is to make the most effective use of memory resources, provide fast memory access, and minimize memory waste and fragmentation.
Storage Management
It refers to the process of basically monitoring and controlling the storage of data in computer systems, with the goal to make the best use of available storage resources and to minimize the risk of data loss or corruption.
This includes tasks such as allocating, organizing, and monitoring the use of storage resources in order to ensure that they are being used effectively and efficiently.
File System Management
Probably it's the process of managing and organizing the storage of files and directories on a computer. This includes tasks such as creating and deleting files, creating and managing directories, and managing the allocation of storage space to individual files and directories.
The main goal of File System Management in the OS is to provide a structured and organized way of storing and accessing data on a computer. This is achieved with the use of a file system, which is a way of organizing and storing data in the form of files and directories.
Security and Access Control
This is one of the important aspects of OS, Security Access Control (SAC) is the process of protecting and controlling access to resources and data stored on a computer system. This includes tasks such as managing user accounts, setting permissions for access to files and directories, and controlling the execution of programs.
The task of organizing the storage of files and directories on a computer includes creating and deleting files, creating and managing directories, and managing the allocation of storage space to individual files and directories.
Networking
This is the basic way how computers are connected together, Client-Server Architecture refers to the process of connecting computers and devices to each other to exchange data and information.
This includes tasks such as configuring network interfaces, setting up communication protocols, and managing the flow of data between devices it plays the role in enabling communication and collaboration between different devices and systems.
Interrupts and Exception Handling
These are the methods used by the computer system to handle unexpected events or errors. Interrupts are signals sent to the processor indicating that an event requiring immediate attention has occurred. Examples of interrupts include hardware events such as a key press on the keyboard or clicking on your mouse.
Exception handling is the process of dealing with errors or unexpected events that occur during program execution. This includes errors such as divide-by-zero or illegal memory accesses this is actually represented in a vector scale I mean it's basically a table with the data indexing that what to do when an unusual thing happens.
Deadlocks and resource allocation
It's the concept related to the problem of managing the allocation of shared resources in a computer system. In a multitasking environment, multiple processes may request and hold onto resources simultaneously, leading to the possibility of a deadlock.
As I mentioned in the visual, this usually happens when two or more processes are blocked, and each is waiting for the release of a resource that is held by the other process. This creates a circular wait condition and the processes are unable to proceed or make progress.
To avoid these locks, a proper strategy for resource allocation is required. One common way is to use a resource allocation algorithm that determines the order in which resources are granted to processes. This helps in avoiding circular wait conditions and making sure that resources are efficiently utilized.
Processor Scheduling
Process Scheduling refers to the process of determining which tasks the processor will execute next and when. This involves making decisions about how to allocate the processor's time among the tasks that are waiting to be executed, based on factors such as priority, deadline, and other resource requirements.
To ensure that the processor is used effectively and efficiently so that tasks are completed in a timely manner and the overall system performance is improved, the operating system uses algorithms and data structures to manage the scheduling process in the most efficient way possible, taking into account the state of the processor and the available system resources.
Process Synchronization
It is the synchronization of processes in a computer system to avoid conflicts and race conditions. This is achieved by making sure that access to shared resources, such as data or files, is regulated in a way that only one process at a time can access them.
Where these processes can execute concurrently without interfering with each other, which can lead to data corruption or incorrect results. This is achieved through the use of synchronization techniques such as semaphores, monitors, and critical sections.
Threads
The concept of Threads in an operating system is all about making sure that multiple tasks can run in parallel within a single process. This allows for more efficient use of the processor's time and can lead to improved performance.
Note that: A process is just a program that is executing, and it can contain multiple threads. Each thread runs independently of the other threads in the same process and has its own program counter, stack, and local variables.
Virtual Memory
The purpose of virtual memory is to allow a computer system to run multiple programs simultaneously while making sure that each program has enough memory to execute properly. This is achieved by allowing each program to have its own virtual address space, which is a portion of the memory that is isolated from the memory of other programs.
it simply means, the operating system allocates a separate space of the hard disk to be used as an extension of the RAM, which acts as a temporary storage area for data that is not currently needed.
File System Implementation
The storage of files and directories on a computer is organized and efficiently involves implementing a File System and managing the allocation of storage space for individual files and directories.
Includes tasks such as creating and deleting files, creating and managing directories, and using related Index Allocation Techniques to manage the efficient use of storage space. The implementation of the file system and related index allocation must also ensure that data remains secure and protected with the proper access control and protection mechanisms.
OS-level services
System Services are one of the core concepts of OS, its facilities and functions are provided by the operating system to the user and applications to make it easier for them to interact with the hardware. These services are provided at the abstraction level, hiding the complexities of the underlying hardware.
Making sure these services are efficient and reliable is a crucial aspect of the operating system's functionality. There are plenty of services provided by OS including memory management, process management, file management, and many others. These services are designed to be flexible and customizable, allowing applications to make use of them in different ways for their specific needs.
Design Principles
These are actually guidelines and approaches used in the creation and development of an OS. These principles aim to make sure the OS operates efficiently, is user-friendly, and provides robust and reliable services to applications. Some of the important design principles of an OS are
Modularity
To divide the system into smaller, manageable components that can be developed, tested, and maintained independently.
Abstraction
To hide the complexity of the underlying hardware and present a simple and unified view to the user.
Hierarchy
Organize the system into a series of levels, each building upon the services of the lower level.
Kernel interfaces and System Utilities
System utilities are programs that are used for various tasks such as file management, process management, and system maintenance. They are independent programs that run outside of the kernel and interact with it through the kernel interfaces. it provides a standard way for the system utilities to access the services of the kernel.
The design of the kernel interfaces and system utilities must take into account the requirements of different applications, the hardware, and the operating system, and the need to make sure that the system is efficient, reliable, and easy to use.
So here comes the end.
Thank you for taking the time to go through the extensive blog. I am certain that this information will be of great use to you and will inspire you to continue exploring the vast world of DevOps, see you soon with another one, and happy learning!!
Resources
Operating Systems Principles By Stanford, Course Code (CS111)
Operating System Concepts By Abraham_Silberschatz,Greg, Peter
Operating_Systems_By_HillaryNyakundi
Operating System Full Course
If you're curious or have any questions about the topic, just send me a direct message on Twitter, I'm happy to chat and clear things up for you!
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.


How to Install a Kubernetes Cluster with Kubeadm, Containerd, and Cilium: A Hands-On Guide
Santoshdts — Tue, 07 Mar 2023 12:30:39 GMT
There are many ways to create a self-managed Kubernetes Cluster like Kind, Minikube, etc. Apart from these, there are many ways we can create managed Kubernetes clusters on cloud providers of our choice. The self-managed clusters created by the above tools are suitable for testing our workloads and integrations. Given the complexity, the kubeadm is not the most popular choice for creating a production-grade on-premise cluster. However, creating a cluster using Kubeadm can help in understanding the various components and configurations.
Kubeadm is a tool that is used to bootstrap a Kubernetes cluster from scratch. It provides a way to create a fully functional, production-ready Kubernetes cluster by following a set of well-defined steps. Kubeadm is highly customizable, and it allows users to configure different aspects of the cluster, such as the network configuration, the container runtime, and the authentication and authorization policies. Kubeadm is a good choice for learning Kubernetes because it provides a more realistic simulation of a production environment and allows users to practice the cluster setup and configuration.
In this post, we shall walk through a hands-on demo of installing a two-node Kubernetes Cluster of version 1.26.0 built using the Kubeadm tool with ContainerD as a Container Runtime and Cilium as a CNI plugin.
Let's start with getting ready with our two nodes, which shall be virtual machines provisioned using VirtualBox and Vagrant. We will configure the VMs with SSH keys to enable communication between both VMs. Once we are ready with the VMs provisioned. We will start with configuring the nodes with the prerequisites like updating the packages, turning off the swap memory, etc.
We'll start with updating and upgrading the apt packages on both Nodes:
$ apt-get update && apt-get upgrade -y
If required escalate the privileges by using the sudo privileges
Install the Kubernetes Packages using the apt repository:
sudo apt-get install -y apt-transport-https ca-certificates curl
Download the Google Cloud public signing key:
curl -fsSL https://packages.cloud.google.com/apt/doc/apt-key.gpg | sudo gpg --dearmor -o /etc/apt/keyrings/kubernetes-archive-keyring.gpg
If you get an error like:
curl: (23) Failed writing body (0 != 1210)
This indicates the /etc/apt/keyrings directory does not exist. we need to create this specific directory to download the Google Cloud public signing keys.
Add the Kubernetes apt repository:
echo "deb [signed-by=/etc/apt/keyrings/kubernetes-archive-keyring.gpg] https://apt.kubernetes.io/ kubernetes-xenial main" | sudo tee /etc/apt/sources.list.d/kubernetes.list
Update the apt packages and install kubeadm, kubectl, and kubelet packages from the apt package:
sudo apt-get install -y kubelet=1.26.0-00 kubeadm=1.26.0-00 kubectl=1.26.0-00
In the above snippet, we will be installing a specific version opf the tools i.e. v1.26.0.
We need to place a hold on the above-installed packages for any accidental upgrades: sudo apt-mark hold kubelet kubeadm kubectl
We need to repeat all the above processes on the kubenode01 as well. We can choose if we need the kubectl tool available on the worker node.
Once, the above steps are performed on both the ControlPlane and Worker Node. We need to perform a very important configuration on both nodes. Disabling swap memory, enabling a couple of Kernel Modules, and updating the Settings in sysctl.
First, we need to disable the swap for the kubelet to work properly, we'll do it by first checking for the /etc/fstab and look for a line:
/swap.img none swap sw 0 0
If this line is available in the fstab file, we can disable this setting by commenting it out. Instead of rebooting our nodes, we can apply the following command to disable the swap sudo swapoff -a.
Once the swap is disabled, we need to enable two kernel modules, overlay and br_netfilter:
cat <
sudo modprobe overlay
sudo modprobe br_netfilter
cat <
Now, we should see the config file as below:
cat /etc/sysctl.d/k8s.confnet.bridge.bridge-nf-call-ip6tables = 1net.bridge.bridge-nf-call-iptables = 1net.ipv4.ip_forward = 1
Once the file is saved, we must reload the sysctl:
sudo sysctl --system
In the next step, we shall move ahead and install Containerd as CRI and Cilium as a CNI addon.
Installing ContainerD
With the depreciation of Docker for Kubernetes since v1.25.0, Containerd is one of the preferred choices of the Ops team. Containerd is a Container-Runtime developed by Docker that manages the container lifecycle. In February 2019, Containerd became an official project within the Cloud Native Computing Foundation (CNCF).
While there are multiple ways to install Containerd, we shall be using the method of installing it using the apt package. The Containerd runtime needs to be installed on both nodes. The first thing to do is to enable iptables Bridged Traffic on all the Nodes and to configure the persistent loading of the necessary Containerd modules by using the following commands:
sudo tee /etc/modules-load.d/containerd.conf << EOFoverlaybr_netfilterEOF
Reload the sysctl configurations with sudo sysctl --system command.
Install the necessary dependencies with the following command:
sudo apt install curl gnupg2 software-properties-common apt-transport-https ca-certificates -y
Now, we need to add the GPG keys with curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add - command.
Adding the repository with sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable" command.
Now, we are ready to install the Containerd package through the apt package manager using the following command:
sudo apt updatesudo apt install containerd.io -y
Once, we have successfully installed the Containerd. We need to load the Containerd configurations, for this, we might need to gain access to sudo privileges sudo -i:
mkdir -p /etc/containerdcontainerd config default>/etc/containerd/config.toml
Now, restart the Containerd systemd service and enable it:
systemctl restart containerdsystemctl enable containerd
By now, we have installed Containerd as a CRI installed on both of our nodes. Now, we need to configure the Kubernetes cluster with kubeadm tool and install the CNI plugin of our choice.
Configure Kubernetes Controlplane
Once, the CRI is installed successfully on both nodes, we are now ready to configure our Kubernetes Controlplane. We need to perform the following actions on the ControlPlane node we identified earlier.
$ kubeadm init --pod-network-cidr=10.1.1.0/24 --apiserver-advertise-address 
The kubeadm init command expects mainly two arguments among others. the --pod-network-cidr and --apiserver-advertise-address. The --pod-network-cidr enables inter-pod networking which we will install using Cilium and the cidr range for cilium is 10.1.1.0/24. The --apiserver-advertise-address is the one we need to carefully assign the IP address of the Controlplane nodes API Server. We can get the IP address by using ifconfig or ip a command.
This command will configure and bootstrap the controlplane node by installing all necessary components and provide an output with a kubeadm join command and other settings required.
For example:
Your Kubernetes control-plane has initialized successfully!To start using your cluster, you need to run the following as a regular user:  mkdir -p $HOME/.kube  sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config  sudo chown $(id -u):$(id -g) $HOME/.kube/configYou should now deploy a Pod network to the cluster.Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:  /docs/concepts/cluster-administration/addons/You can now join any number of machines by running the following on each nodeas root:kubeadm join 192.168.56.11:6443 --token 1gehfl.g3n31uj4cmvnzxug --discovery-token-ca-cert-hash sha256:17a0b8da44fe941c2c00808928a6bbce54d1e7b42d77c865b3e619192949856f
In case we miss this join command with the token, we can create a new token with the following command to be used on the Worker node:
vagrant@kubemaster:~$ kubeadm token create --print-join-commandkubeadm join 192.168.56.11:6443 --token 1gehfl.g3n31uj4cmvnzxug --discovery-token-ca-cert-hash sha256:17a0b8da44fe941c2c00808928a6bbce54d1e7b42d77c865b3e619192949856f
We need to copy this kubeadm join command and apply this to our kubenode01 worker node to join the cluster. Once we have applied this join command. We need to return to the Controlplane and configure the kubeconfig required for authentication to the API server.
We do this by creating a directory to place the cluster details, contexts, and credentials in the mkdir -p $HOME/.kube directory by coping the following:
$ sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config  sudo chown $(id -u):$(id -g) $HOME/.kube/config
Once this is configured, our cluster should be almost up. We can check this by using the kubectl get nodes command:
vagrant@kubemaster:~$ kubectl get nodesNAME         STATUS     ROLES           AGE     VERSIONkubemaster   NotReady   control-plane   3m59s   v1.26.0kubenode01   NotReady             8s      v1.26.0
We can see the Nodes are in a NotReady state, this is because we have not yet implemented the Pod Networking CNI plugin yet. For this experiment, we shall be using Cilium as our networking solution.
We first download the Cilium binaries by using curl -LO https://github.com/cilium/cilium-cli/releases/latest/download/cilium-linux-amd64.tar.gz command. Then extract the downloaded file to your /usr/local/bin directory with the following command:
sudo tar xzvfC cilium-linux-amd64.tar.gz /usr/local/binrm cilium-linux-amd64.tar.gz
After running the above commands, you can now install Cilium with the following command: cilium install. Once the cilium is installed, we can check the status of the Cilium by using the cilium status command to confirm that the cilium is correctly installed. Once the Cilium addon is installed, we should see the Pod networking enabled and our nodes in the Ready state:
 Cilium was successfully installed! Run 'cilium status' to view installation health$ vagrant@kubemaster:~$ kubectl get nodesNAME         STATUS     ROLES           AGE     VERSION$ kubemaster   NotReady   control-plane   12m     v1.26.0kubenode01   Ready                8m17s   v1.26.0$ vagrant@kubemaster:~$ kubectl get nodesNAME         STATUS   ROLES           AGE     VERSIONkubemaster   Ready    control-plane   12m     v1.26.0kubenode01   Ready              8m23s   v1.26.0
And all the components are successfully installed, up and running:
$ vagrant@kubemaster:~$ k get po -n kube-systemNAME                                 READY   STATUS    RESTARTS   AGEcilium-kmvs6                         1/1     Running   0          13m50scilium-operator-5c594d7766-n88bm     1/1     Running   0          13m50scilium-vnj4v                         1/1     Running   0          13m50scoredns-787d4945fb-p64dv             1/1     Running   0          18m41scoredns-787d4945fb-tklwr             1/1     Running   0          18m41setcd-kubemaster                      1/1     Running   0          18m54skube-apiserver-kubemaster            1/1     Running   0          18m56skube-controller-manager-kubemaster   1/1     Running   0          18m54skube-proxy-9km94                     1/1     Running   0          17m55skube-proxy-xrfr7                     1/1     Running   0          18m41skube-scheduler-kubemaster            1/1     Running   0          18m54s
This looks great!!
Let's try and run a test workload on our cluster:
$ vagrant@kubemaster:~$ k run busybox --image busybox -- sleep 1dpod/busybox created$ vagrant@kubemaster:~$ k get po busybox NAME      READY   STATUS              RESTARTS   AGEbusybox   0/1     ContainerCreating   0          9s$ vagrant@kubemaster:~$ k get po busybox NAME      READY   STATUS    RESTARTS   AGEbusybox   1/1     Running   0          13s$ vagrant@kubemaster:~$ k get po busybox -owideNAME      READY   STATUS    RESTARTS   AGE   IP         NODE         NOMINATED NODE   READINESS GATESbusybox   1/1     Running   0          24s   10.0.1.8   kubenode01              
And, our pod is created successfully and has been scheduled on the kubenode01 worker node.
Congratulations, we have successfully created a two-node Kubernetes cluster with the help of the kubeadm tool and Containerd as CRI and Cilium as CNI plugin for our learning and development purposes. Now, we can play with this cluster and learn more in-depth about interacting with a production-grade on-premise cluster enabled with advanced eBPF-based networking and observability capabilities  Cilium and crictl as a command line tool to interact with the containers on the cluster.
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.



Speeding up using MicroK8s
Aayush Sharma — Tue, 21 Feb 2023 12:30:38 GMT
What is MicroK8s?
MicroK8s is a lightweight Kubernetes distribution that is designed to run on local systems. It is the smallest and fastest multi-node Kubernetes. It is lightweight and easy to set up, which helps in decreasing the complexity of management, deployment, and scaling of containerized applications, which further helps in the smooth transition to Kubernetes. It is available as a snap and runs on Linux, macOS, and Windows using multipass.
MicroK8s is an upstream Kubernetes deployment that is CNCF-certified and operates fully on our workstation. It can quickly build up a single-node cluster and works well with local development, IoT appliances, CI/CD, and at the edge because it employs snap packing, it is also capable of automatic updates, which means that once a new Kubernetes version is available on the main deployment, it will immediately update itself to the most recent version.
Benefits of MicroK8s
Apart from being fast, it has various other functionalities to look at before choosing any other distribution.
Do not require VM - MicroK8s was originally intended for Linux and does not require a virtual machine to execute. Instead, it is installed as a snap package.
Diverse integrations and resources - MicroK8s is ideal for edge deployments since it doesn't need a virtual machine (VM) and has more resources at its disposal to run applications.
Isolated Environment - As MicroK8s is a Zero-ops, pure-upstream Kubernetes,
  from developer workstations to production it provides an isolated and secure development environment fundamental to the Operating System.
High-Level addons - Initially there are various services provided by MicroK8s such as - kube-proxy, kubelet, api-server, and so on. Users can add more services as per their needs, to view the list of all the add-ons click here.
High Availability(HA) Support - In MicroK8s when three or more nodes are clustered, high availability is immediately activated, and the data store automatically switches between nodes to preserve a secret in the case of a failure. When a node is lost, HA MicroK8s can continue to offer reliable services, and lower production needs with a minimum of overhead and important work.
Lightweight - As said earlier it does not need to spin a VM containing all the services which might not even be used making the process faster and lighter. MicroK8s initially provides only the required services to run a single-node cluster in your local system using snap and multipass which make it lighter.
A vast variety of fields - MicroK8s may be utilized in a variety of technologies, including DevOps, AI/ML, CI/CD, and others. When compared to other options, all of the aforementioned fields require a significant amount of effort to configure, however MicroK8s simplifies the process and provides significant resource savings.
Some valuable Addons
AFTER INSTALLING AND RUNNING MICROK8S, RUN THE FOLLOWING CODE IN YOUR TERMINAL TO ENABLE ANY ADDON:
microk8s enable 
The following are the most crucial or often used MicroK8s additions for setting up a production-level environment:
cert-manager - Certificate controller for Kubernetes clusters.
 microk8s enable cert-manager
CoreDNS - To provide address resolution services to Kubernetes, CoreDNS is deployed. It is advised that you activate this service because other add-ons frequently use it.
 microk8s enable dns
dashboard - The default Kubernetes Dashboard.
 microk8s enable dashboard
ingress - This addon adds an NGINX Ingress Controller for MicroK8s.
 microk8s enable ingress
community - This enables the addition of several add-ons created by third parties and the community such as - portainer, istio, argued and so on.
 microk8s enable community
Installation
STEP-1 - Installing MicroK8s:
Linux - In Linux, MicroK8s is installed with the help of snap. Check if your system contains snap or not by running the below-given command:
  snap version
  It should display these details if present in your system and if you get an error install snap in your local system to install MicroK8s:
  
  If you have snap installed run the below command in your terminal to install MicroK8s in your local system:
  sudo snap install microk8s --classic
  It should take a while and you should get this output in your terminal:
  
  MicroK8s is successfully installed in your system!!
macOS - In macOS, MicorK8s is installed with the help of Homebrew. If you don't have Homebrew installed do install it in your local system and after that run the below command in your terminal to install MicroK8s:
  brew install ubuntu/microk8s/microk8s
  This can take a few minutes and at the end of the process it should display:
  
  After this execute the below command in your terminal and again this might also take a while:
  microk8s install
  Wait until you get this displayed on your terminal screen:
  
Windows - In windows simply run the MicroK8s Installer.
STEP-2 - Check the MicroK8s status:
 Run the below command in your terminal to check if MicroK8s is running:
 microk8s status --wait-ready
 The result on the terminal should be:
 
 That's it MicroK8s is up and running in your local system!!
Kubernetes Dashboard
After completing the installation steps, let's check the node's status by running the below command:
microk8s kubectl get nodes
I have one default "microk8s-vm" node running. Let's enable some of the important services before creating the dashboard:
microk8s enable dashboard dns
To double-check all the services that are running, enter the below command in your terminal:
microk8s kubectl get all --all-namespaces
Finally, create the dashboard using:
microk8s dashboard-proxy
Visit the link where the dashboard is available and enter the token displayed in your terminal to access the Kubernetes Dashboard.
Multi-node Setup
After completing the installation steps to run a multi-node cluster, let us first create two VMs using the multipass command:
multipass launch --name  --mem 4G --disk 40G
In the above command, the "--name" is used to name the VM, the "-mem" flag is used to specify memory, and the "-disk" is used to allocate the disk storage for the VM.
In the above two images, I have launched two VMs named "Blog" and "Blog-end".
To check the list of nodes running, enter the below command in your terminal:
multipass list
After this, shell into both the VMs using the command:
multipass shell 
Tip: Split your terminal window, which will help you avoid confusion while setting up the multi-node cluster.
Following that, install MicroK8s on both VMs separately using:
sudo snap install microk8s --classic
On completing the installation, run the below command to check the status of MicroK8s:
microk8s status --wait-ready
If you get the below error:
Check if you have the ".kube" directory in your VM by running:
As we see in the above directory list ".kube" directory is absent so create one with the help of the below command:
mkdir .kube
Now the ".kube" directory is created after this just copy and paste the suggested command into the terminal
sudo usermod -a -G microk8s ubuntusudo chown -R ubuntu ~/.kube
Following the process make a new group of microk8s using the below command:
newgrp microk8s
After completing these steps check again if MicroK8s is running or not by using:
microk8s status --wait-ready
As we see, Microk8s is running in both VMs.
To add the node to this cluster, we have to run the following command in the VM we want to end the node to:
microk8s add-node
After getting the output of the form:
Copy and paste "microk8s join....." as per your choice in the second VM and it will be added to the initial VM completing the multi-node setup.
For example, here I ran "microk8s add-node" in Blog VM where I wanted the multi-node setup and copied the below-given command in the output of the terminal. Pasted the command in the Blog-end VM to connect it with the initial VM.
If you get the above error it is because your initial node is not able to recognize the IP address of the node to be added. To resolve this error you have to manually add the IP address of the second node to the initial node by pasting the IP address followed by the node name in the "/etc/hosts" of your initial VM.
In my example, the IP address of my Blog-end VM is 192.168.64.28
So I will just add the below line in the "/etc/hosts" file of my Blog VM:
192.168.64.28 Blog-end
Get a new microk8s join link by again running "microk8s add-node" in the initial VM and paste it into the VM whose node is to be added.
And finally, check the number of nodes by running:
microk8s kubectl get nodes
Resources
MicroK8s
Video Explanation
Conclusion
In this article, we learned about MicroK8s. There you are at the end of this blog post, I hope this blog helps you understand the use of MicroK8s. Don't forget to like and share this post if you liked this blog. Connect with me on Twitter and LinkedIn. Follow me for more such blogs.
THANKS FOR READING 😄📖!!
#LEARNINPUBLIC #LEARNWHILEDOING
Aayush Sharma 👨🏻💻
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.


Deploy a Maven Project on a Tomcat Server Using Jenkins and AWS
Kanika Gola — Sun, 19 Feb 2023 12:30:39 GMT
In this blog, we are going to create a simple job that will deploy a Maven project on a tomcat server built on an EC2 instance through Jenkins.
The steps that will be followed for this project are:
Create a security group for Jenkins.
Set up Jenkins on AWS EC2 instance.
Connect to Jenkins instance using SSH.
Install Jenkins on the EC2 instance.
Manage some Jenkins plugins
Create a security group for Tomcat.
Set up Tomcat on AWS EC2 instance.
Connect to Tomcat instance using SSH.
Install Tomcat on the EC2 instance.
Create a job for deployment
NOTE: For setting up your AWS account, if you are using it for the first time, you can check out this video https://youtu.be/FRQ9fE4fd5g
Security Group for Jenkins
Go to your AWS Console and sign in as an IAM user.
I am choosing the region, ap-south-1 i.e the Mumbai region, which is present in the top right corner of the console, left to your account name.
You can select from any of them mentioned.
  
Search for the service EC2 from the search bar.
From the left toggle bar, search for Security Groups
Create a Security Group and name it Jenkins-Security-Group
  
Let's add some Inbound rules
  
First rule: To allow SSH to your EC2 instance, you need to provide this rule by giving your IP i.e., My IP so that only you will be allowed to SSH.
Second Rule: Next is Custom TCP with Port No. 8080 because our Jenkins server will run on this port.
Third Rule: The last one is HTTP with port 80, to give general access to the Internet
Now go ahead and click on create.
Jenkins Server
Search for the service EC2 from the search bar.
  
Click on 'Instances' from the toggle bar at the left.
Now let's launch an instance by clicking on 'Launch instances'
Now, enter some details for your instance, name it 'Jenkins-Server'
I am opting for Amazon Linux AMI here, you can use Ubuntu, macOS, Windows etc.
AMI: An Amazon Machine Image (AMI) is a supported and maintained image provided by AWS that provides the information required to launch an instance.
Make sure that the AMI you are opting for, lies in the free tier
Choose the size of the instance t3. micro or anything that lies in the free tier.
Let's create a key pair for your instance, naming it Jenkins-Key and downloading.
  
Choose ".pem", if you are going to use SSH to connect to your instance, or ".ppk" if you are using putty.
In network settings, select an Existing Security Group, which we just created Jenkins-Security-Group and then launch the instance.
Wait for the Status Check to complete.
  
Connect to the Jenkins Server using SSH
Click on Connect and select the SSH Client option.
  
Move to the directory, where you have your downloaded key pair. Mine is in the downloads.
Copy the third command and enter it in your terminal to ensure all permissions.
And then, enter the command given under the example.
Note: Make sure to enter the following command with a sudo prefix, if you haven't done sudo su - at the beginning.
  sudo ssh -i "Jenkins-Key.pem" ec2-user@ec2-15-207-19-201.ap-south-1.compute.amazonaws.com
  
Congratulations, you have connected to the instance🎉
Install Jenkins on your Instance
We are using sudo as a prefix with every command, you can also do sudo -su in the beginning to avoid that.
Enter the command, for a quick update of all the software packages on your instance
  sudo yum update y
Add the Jenkins repo
  sudo wget -O /etc/yum.repos.d/jenkins.repo \      https://pkg.jenkins.io/redhat-stable/jenkins.repo
Import a key file from Jenkins-CI to enable installation from the package:
  sudo rpm --import https://pkg.jenkins.io/redhat-stable/jenkins.io.key  sudo yum upgrade
Follow these commands
  # install java  sudo amazon-linux-extras install java-openjdk11 -y  # install Jenkins  sudo yum install jenkins -y  # enable the Jenkins service to start at boot:  sudo systemctl enable jenkins
Start Jenkins as a service
  sudo systemctl start jenkins
Check the status
  sudo systemctl status jenkins
Now copy the public IP address of the Jenkins instance, which is present in the details of the instance
  
Enter this IP address with the port number, i.e., ""
  
Now, get the password by entering the following command and enter it in the text box
  sudo cat /var/lib/jenkins/secrets/initialAdminPassword
Click on install the suggested plugins
  
Go ahead and enter your username and stuff as asked
  
You are ready to use Jenkins🎉
NOTE: Some extra installations
  1. Install git
  sudo yum install git -y
2. Install maven
sudo wget https://repos.fedorapeople.org/repos/dchen/apache-maven/epel-apache-maven.repo -O /etc/yum.repos.d/epel-apache-maven.reposudo sed -i s/\$releasever/6/g /etc/yum.repos.d/epel-apache-maven.reposudo yum install -y apache-maven# check the installationmvn --version
Manage some Plugins in Jenkins
Since we are deploying a maven project, go ahead and select Manage Jenkins from the left and then Manage Plugins.
  
Search for maven in the available plugins section and install without restart.
  
Install another plugin names Deploy To container for deployment
This plugin allows you to deploy a war to a container after a successful build.
  
Next, go to Manage Jenkins>Global Tool configuration>Maven and opt for automatic installation
  
Security Group for Tomcat
Follow the same steps, as followed in creating a security group for Jenkins, with some minor changes.
Give the name Tomcat-Security-Group to the security group.
One inbound rule has to be changed
  
We will use port no. 8090 for accessing it on the browser.
And then Create🚀
Tomcat Server
Now let's launch an instance by clicking on 'Launch instances'
Enter the name Tomcat-Server
Choose the Amazon Linux AMI.
Select the size of the instance t3. micro or anything that lies in the free tier.
Let's create a key pair for your instance, naming it 'Tomcat-Key' and download it.
Choose Tomcat-Security-Group
  
And Launch it 🎉
Connect to the Tomcat Server using SSH
Connect to it through SSH, following the same steps as followed in the Jenkins server by clicking on Connect.
Here we can see both instances are running!
  
Install Tomcat on your Instance
Go to https://tomcat.apache.org/download-80.cgi and copy the tar.gz file link.
  
Move to the opt directory and download the tomcat package
  cd /opt  sudo wget https://downloads.apache.org/tomcat/tomcat-8/v8.5.85/bin/apache-tomcat-8.5.85.tar.gz.asc
Do ls in the opt directory to see the file, you will see the following file
  
Now we have to unzip and untar the package with a single command, i.e.
  sudo tar -xvzf apache-tomcat-8.5.85.tar.gz
Let's just rename the file as tomcat, so it will be easy for us to access it
  sudo mv apache-tomcat-8.5.85.tar.gz tomcat
Move into the file "apache-tomcat-8.5.85.tar.gz" and do ls
  
Now move to the bin folder
  
Now the startup.sh is used for starting the tomcat server and shutdown.sh for shutting down.
  ./startup.sh
By default, tomcat will run on port 8080, we have to change it to 8090 by
  cd /conf  vi server.xml
Change the connector port to 8090
Shut down the server by ./shutdown.sh and then start again.
  
Also, let's add some users to tomcat with different roles
Do cd /conf and edit tomcat-users.xml
  
Next thing is, we have to find the context.xml for solving this error when we click on manage App
  
This error comes because tomcat only allows access from the local system, but we want to access from outside as well
  find / -name context.xml
We will consider files that are under webapps.
We have to comment out the valve command as it only allows access from local systems.
Do the following with the last two files
  vi /opt/apache-tomcat-8.5.85/webapps/host-manager/META-INF/context.xml  vi /opt/apache-tomcat-8.5.85/webapps/manager/META-INF/context.xml
  
Add tomcat credentials to Jenkins
Go to Manage Jenkins>Manage Credentials
  
Click on global Credentials and then Add Credentials
Remember, we added a user with a role in tomcat
  
Add the username and password as given and then create.
  
Create a Job for deploying the Maven Project
Click on create a job.
Here is the GitHub Link to the project we are using
Fork the repo
Enter a name for the project/job and select maven project
  
In the Source Code Management, choose git and enter the repository URL, by clicking on code
  
  
Since all the code is in the master branch in the given project, we will use the master branch.
  
Under the build section, we usually give these three options, i.e., "clean install package with the pom.xml file already present there.
pom.xml  It is an XML file that contains information about the project and configuration details used by Maven to build the project.
  
Now choose the option Deploy war/ear to a container
  
Next, we have to specify the path of the war file, or we can just write **/*.war so that Jenkins will find the file having the type .war in that particular workspace.
  
Now in the container section, choose tomcat with its latest version i.e., tomcat 9
Also, choose the credentials from the dropdown that we just created, i.e., deployer
Next, copy the URL of the tomcat server, on which you can access it from the browser
  
  
Now save and click on Build Now
  
  
Yay! 🚀 that's a SUCCESS!
This is where your war file will be copied, i.e., in the webapps directory
  
Now we can access our app on the browser by adding "/webapp" in the URL
  
So whatever was written in the index.jsp will be visible here!
You have successfully deployed a maven application to a tomcat server🎉
Don't forget to like and share this post. Connect with me on Twitter. Follow me for more such blogs on Hashnode.
Follow Kubesimplify on Hashnode, Twitter, Instagram and LinkedIn. Join Discord server to learn a lot more stuff.


StatefulSets
Srinivas Karnati — Thu, 16 Feb 2023 17:12:10 GMT
Before getting into what Statefulsets are, let us first talk about what Stateful and stateless applications are.
Stateful and Stateless applications
Stateful apps keep track of the session(state) details of the previous transactions that happened and they will behave differently based on the previous state of the application.
While the stateless applications only rely on the clients to have some session data but the server itself doesn't store any session data.
Note: The term "State" means multiple things in multiple contexts. In this particular context, it mostly refers to the session details or some authorization tokens.
To understand more, let's take an example of a banking application that is built in both stateful and stateless architecture.
Assume that you want to make a transaction of $1000 from your bank account using the application. What are the steps that are involved?
User has to log in using their credentials.
Choose the transaction and enter the amount to transfer.
Confirmation of the transaction.
The transaction is marked as complete.
Let us perform all the steps mentioned above in both stateful and stateless architectures.
In a stateful way, the user enters his credentials, the server verifies the credentials in the auth server, and authentication is successful. As the application is built in stateful architecture, the state is stored in the server.
Then the user is prompted with a transaction page which might or might not be on the same node(server) as the auth server, but as the state(auth details) are already stored in the server the request will be processed without any hassle. Same for the confirmation stage, it will get auth details from the state store. Our user performed his transaction successfully.
In a stateless way, the server verifies the credentials in the auth server, and authentication is successful. As the application is built in stateless architecture, the state is not stored in the server.
But when the user prompts to the transaction page which might not be the same server as auth server. As auth details (state) arent stored in the server, the user fails to authenticate. The login screen appears again, this will continue to repeat which results in difficulty to make a transaction.
Note: The operations are simplified and can't be compared with real-world cases.
I hope now we have some idea about what stateful and stateless applications are. Lets see what StatefulSets are, how they are different from Deployments, and how to create one.
StatefulSet
"StatefulSet is a Kubernetes API object that is used to manage stateful applications"
Coming to deploying applications in Kubernetes, we already have Deployments which is very useful. Then why we would need StatefulSets? What is the added advantage that we get from using StatefulSets?
Issues with Deployments
Yes, deployments are very useful to deploy applications, they provide replicas, and make rollbacks and updates easy. But deploying Stateful applications using deployments comes with some issues as follows.
Pods created with Deployments dont provide Persistent identity
All pods created with Deployment share the PV
Scaling up and Scaling down in deployment is instantaneous
For better understanding let's create a sample deployment and discuss the issue mentioned above.
In this example, Im deploying a Redis image with 2 replicas with PV attached to it. Create the Deployment using kubectl apply -f deployment.yml.
You can find the manifests here
Pods dont have a persistent identity
By default, the pods that are created with deployment names are as follows: Deployment name- [random hash number]. This inconsistent naming convention makes the database connections unreliable which needs to be reliable in the case of Stateful applications.
All Pods share the same PV
All the replicas that are created with Deployment share the same Persistent Volume( if provided). This makes the whole application prone to downtime considering the cases where the PV can crash. You can see which pod has attached to which PV using the following command.
kubectl get po -o json --all-namespaces | jq -j '.items[] | "\(.metadata.namespace), \(.metadata.name), \(.spec.volumes[].persistentVolumeClaim.claimName)\n"' | grep -v null
Scaling up and Scaling down
In Deployments, the Scaling up and Scaling down are instantaneous. This means all the pods are scheduled at the same time and also during the Scale down all the pods terminate at the same time.
Scaling up the deployment using kubectl scale deployment/redis-cluster --replicas=10
  
Now try to scale down the deployment using kubectl scale deployment/redis-cluster --replicas=1
As you can see this instant scaling up/scaling down can cause an abrupt change in the stateful application and can cause downtime.
StatefulSets
Lets see how statefulset deploys an application and whether will it solve the issues that we got from Deployments or not.
A StatefulSets provides a persistent identity to the pods that they create and manage. The StatefulSets are mostly used for deploying Stateful applications where we require a unique network identifier or Storage.
StatefulSets also guarantees the ordering of the pod deployment and its scaling. StatefulSets are very helpful while deploying applications where you need database clustering ( in which you need to know the hostname of each server).
StatefulSet Yaml Manifest:
apiVersion: apps/v1kind: StatefulSetmetadata:  name: redis-clusterspec:  serviceName: redis-cluster  replicas: 2  selector:    matchLabels:      app: redis-cluster  template:    metadata:      labels:        app: redis-cluster    spec:      containers:      - name: redis        image: redis:5.0.1-alpine        ports:        - containerPort: 6379          name: client        - containerPort: 16379          name: gossip        command: ["/conf/update-node.sh", "redis-server", "/conf/redis.conf"]        env:        - name: POD_IP          valueFrom:            fieldRef:              fieldPath: status.podIP        volumeMounts:        - name: conf          mountPath: /conf          readOnly: false        - name: data          mountPath: /data          readOnly: false      volumes:      - name: conf        configMap:          name: redis-cluster          defaultMode: 0755  volumeClaimTemplates:  - metadata:      name: data    spec:      accessModes: [ "ReadWriteOnce" ]      resources:        requests:          storage: 50Mi---apiVersion: v1kind: Servicemetadata:  name: redis-clusterspec:  clusterIP: None  ports:  - port: 6379    targetPort: 6379    name: client  - port: 16379    targetPort: 16379    name: gossip  selector:    app: redis-cluster
StatefulSet requires a Headless Service in order to route the traffic to pods and to be accessed.
Pods and their identity
Pods that are created using StatefulSet are named as [Statefulset-name- (ordinal number)]. The ordinal number ranges from 0 to N and the number defines the order of creation.
This Standard naming convention provides a persistent identity to the pods and makes them reliable for use in database connections.
Pods have separate PV
Every Pod that is created with Statefulset gets its own Volume. You can verify this using the following command.
kubectl get po -o json --all-namespaces | jq -j '.items[] | "(.metadata.namespace), (.metadata.name), (.spec.volumes[].persistentVolumeClaim.claimName)\n"' | grep -v null
Scaling up and Scaling Down
Stateful sets handle the Scaling very gracefully, It will ensure that the second Pod will not be created unless the first pod is up and running. (0 to N)
In the same manner, during the Scale down, the Pod with the highest ordinal number terminates first and the second highest pod only starts terminating after that. (N to 0)
Let's scale up our stateful set from 5 to 10 pods, you can do that by using kubectl scale sts/redis-cluster --replicas=10
  
As you can observe in the above pictures, the statefulsets scale the pods one by one, and the scaling down can be achieved by using kubectl scale sts/redis-cluster --replicas=1 .
Note: StatefulSets is a great option to run your Stateful applications but by default, Kubernetes doesnt enable the database clustering for your database. Clustering is necessary for some databases in order to ensure all the volumes (databases) maintain the replica of the data.
Follow Kubesimplify on Hashnode, Twitter, Instagramd and Linkedin. Join our Discord server to learn with us.


Become a Hashicorp Certified Terraform Associate - Preparation Guide
Kunal Verma — Mon, 13 Feb 2023 12:30:39 GMT
Introduction
Recently, I appeared for the HashiCorp Certified Terraform Associate exam after somewhere around 2 months of preparation. I am very glad to share that I passed it with a score of 85%, which is a pretty nice one considering the fact that, it is my very first certification in the DevOps ecosystem.
Now, in this particular blog, we'll dig a bit deeper into this certification, and what exactly you need to pass it and I'll be sure to share my tips and experience along the way.
So, if you are someone looking forward to preparing or give this exam in the coming future, you are in the right place today!
Getting familiar with IaC and Terraform
Before we even start to discuss the exam, it's important to be familiar with the technology around which this certification revolves, which is Terraform.
If we look at the official definition of Terraform by HashiCorp, it says that:
HashiCorp Terraform is an infrastructure as a code tool that lets you define both cloud and on-prem resources in human-readable configuration files that you can version reuse, and share.
According to me, this is a very precise definition of Terraform, but there are a few terms here that we need a bit more clarity on. Let's start with the most basic one, What is Infrastructure as Code? Infrastructure as Code (IaC), as the name suggests is an approach to managing your cloud-native infrastructure through code. Traditionally, If we talk about managing our infrastructure over different cloud providers or on-prem platforms, there was heavy use of the UI or some kind of management console to make changes, monitor and troubleshoot the resource throughout the entire application lifecycle. This was apt if we didn't have to manage large-scale infrastructure and there was a relatively limited scale of deployment. Today, as the scale of infrastructure is much, much higher, the approach of managing the infrastructure through a codified way is what is being adopted in the ecosystem, and this is what we call Infrastructure as Code.
There are a lot of tools out there that serve the purpose of creating that codified ecosystem throughout the infrastructure lifecycle and Terraform is the Infrastructure as Code (IaC) offering from HashiCorp.
Terraform, an infrastructure as a code tool, lets you define both cloud and on-prem resources in human-readable configuration files that you can version, reuse, and share. These configuration files are written in a special configuration language called the HashiCorp Configuration Language (HCL). Here is a nice example of provisioning an AWS EC2 instance through Terraform, using the HCL syntax:
resource "aws_instance" "web" {  ami           = "ami-005e54dee72cc1d00"  instance_type = "t3.micro"  tags = {    Name = "My_Server"  }}
To learn more about the HCL language, you may refer to the documentation.
💡 Tip:
As HCL is a JSON-based variant, the syntax of the two looks very similar. HCL is comparably easier to parse than other configuration languages such as YAML or XML.
To learn more about the HCL syntax, you can refer to the README on GitHub.
There are a lot of features and use cases that make Terraform stand apart from other IaC tools out there such as Ansible, Chef, Pulumi, Crossplane, etc, but the one that comes on top is being "Cloud-agnostic" i.e. the ability to provision resources across a multi-cloud infrastructure and handling cross-cloud dependencies with a single configuration file.
To know more about various use cases of Terraform, refer to the documentation.
Who is a Terraform Associate?
If we look at the official introduction to the certification:
The Terraform Associate certification is for Cloud Engineers specializing in operations, IT, or development who know the basic concepts and skills associated with open-source HashiCorp Terraform.
Most simply, a Terraform Associate is someone who can use Terraform to manage and provision infrastructure in a variety of real-world scenarios. This is a fundamental level certification where the main aim is to validate that you have a working knowledge of Terraform and the ability to use it in production environments.
From my experience of preparing for this exam, I'd say it's either for those cloud engineers that specialize in operations/IT, or for those developers who know or would like to explore and learn the basic concepts, and skills associated with open-source HashiCorp Terraform.
Pre-requisites - Before preparation
Now that we have a clear understanding as to who is this certification for and what the aim behind it is, we can now move towards our preparation phase. The very first step is to learn about the pre-requisites required, and according to HasiCorp, those are as follows:
Having a basic knowledge of using the Linux Terminal
Linux Course by Kunal Kushwaha
Linux & Docker Fundamentals Workshop by Chad M. Crowell
Having a basic understanding of how on-premises and cloud architecture works
Cloud 101: An Introduction to Cloud Computing
Apart from this, it is recommended to have a sound knowledge of using at least one of the public cloud providers such as AWS, Azure, GCP etc. Now, this is something where many folks tend to get stuck while starting to prepare for the certification. You need to focus on these words here - "it is recommended to have sound knowledge", this does not mean that one has to be a PRO in using a particular cloud provider, before starting to learn about Terraform. Having an experience with a cloud provider would prove to be helpful, but even if you are just familiar with the basics of one provider, let's say AWS, that shouldn't stop you from learning Terraform and you can always learn new cloud concepts along the way!
If I talk about my preparation, I only knew some very basic concepts in AWS such as working with EC2, and a little bit of S3 (database) and that's it. Along with learning about Terraform, I learned multiple new concepts like VPC, CIDR Blocks, AWS Secrets Manager etc. And not only in AWS, but I also gained basic experience working with Azure and Google Cloud as well as I learned to provision infrastructure over a multi-cloud environment.
I hope this gives you some confidence and now let's move forward with learning more about the exam and how you can prepare well.
About the Exam - What to Expect
General Exam Details
Here are some important exam details to keep in mind:
This is a multiple choice exam having a total no. of questions = 57.
The duration of the exam is 1 hour and in my opinion, this is a very apt time for attempting all the questions.
A minimum score of 70% is required to pass the exam.
The exam costs you $70 with no free re-take included. So, you've gotta complete it in a single take.
If you have successfully cleared the exam, the certification would be valid for 2 years.
Exam Objectives - Different areas to focus on
In other terms, you can call this the "syllabus" of the exam. As I previously mentioned, the main aim of this certification is to validate that you have a working knowledge of Terraform and an overall understanding of the Infrastructure as Code ecosystem. So, here are the different focus areas into which the exam is divided:
Understand infrastructure as code (IaC) concepts
Understand Terraform's purpose (vs other IaC)
Understand Terraform basics
Use the Terraform CLI (outside of core workflow)
Interact with Terraform modules
Navigate Terraform workflow
Implement and maintain state
Read, generate, and modify the configuration
Understand Terraform Cloud and Enterprise capabilities
Now, there are a lot of sub-topics that come under these main focus areas, which you can learn more about from the official certification page.
From my experience of giving the exaddm, it's very well balanced between these 9 areas and while preparing, you need to spend adequate time learning these topics. I will point out some key areas to focus on in the upcoming sections as well.
A Practical Approach to your Preparation!
Now that we have a fair level of understanding about the prerequisites, the exam structure, and the main focus areas, we can now start our preparation!
📍 NOTE:
The resources and points mentioned below are things that worked for me in my preparation and might not work for you, but you'll learn quite a lot in the process, that's for sure!
Resources
1. Study Guide by HashiCorp
One of the best resources to learn about any technology out there is its official documentation and the same is the case with Terraform. For this certification, HashiCorp has done a phenomenal job in structuring the official documentation in form of a study guide to help you prepare for this exam.
The Study Guide is kind of a roadmap created by HashiCorp to help you prepare for this certification exam. Each objective (which are mentioned above) with their respective sub-topics are mentioned in the form of links, which lead to separate sections of the official documentation, focusing on that particular topic. All the resources mentioned here are in the order of difficulty so that one should be able to track their progress throughout the preparation.
One of the major highlights of the study guide is links to the tutorial section where you can follow the step-by-step guide and get that essential hands-on experience while learning the concepts (which is very important).
To get a more summarized view of the study guide, you can also check out the Exam Review section as well.
Along with the study guide, you also have a section for Sample Questions where you can get a gist of the format of the questions asked in the exam.
📍 NOTE:
These sample questions aren't enough to get that full-fledged exam practice experience that you most definitely need. We will talk about that later in this section.
2. HashiCorp Terraform Associate Certification Course - By Andrew Brown
If you are someone who prefers to learn through videos or consume visual content, this one's for YOU. A complete knowledge-packed course by Andrew Brown on FreeCodeCamp is a very well-structured resource for anyone starting to prepare for this certification. Andrew Brown has done some amazing work on this 13+ hour-long course which includes both in-depth explanations of Terraform concepts and hands-on sections after each concept (which were my favourites).
Overall, both these resources have the full potential to get you exam ready in terms of the concepts and covering the exam objectives. Now, for the folks who'd be wondering, what would I recommend here? So, I combined the power of both.
Study Guide + FreeCodeCamp course == A Killer Combination 🔥
This is a personal choice and you can go for any resource mentioned, but remember that - documentation plays an important role in the exam, so please don't skip that!
3. Practice Exams - To get you Exam Ready!
The more I come back to this particular phase, the more it reminds me of how important it was during my preparation. Practice Exams are something which would connect you with what potentially could be asked in your exam and in a way validates your overall preparation at the same time.
HashiCorp Certified: Terraform Associate Practice Exam 2023 by Bryan Krausen on Udemy provides you with a set of 6 practice exams, all following the exact pattern as the main exam:
57 total questions
1hr duration, and
70% required to pass.
These are again, essential for validating your overall preparation and would help to point out any area or topic that requires further practice or revision. These practice exams helped me to track my preparation and I could see myself improving with each exam.
To give you a better idea of this, here is a table that I prepared to track my progress with each exam I gave:
Notice the very first exam I gave here. I got a 61% and that means I failed it! Though this was a bit de-motivating at first, now, I knew the areas that required more attention. So, I focused more on those and the results kept on improving with each.
💡 Tip:
After giving every practice exam, it's important to ask these two questions:
Which questions went wrong and why?
Which of my answers was right and why?
Trust me, with each exam you'll see yourself improving and becoming more confident!
4. Personal Notes on Notion
While going through the course and the documentation, I have prepared notes of all the concepts that I learned along the way. Now, these are something very informal and are not sufficient for your preparation, but would give you a nice overview of all the concepts involved!
You can access all the notes through this Notion page. Also, if you wish to access the code files as well, here is the GitHub Repo.
📍 NOTE:
I'll soon be uploading all the notes on the GitHub Repo as well, to make them more accessible to the community!
Exam & Study Tips
Throughout the blog, I've provided some short tips in each section to help you with your preparation. Here are a few more, cuz why not :)
The more hands-on practice you do for a topic, you'll have a firm grip on that particular concept.
Googling and StackOverflow are your friends if you get stuck into any issues
As I mentioned previously, making notes while learning helped me out in terms of remembering the concepts. Try to follow this practice as you learn and notice the results!
If you can, try to make 1-2 small projects using Terraform. You can even add Terraform to an existing project on GitHub. Again, the aim is to get that hands-on experience.
Learn in public! Commit to sharing your daily learnings with everyone in the community and you'll gain some nice insights and make new connections.
Conclusion
I hope that this blog post proves to be the starting point for your journey to becoming a certified Terraform Associate and I tried my best to take you through my journey along the way.
If you have any further questions regarding the certification, feel free to reach out on Twitter.
All the best for your preparation and the exam!
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Understanding etcd in Kubernetes: A Beginner's Guide
Srinivas Karnati — Sat, 11 Feb 2023 12:30:39 GMT
Etcd is a key-value data store used to store and manage the critical information that distributed systems need. It provides a reliable way of storing the configuration data. In this post, we will see a close look at etcd, why it is needed, and how to access its contents in Kubernetes.
What is etcd?
Etcd is an open-source distributed key-value store that is used to store and manage the information that distributed systems need for their operations. It stores the configuration data, state data, and metadata in Kubernetes.
The name etcd comes from a naming convention within the Linux directory structure: In UNIX, all system configuration files for a single system are contained in a folder called /etc; d stands for distributed.
Why Kubernetes needs a data store?
We know that Kubernetes is an orchestration tool whose tasks involve managing application container workloads, their configuration, deployments, service discovery, load balancing, scheduling, scaling, and monitoring, and many more tasks which might spread across multiple machines across many locations. Kubernetes needs to maintain coordination between all the components involved.
But to achieve that reliable coordination, k8s needs a data source that can help with the information about all the components, their required configuration, state data, etc. That data store must provide a consistent, single source of truth at any given point in time. In Kubernetes, that job is done by etcd. etcd is the data store used to create and maintain the version of the truth.
But why etcd?
As it sounds, it is not a small task to act as a single point of truth for application workload. But what makes etcd worth using?
Fully replicated: Every node in an etcd cluster has access to the full data store.
Highly available: etcd is designed to have no single point of failure and gracefully tolerate hardware failures and network partitions.
Reliably consistent: Every data read returns the latest data write across all clusters.
Fast: etcd has been benchmarked at 10,000 writes per second.
Secure: etcd supports automatic Transport Layer Security (TLS) and optional secure socket layer (SSL) client certificate authentication.
Image from etcd.io
In general, etcd is deployed as a cluster spread across multiple nodes. It is recommended for a cluster to contain an odd number of nodes, and at least three are required for production environments.
So if we have multiple etcd nodes, how the data consistency will be maintained?
etcd is built on the Raft consensus algorithm to ensure data storage consistency across all nodes in a cluster for a fault-tolerant distributed system.
Raft consensus algorithm
In the raft algorithm, the data consistency is maintained via the leader, which will replicate the data to other nodes in a cluster called followers.
The leader accepts requests from clients/users, then will forward them to followers. Once the majority of followers sent back an entry made acknowledgment, the leader writes the entry. If followers crash, the leader retries until all followers store the data consistently.
If a follower fails to receive a message from the leader, a new election for the leader will be conducted.
You can find great animation explaining about raft algorithm here: http://thesecretlivesofdata.com/raft/
Etcd and Kubernetes in action
In the Kubernetes cluster, etcd is deployed as pods on the control plane. To add a level of security and resiliency, it can also be deployed as an external cluster.
For this post, I am using the kind cluster. When kind is used to install the cluster, it will also install etcd as pod in the kube-system namespace.
We can find multiple pods in the kube-system namespace, but what we are most interested is etcd-kind-control-plane which is running the instance of etcd and it is used to store the state of the cluster.
Interact with etcd
The following command helps to interact with the etcd-kind-control-plane pod through kubectl exec. And ETCDCTL_API is the API version through which we want to interact with etcd --cacert, --key and --cert is for TLS certificates that we will get from executing the describe command present above and get / --prefix --keys-only will give all the keys present in etcd.
kubectl exec etcd-kind-control-plane -n kube-system -- sh -c "ETCDCTL_API=3 etcdctl --cacert /etc/kubernetes/pki/etcd/ca.crt  --key /etc/kubernetes/pki/etcd/server.key --cert  /etc/kubernetes/pki/etcd/server.crt  get / --prefix --keys-only" > etcdkeys.txt
ETCDCTL_API is the API version we use for etcd to interact with it. --cacert, --key and --cert is for TLS certificates that we need and get / --prefix --keys-only will give all the keys present in etcd.
The above interaction with the etcd pod gave me around 277 keys, which will define the configuration and status of all resources in the cluster.
So now lets create a pod with nginx image, and we will see what happens in the etcd cluster.
So run kubectl run my-pod --image=nginx which basically pulls and runs the nginx image. We use the same command that weve used previously to get all the keys that are stored in etcd, and we will store it into a file called etcd-after-pod.txt .
kubectl exec etcd-kind-control-plane -n kube-system - sh -c ETCDCTL_API=3 etcdctl --cacert /etc/kubernetes/pki/etcd/ca.crt --key /etc/kubernetes/pki/etcd/server.key --cert /etc/kubernetes/pki/etcd/server.crt get / --prefix --keys-only > etcd-after-pod.txt
A comparison between the two files, one before pod creation and one after pod creation, shows me the following.
Several new events were generated. We have 6 events generated specifically for our pod my-pod. Lets take a closer look at those events.
The following command gives you an JSON output for the event registry/events/default/my-pod.173bb0a9bbbda0b6. But by default, all the values of etcd are encoded.
kubectl exec etcd-kind-control-plane -n kube-system --sh -c ETCDCTL_API=3 etcdctl --cacert /etc/kubernetes/pki/etcd/ca.crt --key /etc/kubernetes/pki/etcd/server.key --cert /etc/kubernetes/pki/etcd/server.crt get \/registry/events/default/my-pod.173bb0a9bbbda0b6\ -w json
kubectl exec etcd-kind-control-plane -n kube-system -- sh -c ETCDCTL_API=3 etcdctl --cacert /etc/kubernetes/pki/etcd/ca.crt --key /etc/kubernetes/pki/etcd/server.key --cert /etc/kubernetes/pki/etcd/server.crt get \/registry/events/default/my-pod.173bb0a9bbbda0b6\ -w json | jq .kvs[0].value| cut -d  -f2 | base64 --decode
If we decode the value associated with the key, to return output is also not that much readable, but we can understand some of it.
In the above result, you can find some interesting informationstarted container my-pod. I have decoded all the pod events, and these are the events that occurred in chronological order.
Scheduled":Successfully assigned default/my-pod to kind-control-plane*Pulling"Pulling image "nginx"*Pulled"2Successfully pulled image "nginx" in 35.579712695sCreated"Created container my-pod*Started"Started container my-pod*
The last key, /registry/pods/default/my-pod, gives all the information related to the newly created Pod :
The last applied configuration
Its token
Its status etc.
Memory etc...
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


GitOps - Demystified
Nourhan Khaled — Thu, 09 Feb 2023 12:30:39 GMT
Having had first-hand experience with both lives before GitOps and after, I can easily say managing Kubernetes applications without GitOps feels like juggling with one hand tied behind your back. Adopting GitOps isn't just about putting an end to the single-source-of-truth conundrum, it opens a whole new plane of automation, enabling you to ship applications faster and more reliably than before. In this blog, the what and the why of GitOps.
What is GitOps?
As with any new buzzword or term that comes out, a bunch of definitions surface, which causes some haze and confusion around the topic. Simply put, GitOps is a framework in which anything and everything done in your cloud native application is done through git and git alone. Its considered CD for cloud native applications.
What GitOps is NOT:
GitOps is not a single tool. Its a framework, i.e, a set of practices which if you implement, whatever the tool, then youd be implementing GitOps.
GitOps is not only for kubernetes using the same rationale as the previous point. But because the most widely used GitOps tools are built specifically with kubernetes applications in mind, it might lead people to think that GitOps is exclusive to managing kubernetes applications.
GitOps is not synonymous with infrastructure as code. Infrastructure as code is a way to manage your infrastructure. GitOps on the other hand is a way to manage the entire cloud native stack. You can use GitOps to manage your infrastructure as well as the applications deployed on it.
The 4 Principles of GitOps:
The entire system is described declaratively as code.
 A declarative system is described as a set of facts as opposed to a set of instructions, i.e., you just state WHAT you want without caring about HOW its done. When writing the manifests for our kuberernetes resources, thats exactly what we do. For example, when creating a deployment, we state the image we want to deploy, the number of replicas, the allocated resources..etc, and we dont worry about HOW this will be executed.
The desired state is versioned in Git.
 This principle is the "Git" in GitOps. To implement this principle, any changes introduced to the cluster would be committed to Git. So at any point in time, if you want to know the state of your cluster, you would simply check your git repository, which puts an end to the single source of truth conundrum.
Automatically apply approved changes
 To enforce the previous principle, we would want to make sure that whatever is committed to the git repository is actually whats deployed on the cluster, which is where this principle comes in. Having any changes in git automatically applied to the cluster is done by means of software agents that are deployed on the cluster. These agents or controllers constantly monitor any changes added to the repository and apply them to the cluster.
Drift Consolidation
 At the core of GitOps is this notion of drift consolidation. If at any point in time, for any reason, there is a change or drift between the configuration defined in git and the state of the cluster, the state is reconciled back to the configuration defined in git, which now serves as our single source of truth. I like to think of this as having a control loop for your application deployments.
Standard CI/CD vs GitOps
The standard CI/CD workflow is a push-based model. You push the changes to the manifests on git, then after granting your CI server access to the cluster, you would configure it to run a pipeline applying the new configuration with kubectl or helm.
 Standard CI/CD: push-based, CI server applies changesUp until the third principle, it would seem that a push-based deployment model would be implementing GitOps, after all, its source is Git and the CD is triggered through git, right? But let's clarify the difference.
GitOps is pull-based, i.e., the cluster state is updated by pulling the desired state from the repository and then applying it. The agent running on the cluster does that periodically, not just on push. So, the glaring difference between the push-based deployment model and GitOps is in having that control loop that's always running, observing and syncing the state of the cluster with the state of the repository all the time.
GitOps flow: pull-based, changes pulled from repository and applied by software agent which is on the cluster. Why GitOps?
It's important to weigh out the benefits and drawbacks of any adopted framework before riding the wave and then wondering what has ever got you in that mess in the first place. So by now, you might be wondering, why do we even need GitOps? CI/CD did the job, Kubernetes is complicated enough without introducing a new framework. Well, that would be very valid if we live in an ideal world, but if I've learned anything during my time working with Kubernetes, it's that Murphy's law is real.
"Anything that can go wrong will go wrong." - Murphy's Law.
A namespace is bound to go mysteriously missing, or someone is going to modify something on the cluster and forget to commit it to git, mistakes are inevitable, and it's good to be proactive.
So here are just some of the benefits of GitOps:
Easier and quicker error handling and recovery.
  Imagine you're playing around in the cluster - as we all do - and you accidentally modify or completely remove a resource without noticing, GitOps will save your life. The software agent will pick up on the difference between the desired state of the cluster defined in the git repository and the actual state of the cluster and will automatically reconcile the state of the cluster to match the desired state and undo whatever changes you didn't intend to introduce.
  Now imagine another scenario, where the resource you apply introduces an unintended bug. If you're using GitOps, you'd be pushing changes simply by pushing your commits, so reverting changes would be as easy as a simple git revert.
Beyond recovering from disasters, a lot of perks come with adopting GitOps
Deploy faster and more often.
  GitOps really delivers on the speed of application delivery. Once the new feature is developed and tested, there is no "post merge" step to deliver, once you merge your latest changes and push those commits to the branch being synced to the cluster, changes are automatically pushed and applied.
Self-documenting deployments and shared knowledge in the team.
  Another cool feature that we end up with when using GitOps is that the state of the cluster is directly reflected by the git repositories. So the entire development team can easily find out exactly what is deployed at any point in time just by checking the repo. Developers no longer have to access the cluster to find out which version of a certain helm chart is deployed or if their latest release has been deployed yet or not. Additionally
Easier credential management.
  Because application delivery is now pull-based, we no longer have to give our CI server full access to the cluster to apply changes as we did before GitOps. And by extension, the same applies to giving cluster access to developers. Creating, updating or deleting resources can now be managed entirely through git.
Now that you know some of the benefits of GitOps, you can start evaluating whether it's for you or not. If you're just one or two developers working on a non-critical test cluster, you may not need GitOps, but you may start considering implementing it when the team and/or applications start to scale.
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Understanding the Architecture of Kubernetes: A Beginner's Guide
Nitish Kumar — Mon, 30 Jan 2023 12:30:39 GMT
Kubernetes is a powerful container orchestration system that has taken the world of cloud computing by storm. With its ability to manage and scale containers across multiple hosts, Kubernetes has become the go-to platform for running containerized applications in production. In this blog post, we'll take a closer look at the architecture of Kubernetes and how it works to manage and scale containerized applications.
Take a closer look at the below diagram (especially the arrows & lines). We'll take a closer look at what these arrows represent, what is an API server, what is the line that is pointing to the API server from Kubelet, what is the difference between the control plane & worker node, why is there a need for a control plane and many more such questions shall be entirely answered by the end of this blog.
Before understanding the architecture, let's first look at the terminologies used widely in the Kubernetes ecosystem.
Node: A Node in Kubernetes is a representation of a single machine in your Kubernetes cluster. In production, a node will most likely be a physical machine at a data center or a virtual machine hosted on the cloud.
Pods: Kubernetes does not run containers directly. Instead, it runs containers inside a spherical body called Pods.
Inside your Node, runs the pod. Inside your Pod, runs the containers. Inside your container, runs the application.
  
Cluster: It is a group of nodes, both physical & virtual, which is used to run the containerized application.
Let's suppose I've 3 nodes that are used to run a containerized application. The grouping of all these nodes to create a more powerful machine is what we call a cluster.
Let's try to understand the architecture of Kubernetes - what is happening behind the scenes?
Look at the above image carefully. We've grouped three nodes (1 Control plane & 2 worker nodes) which is called a Kubernetes cluster. You can create as many nodes as you wish, depending on the application requirement.
In any Kubernetes cluster, two types of nodes are present:
One or more Control Plane nodes (also called a Master node)
One or more Worker nodes
Now if you look above, you'll find that each node is having some components associated with it. The Control Plane node has some components such as API server, Scheduler, etc. whereas Worker nodes have some components such as kubelet, kube-proxy, etc. We'll take a look at the role of each component in a specific node.
The worker nodes are responsible for running your application (as you can see pods are present inside the worker node) whereas the control plane node is responsible for managing your cluster operations such as starting the cluster, adding new nodes to the cluster, removing pods, scaling pods and much more.
Hence without a control plane node, your Kubernetes cluster won't work. It is important to keep the control plane running at all costs.
Let's understand the usage of each component of the control plane
API Server: This is the brain behind all the operations in a Kubernetes cluster. To interact with the cluster, all the requests are sent to the API server. The API Server intercepts RESTful calls from users, administrators, developers, operators, and external agents, then validates and processes them. Whenever a request has been sent to the API server by the user, the API server performs three tasks:
Authentication: Authenticates the user
Authorization: Authorizes the request made by the authenticated user (using RBAC)
Admission control policy: applies certain rules on Pods to run
Scheduler: The main role of the scheduler is to assign the pods to the nodes. Let's suppose, you request the API server via the command line mentioning running a pod (or a container). That request will be received and forwarded to the scheduler after authentication, authorization & admission control policy so that the scheduler can find the best node (worker node) to run a pod inside of it.
 The scheduler determines the valid nodes for the placement of a node in the scheduling queue, ranks each node based on resources available and required, and then binds the pod to a specific node.
Controller Manager: The controller manager is the component of the Kubernetes control plane node that regulates the state of the Kubernetes cluster by running controllers or operators. Controllers are watch-loop processes that compare the cluster's desired state with the cluster's actual state. But where does this actual state being stored? Etcd store.
Key-Value data store(etcd): All the cluster-related information is stored inside etcd. It is important to note that the application data is not stored in the etcd store. New data is written in the etcd store by appending and not by overriding. Obsolete data (incorrect data) is compacted (or shredded) periodically to minimize the size of the data store. etcd is based on Raft Consensus Algorithm. Remember, when we discussed above, that the scheduler selects a node to run a pod inside of it? Here's the actual process that happens:
A request is sent by the client to the API server to run a pod
API server validates the user & request
The request is passed to the scheduler to select a node.
After receiving the request, the scheduler sends a request to the API server to get the cluster-related information like resources available etc.
API server is the only component that can read and write data in the etcd store. No other component can connect directly with the etcd store.
API server after receiving the information informs the scheduler
Based on the information received by the API server, scheduler binds the pod to a node.
Now you might ask me, "Does this mean that pod is running?"
The answer is No.
Till this time, the correct node has been selected on which a pod should run but a Pod is not running yet.
Cloud Controller Manager(CCM): The CCM is responsible for running the controllers or operators to interact with the underlying infrastructure of a cloud host provider when nodes become unavailable.
Now try to go through these control plane components once again and then move forward.
It's time to understand the usage of each component on the worker node:
A worker node provides an environment to run a containerized application. The components present inside the worker node are:
Kubelet
kube-proxy
container runtime
Addons or DNS
Container runtime: Although Kubernetes is regarded as a container orchestrating tool it cannot run containers directly. Hence, a container runtime is needed on a node where a pod is scheduled to manage a container's lifecycle. It is important to note that container runtime are present on both nodes - the control plane and worker. Kubernetes supports several container runtimes, which are mentioned below:
CRI-O
containerd
Docker
Mirantis Container Runtime
  
Kubelet: Like container runtime, kubelet is present on both the nodes - the control plane and the worker node. To run a pod that is present on a worker node, there should be some component that must communicate with the control plane to run a pod. Kubelet is that component. The Kubelet of each node interacts with the control plane and waits for the order from the API server to run a Pod. Once the kubelet of a node receives the orders from the API server, it interacts with the container runtime of its node through a plugin-based interface (CRI shim). Hence, a pod starts running now.
In case you got confused, here's the summary of the internal working of Kubernetes components: Don't miss this!
The user sends a request to the API server to start a Pod
  kubectl run  --image=
This request is now being validated by the API server.
API server forwards this request to the scheduler on the control plane.
In return, Scheduler requests cluster-related information from the API server since API server is the only component that can interact with etcd store.
API after receiving this request from the scheduler reads the data from the etcd store and provides it to the scheduler.
The scheduler after receiving the information assigns a pod to a node based on the information and conveys this message to the API server.
  "Hey API server, the pod should run on node-01"
  - scheduler
API server assigns a specific node's kubelet to start a pod.
On receiving the orders from the API server, kubelet of that node interacts with the container runtime via CRI shim and now a pod has started running on a specific node. While the Pod is running, the controller manager checks whether the desired state of the cluster is in matches the actual state of the Kubernetes cluster.
  
Now, you might ask me what is the role of kube-proxy & addons?
Kube-proxy (runs on each node) is responsible for networking rules in a cluster. For eg.
Container-to-container communication inside Pods
Pod-to-Pod communication on the same node and across cluster nodes
Pod-to-Service communication within the same namespace and across cluster namespaces
External-to-Service communication for clients to access applications in a cluster.
Addons are cluster features that are implemented through 3rd-party pods and services. For eg., a Dashboard is a general-purpose user interface for cluster management via web UI.
Finally, look at the arrows in the topmost image and let me know if you understood what they were referring to.
If you've read up to here, Congratulations!
Feel free to reach out to me on Twitter or LinkedIn.
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Introduction to Helm!
Aviral Singh — Mon, 23 Jan 2023 12:36:08 GMT
Why Helm?
Kubernetes is awesome at managing complexities and we humans often tend to struggle while dealing with complexities. An application deployed on a Kubernetes cluster can be very complex to handle. A typical application made up of a collection of objects needs to be interconnected to work. Let us take an example of a simple WordPress site that might need deployment to deploy the pods that you want to run such as MySQL web servers, a persistent volume(PV) to store the database, PersistentVolumeClaim(PVC), a Service to expose the web server running in a pod to the internet and a Secret to store admin credentials & maybe other things like periodic backups, jobs and so on.
For every Object, we might need a separate YAML file, and then we apply kubectl apply to get these objects created which can be a tedious task and this even not might be the end of it if we download these YAML files from a remote repository and we might want to change its default settings. Assuming months went by and we might want to upgrade some components in our application, we'll have to edit all those YAML files with great care again! Sometimes, if we might want to delete the application, we'll need to remember each object that belongs to our application and delete them one by one. Isn't it such a tedious task? Some of you might think, It will be better if we combine all YAMLs into a single YAML and be done with it. Well, that's true but it will become even harder for us to look for the editing purpose in future changes which could be up to, for instance,25-30 pages of a YAML file. In separate YAML files at least we'll be able to easily categorize them into different categories.
Enters Helm!!
Helm changes the entire scenario! Kubernetes doesn't care about our application as a whole. All it knows is that we have created different objects in its cluster. It treats each component of the WordPress application as a separate individual entity and doesn't know that these are the components of the WordPress application! However, Helm is built from the ground to know about such stuff. That's why it's sometimes called Packet Manager for Kubernetes. It looks at those objects as a part of a big package as a group. Whenever we want to perform an action, we don't tell Helm the objects it should look after rather we tell Helm it belongs to the WordPress package, and then on the package name, it then knows what object it should edit even if 100s of objects belong to that package!
We all have played games such as FIFA, GTA, etc which contained different types of files such as audio, saved, visuals, etc. Fortunately, we didn't have to go through the horrors of installing each different component individually instead we execute the game installer and specify the directory where we want to install it and it does the rest. Helm does a similar thing, more of the YAML files and the Kubernetes objects that make our application.
$ helm install wordpress
Using Helm, we install our whole application using a single command even if it needs hundreds of objects. Helm proceeds to automatically add every necessary object to Kubernetes without bothering us about the details. We can customize the settings for our application or package by specifying desired values at the install time but instead of having to edit multiple values in multiple YAML files, we have a single location where we can declare every custom setting. In a file like values.yaml, we can change the size of persistent volumes, choose the name of our WordPress website, admin password, settings for the database engine, and so on. We can upgrade our application with a single command.
$ helm upgrade wordpress$ helm rollback wordpress$ helm uninstall wordpress
Helm will know what individual objects need to change to make the desired changes happen. Helm also keeps track of all the changes made to the app files and that allows us to roll back to the previous so-called revision. We use a single command to uninstall our app and it keeps track of all the objects used by each app so it knows what to remove. We don't need to remember each object that belongs to one of our apps anymore or use ten separate commands to remove everything. Helm does all the work.
Installing and configuring Helm!
Before installing helm, we must have a functional Kubernetes cluster and kubectl installed and configured on your local computer with the right login details set up in the kubeconfig file to work with the intended Kubernetes cluster. You can also try setting up Helm on an already set-up Kubernetes cluster on Killercoda or setting up Minikube which is a lightweight Kubernetes implementation that creates a VM(virtual machine on your local machine and deploys a simple cluster containing a single node).
https://killercoda.com/playgrounds
 
Helm can be installed on Linux, Windows, or Mac OS systems. We will go over the instructions for installing Helm on Linux systems in this blog.
Systems with snap can install helm using the snap install helm command. Use the classic option to install a more relaxed sandbox that gives the app a bit more access to the host system rather than strictly isolating it to its separate environment. This way Helm can easily access the kubeconfig file in your home directory so it knows how to connect to your Kubernetes cluster
$ sudo snap install helm --classic
For APT-based systems such as Debian or Ubuntu, Follow the instructions to add the key and sources list before installing Helm.
$ curl https://baltocdn.com/helm/signing.asc | gpg --dearmor | sudo tee /usr/share/keyrings/helm.gpg > /dev/null$ sudo apt-get install apt-transport-https --yes$ echo "deb [arch=$(dpkg --print-architecture) signed-by=/usr/share/keyrings/helm.gpg] https://baltocdn.com/helm/stable/debian/ all main" | sudo tee /etc/apt/sources.list.d/helm-stable-debian.list$ sudo apt-get update$ sudo apt-get install helm
And for PKG, run the package install helm command
$ pkg install helm
For installing Helm on Windows and Mac OS Refer to the Official Helm documentation
Helm 2 vs 3
Helm 1.0 was released in Feb 2016, 2.0 in Nov 2016, and 3.0 in Nov 2019. Since the project was launched, Helm has gone on to better as Kubernetes itself was improving. So during these things, There have been significant changes in Helm 2.0 and 3.0
Helm 2
Helm has a CLI client installed on your local machine that helps to perform Helm-specific actions against your Kubernetes cluster. When Helm 2 was launched, Kubernetes lacked features like RBAC(Role Based Access Control) and Custom Resource Definitions. To allow Helm to perform its actions, an extra component Tiller is installed in the K8s cluster. So, Whenever a user wanted to perform some Helm-specific action, It communicates with the tiller that is running on some server which in turn communicates with the K8s cluster and proceeds to take the action requested by the user. So, Tiller being the middleman adds complexities to the cluster and gives rise to security concerns. By default, Tiller has the privilege to do whatever it wanted. This is good as it allows us to make all necessary changes in a K8s cluster to install charts (Discussed Later). But this was also bad since anybody with Tiller access can do whatever they want.
After the introduction of Role Based Access Control (RBAC) and Custom Resource Definitions in Kubernetes, The need for Tiller decreased, so It was removed entirely in Helm 3. Now, nothing was sitting between the Cluster and Helm CLI, and security improved with RBAC as users can be limited with what they do with Helm. Before RBAC, we had to set these limits in Tiller which was not the best option but with RBAC built from the ground up to fine-tune permissions in K8s, it is pretty straightforward to do. As far as K8s is concerned, It doesn't matter if you are trying to make changes by accessing the cluster with kubectl or with helm commands, The user requesting the changes has the same RBAC allowed permissions whatever tool they use. That's a big difference between Helm 2 and 3 where Helm 2 uses Tiller while Helm 3 simplifies it by removing Tiller & integrating it with Kubernetes.
3-Way Strategic Merge Patch
Helm has a very cool snapshot feature. Let's take an example of installing a WordPress website using Helm. It'll create revision number 1 for this install. Then you upgrade your site to an updated version by changing the image, Helm will take us to snapshot number 2 which will be the exact state of the Kubernetes cluster at that moment in time. If there is a need you can return to snapshot 1 by creating a new snapshot 3 which will be in the same state as snapshot 1 which was at the start of the installation of the WordPress website. Helm 2 was less sophisticated when it came to how we did such rollbacks. When a rollback command is issued, Helm compares the current chart which is the chart that has the current WordPress website image 5.8 in it with the previous chart, which is the chart that has WordPress 4.8 image in it, and concludes that they are different so it applies the original chart to revert the image to 4.8
Let's consider a situation where we create Snapshot 1 just like above but for upgrading to a newer image with the kubectl set image command, so the application gets updated and this is done without using Helm and this does not create a Snapshot in Helm because the change was not made using Helm. When we know rollback, Helm compares the current version with the previous version. Since there is only one revision, Helm does not detect any changes, so it does not roll back or make any changes to the deployment. In this case, this didn't help us as the user change made through kubectl is still active.
On the other hand, Helm 3 is more intelligent. It compares the chart currently in use if we had created a revision that is which we didn't, the chart we want to revert and also the Life state, how our Kubernetes objects look like, and their declarations in the YAML form. This is where the fancy name 3-way Strategic Merge Patch comes from. By also looking at the live state, it notices the image version is 5.8 but the image in Snapshot 1 that we want to revert to is 4.8. So, it makes necessary changes to come back to the original state.
Besides rollbacks, there are things like upgrades where Helm2 was also lacking. For example, you want to install a chart but then you make some changes to some of the Kubernetes objects installed. It all works nicely until you want to perform an upgrade. Helm 2 looks at the old chart and the new chart you want to upgrade to and all your changes will be lost since they don't exist in the new chart whereas Helm 3 looks at the chart as well as the Live State of the cluster and notices the changes made by the user and make changes preserving the additional tweaks made by the user.
Helm Components
Let's get familiar with the components in Helm, their general structure, and the concepts that we are going to be working with it. We have the Helm Command line Utility (Helm CLI) that we'll be using to perform Helm actions such as installing charts etc. Charts are collections of files and they contain all the instructions to be known to create a collection of objects that you need in a Kubernetes Cluster. By using charts and adding the objects according to the specific instructions in the chart, Helm installs the application in your Kubernetes Cluster. When a Chart is applied to your cluster, a Release is created. A Release is a single installation of an application using a Helm Chart. Within each release, You can have multiple revisions (snapshots of an application).
Every time a change is made to the application such as an upgrade of the image or change of replicas or configuration objects, a new revision is created. Just like how we can find all Docker Images at Docker Hub, In the same manner, we can find Helm Charts in a public repository. We can easily download publicly available charts which are readily available and we can use them to Deploy Applications on our cluster. And finally, to keep track of what it did to our cluster such as the releases that it installed, charts used, revision state, etc. Helm needs a place to save this data. This data is known as Metadata (data about data). It wouldn't be too useful if Helm saves this on our Local Computer. If another person needs to work with our Helm Releases through Helm, he would need to have a copy of this data. Helm does the smart thing by storing this data in our Kubernetes cluster as Kubernetes Secrets. This way data survives as long as the Kubernetes cluster survives, and everyone from a team can access it. They can perform different Helm actions to it. The punchline is that Helm will know about everything you did to cluster and will be able to keep track of every step you do since it has its metadata always available
Charts
Charts are collections of files. They contain all the instructions that Helm needs to be able to create the collection of objects that you need in your Kubernetes cluster. By using these Charts and adding the objects according to the User's modification, Helm installs the application in your Kubernetes cluster.
Let's take an example of a simple Hello World application which is a simple Nginx based web-server and a service to expose it. In this, we have two objects, a Deployment, and a Service. In the Deployment YAML file, you'll notice that images and replicas are specified in a different format. This is known as templating. The values that go here a part of another file called values.yaml file.
In helm Chart, we'll be often interacting with this special file(values.yaml). Most of the time, we won't have to build a chart ourselves. We simply have to download it from an online public repository. We'll just have to configure the package that we installed through that chart. The values.yaml is the place where the configurable values are stored. Most of the time, this will be the only file you'll have to modify to customize the deployment of the application for your needs! This is like the setting file for Helm Chart.
When a Chart is applied to your cluster, a release is created. One question might arise! Why the need for an additional item? Why can't we say we just say we installed a chart to Kubernetes? One simple reason why it makes more sense to have releases based on charts is that we can install multiple releases on the same chart so we can launch, for example, a second WordPress site with a command such as a helm install my-second-site bitnami/wordpress. Since they are two different releases, they can be tracked separately! Even though they are based on the same chart as releases, they are two entirely different entities. Now, this can come in handy in a lot of situations. Let's take an example of a WordPress website that can have a release for customers' use and another release for developers who can internally add features without breaking the main website. Since the two releases are based on the same chart, once they successfully integrate it into the development side, they can transfer it to the main website since it should work exactly in the same way as both the websites are clones and built the same way!
Thousands of charts are readily available at different Helm repositories across the world. Different providers are hosting Helm repositories such as Appscode, Community Operators, True Charts, Bitnami, etc. All of these repositories have listed their charts in a single location known as Helm Hub or Artifact Hub.
Working with Helm: Basics
$ helm --help> This command will list helpful information on how to execute a particular command in a summarized manner. $ helm restore --help >  We can also use this to look for subcommands
Launching a WordPress website in a Kubernetes Cluster
Downloading the WordPress chart from Artifact Hub which has official mentioned before it as it would be by the official developers of that website!
We can search for the WordPress chart through Helm CLI also
$ helm search wordpress --search_where> In search_where we have to mention where we have to look for this. Specify the hub or repo. This will list all the charts listed at artifacthub.io$ helm search hub wordpress
Once Chart is Identified, we can deploy the application in two commands which are listed on the README file of that chart!
$ helm repo add bitnami https://charts.bitnami.com/bitnami$ helm install my-release bitnami/wordpress
Now once the chart has been deployed successfully! It is deployed as a release
To list, all existing releases, Run the following command! This is very useful not only to see what is being installed but also which hasn't been updated in a long time.
$ helm list
To delete all traces of this app. Imagine doing this by removing every file one by one which will be a tedious task to get rid of all WordPress elements! But with Helm can be done by a single-line command. We can see the power of Helm as a Package Manager for Kubernetes
$ helm uninstall my-release
Some Other commands while working with the helm
$ helm repo> This command consists of multiple subcommands to interact with chart repositories. It can be used to add, list, remove, and index chart repositories$ helm repo list> Will list all the existing repositories$ helm repo update> This command is somewhat equivalent to what a: sudo apt-get update : command does on a Linux OS
There is much more in Helm to explore! but keeping this blog a little short and quite explanatory for beginners to get a taste of helm as a Packet Manager for Kubernetes
For more exploration in Helm: Checkout the official documentation of Helm
Credits for creating this Blog
KodeKloud's Helm for Beginners Course By Mumshad Mannambeth
Helm's official documentation.
Follow Kubesimplify on Hashnode, Twitter and Linkedin. Join our Discord server to learn with us.


An overview of GitOps and ArgoCD.
Rakshit Gondwal — Mon, 16 Jan 2023 12:30:45 GMT
What is Gitops?
The basic definition of Gitops is "Infrastructure as Code" done right. Now, what is IaC (Infrastructure as code)? IaC means to define the whole infrastructure into code, not only infrastructure but also Network, Security, Configuration, and Policy.
IaC done right.
Now, what does "done right" mean in the definition of Gitops? Usually, we use IaC the wrong way. We go on to save the code in our local systems or use version control so that other members of the team or any other individual can collaborate. This might be a headache for you as everyone would be pushing their changes to the main branch directly without any checks or testing.
Things get even more troublesome when trying to push this code into the environment. You'll have to first test every change locally, verify that it doesn't break anything, and then push it to production. Thus, this process of manually testing and deploying the code is very inefficient and time-consuming.
This is where GitOps come into play. In this method, we have our infrastructure as a code and a CI/CD pipeline, in which we treat the infrastructure code the same as the application code.
In the GitOps method, the IaC is hosted on a git repository. In this method, a change is made in the form of a pull request to the repository rather than pushing it to the main branch directly.
After a change is made, it undergoes a CI(Continuous Integration) pipeline where automatic tests take place. After passing through this CI pipeline, any senior engineer, tester, or repository maintainer can approve the changes and allow the pull request to merge into the main change. This way there is no chance of any error or any other security issue.
After merging it into the main branch, the change is directly deployed to the environment with the help of a CD(Continuous Deployment) pipeline.
Continuous Deployment can be done in two ways-
Push Base Deployment: This is the traditionally used way by various tools like Jenkins, and Gitlab. Here, the pipeline executes a certain command to push the code to the environment.
Pull Base Deployment: Here, we have an agent installed into our environment itself, which is linked to the repository itself. It keeps on checking for any change made to the repository and automatically pulls and deploys if there is any change.
  Tools that use the Pull Base Deployment are ArgoCD and FluxCD.
ArgoCD
ArgoCD is a continuous delivery tool that helps you to deploy your applications into multiple environment in a declarative and in the GitOps way.
CD workflow without ArgoCD
Let's say we use Jenkins for the CI/CD pipeline. Now, when a new change gets committed into the main repository, it will undergo various builds and tests to test that it does not break anything. It might even build a new docker image. This whole process falls under the CI pipeline.
Now Jenkins will push these changes to the environment using kubectl or helm or any other commands. I've stated the challenges we might face while any push-base deployment tool below:
The first challenge we face is to install tools like kubectl or helm.
The second challenge we might face is to provide Jenkins access to the Kubernetes cluster. If we are using EKS, we'll have to provide Jenkins access to the AWS. This might create a serious security risk.
The third challenge we face is that once Jenkins has deployed any change into the environment, it loses access to the cluster, making it impossible to check whether the change has been successfully deployed or not.
CD workflow with ArgoCD
ArgoCD follows the GitOps principles, which means that it uses the pull-base deployment.
ArgoCD is an agent which we install inside our Kubernetes cluster and then bind it to a git repository. ArgoCD will keep looking for any change in the repository and as soon as a change is made in the repository, ArgoCD will pull it and apply it to the Kubernetes cluster.
Now, our configuration might contain more than just YAML files and might also contain secrets, services, ingress, hence it is a best practice to separate our application code and the configuration code into two different repositories.
This way, there is would be no need to run the entire CI pipeline when there is a change to any service or any configuration.
This way, Jenkins will update the manifest file in the configuration repository and ArgoCD will auto fetch the change and apply it into the environment.
Advantages of using ArgoCD
Everything is version controlled, means we can keep a check that who made which change.
We can easily rollback to any old state if anything breaks after applying any change.
Git is the single source of truth. Even if a change is made manually to the cluster, ArgoCD will keep a check on the desired state and the actual state. And since the actual state was changed, it will automatically roll back the changes made to the cluster. This way, ArgoCD keeps a check on both, the repository and the cluster itself.
Git allows us to set up access rules so that any team member can submit a pull request, while only senior members can merge. This way, we don't need to create ClusterRole and User resources in kubernetes.
Cluster disaster recovery: Suppose I have an EKS cluster in region 1-a and this cluster completely crashes, then I can make a new cluster and point it to the same git repository. This will create the same cluster with the same configuration as earlier.
We get a real time update of our application state even after the changes are made.
ArgoCD Demo
Configuring ArgoCD
ArgoCD is configured directly into the Kubernetes cluster itself, and it extends the Kubernetes API's with CRD's(custom resource definition). ArgoCD is installed in the Kubernetes cluster with the help of YAML file. In this file, we define which git repository should be synced with which Kubernetes cluster. It can be any git repository and any Kubernetes cluster rather, be it the cluster in which ArgoCD is installed or any other cluster that ArgoCD is managing.
If we have different cluster environments, such as deployment, staging, and production, then we deploy ArgoCD separately. All these environments are configured with one single git repository where all the configuration is stored.
Installing ArgoCD into a k8s cluster.
I am using minikube for a Kubernetes cluster, but you can use anything, be it either minikube or any cloud provider such as AWS, Civo, etc.
Create an ArgoCD namespace.
kubectl create namespace argocd
Install the required services and application resources.
kubectl apply -n argocd -f https://raw.githubusercontent.com/argoproj/argo-cd/stable/manifests/install.yaml
Run the following command to get the pods running inside the ArgoCD namespace.
kubectl get pods -n argocd
The above should return the following code snippet.
NAME                                                READY   STATUS    RESTARTS   AGEargocd-application-controller-0                     1/1     Running   0          2m9sargocd-applicationset-controller-74575b6959-v6nf5   1/1     Running   0          2m13sargocd-dex-server-64897989f8-qxqsb                  1/1     Running   0          2m13sargocd-notifications-controller-566bc99494-hmfzc    1/1     Running   0          2m12sargocd-redis-79c755c747-524rj                       1/1     Running   0          2m12sargocd-repo-server-bc9c646dc-9w5q8                  1/1     Running   0          2m12sargocd-server-757fddb4d7-hgtp9                      1/1     Running   0          2m10s
Download the ArgoCD CLI
Linux
Using Homebrew.
brew install argocd
Using Curl.
curl -sSL -o argocd-linux-amd64 https://github.com/argoproj/argo-cd/releases/latest/download/argocd-linux-amd64sudo install -m 555 argocd-linux-amd64 /usr/local/bin/argocdrm argocd-linux-amd64
Mac
Using Homebrew.
brew install argocd
Using Curl
VERSION=$(curl --silent "https://api.github.com/repos/argoproj/argo-cd/releases/latest" | grep '"tag_name"' | sed -E 's/.*"([^"]+)".*/\1/')
Access the ArgoCD UI.
Port-forwarding can also be used to connect to the API server without exposing the service.
kubectl port-forward svc/argocd-server -n argocd 8080:443
The UI can then be accessed using https://localhost:8080
Login into the UI.
Use admin for the username.
Use the following command to get the password
kubectl -n argocd get secret argocd-initial-admin-secret -o jsonpath="{.data.password}" | base64 -d; echo
Creating a new application
You will see a UI like this, after logging in.
Click on + New App to create a new application.
Enter the following details to create a new application. For this demo, I am using https://github.com/argoproj/argocd-example-apps for the repository URL and https://kubernetes.default.svc for the Cluster URL.
  
Click on `Create` on the top to create a new application.
Deploying the application.
Currently, the guestbook application is OutOfSync that means it is not deployed.
  
To deploy the application, click on the Sync button and the application will automatically get deployed and would be in sync with the desired state.
  
The application is now in sync with the provided git repository, and ArgoCD is constantly looking for changes made to the git repository.
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Deploying Java Application using Docker and Kubernetes- DevOps Project
Tania Duggal — Mon, 09 Jan 2023 12:30:42 GMT
Hello Everyone, Welcome to the blog. In this blog, We going to see "How to Deploy a Java Application using Docker and Kubernetes". So, Let's start
The workflow of the project is going to be like this in the below image 
So, You have already installed Docker, Git, Kubernetes and Maven in your system.
Firstly, start the minikube cluster to up the k8s cluster. Now, the k8s cluster is up.
You can do kubectl cluster-info to see the information of the cluster. You can see this here
You must have maven installed on your system and set the maven home page. We use maven to build the application.
Since I already installed maven on my system. To Check if your maven is installed or not. You can use maven --version to check the version of the maven in your system. If it provides the output, it means you successfully installed the maven.
Now, fork the project(Java Application) from GitHub. I used someone's Java application. You can go to my GitHub repo and fork it and can clone it.
Here's the link to my repository
Now, go into the repository. By doing ls, you can see the different microservices i.e productcatalogue, shopfront and stockmanager.
Now, we are going to build our microservices to get the jar file.
Now, go into the shopfront microservice. By doing ls, you can see the files/folders inside the shopfront.
You can build the shopfront microservice by using mvn clean install command.
After the build success, you can see the target folder in the shopfront. In that, we have our jar file.
Now, You can build the image of your shopfront jar file from the dockerfile by using docker build command.
Let's, take a look at the dockerfile.
From: We call the base image for OpenJDK for the JRE file
And: Add YAML and config files that are necessary
We expose the port and give EntryPoint
You can check the image by docker images command. It lists all the images.
Now, you repeat the same steps for your other microservices.
First, by mvn clean install command, get the jar file for the remaining microservices.
Second, by docker build command, build the images of the remaining microservices.
Now, after following the above steps, you can build your remaining images. You can see all the images by docker images command.
Now, You have to push all your images to your docker account.
Now, login to your docker account by docker login. It asks for the username or password.
After the login succeeded, you can push your images to the dockerhub. You have to give the username of your docker account because if you don't give the username, you won't able to push the images because it has to understand from where the images are being pushed (from which account).
Now, you go to dockerhub, and you can see your images in your repository.
Once you push your images, you can docker logout from dockerhub login from your system.
Now if I go to my application, there is a folder "Kubernetes". It contains the YAML files for each microservice.
If you don't know about service and deployment, please refer to this article .
Now, go to the Kubernetes folder. Here, You can see YAML files for each microservice.
Now, let's take a little bit look at one of the YAML files.
In the YAML file, we use "Deployment" and "Service" objects. Using a deployment allows you to easily keep a group of identical pods running with a common configuration. Once you have defined and deployed your deployment Kubernetes will then work to make sure all pods managed by the deployment meet whatever requirements you have set.
When using a Kubernetes service, each pod is assigned an IP address. As this address may not be directly knowable, the service provides accessibility, then automatically connects the correct pod. When a service is created it publishes its own virtual address as either an environment variable to every pod or, if your cluster is using coredns, as a DNS entry any pod can attempt to reach. In the event of any changes to the number of available pods the service will be updated and begin directing traffic accordingly with no manual action required.
Now, We are going to apply all the YAML files using kubectl apply command.
Now, your Deployment and Service objects are created. You can see them by using Kubectl get deployment and kubectl get svc commands respectively.
Now, it's time to access our microservices from the web. Now, do minikube service microservicename . It gives the URL to access the application. You can see this here
Now, You'll go to your web browser and put the IP address there to access the microservice from the web browser. You can see this here
Now, do the same for other microservices to access it from the web browser. You can see this here
These all are the steps to deploy the Java application using Docker and Kubernetes. I hope🤷 it should be helpful for you to understand the concepts.
Don't forget to like and share this post. Connect with me on Twitter. Follow me for more such blogs on Hashnode.
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.


Firewall: A Network's Gatekeeper
Arnav Barman — Tue, 03 Jan 2023 12:30:42 GMT
🌐Introduction
A firewall is a part of the computer system or network whose fundamental objective is to separate trusted and untrusted components of a network. It uses pre-defined rules/policies to establish a secure connection and stop unauthorized traffic from flowing from one network to another. It acts as a protective layer that filters data, redirect traffic, and protects against network attacks. A firewall can be based on software or hardware, depending on the use case.
A firewall is the first line of defense against malware or application layer attacks in the network domain. It assesses the flowing data packets for any suspicious activity in the network and blocks it using the policies set up while configuring the firewall.
Requirements for a firewall
All the traffic between any two zones must pass through the firewall.
Only the traffic authorized by virtue of security policies should be allowed to pass through a firewall.
The firewall must be impenetrable in itself.
The De-Militarized Zone (DMZ)
While DMZ is a separate topic of its own, I have given a summary to explain its architecture and functioning in its most basic sense!
If you're someone who is intrigued by cybersecurity, then do check DMZs in more depth and how honeypots are placed in a DMZ to secure the network policies.
The DMZ is a subnet that can be configured logically and physically. The primary function of DMZ is to connect the public network to an organization's private network by sandwiching the organization's servers, which are in contact with the public network, between two firewalls. DMZs can be configured using one or two firewalls. Still, the better approach is to use two firewalls (preferably made by different vendors to reduce common vulnerabilities, if any) and then put the DMZ between them. The inner firewall has secure and tight policies, while the outer firewall is somewhat forgiving. The DMZ adds another layer of security to an organization's network structure by detecting a security flaw before it reaches the private LAN. Some servers usually placed in a DMZ are DNS, email servers, web servers, etc., as they are more prone to being attacked through the external network and provide services outside the LAN.
📝Firewall Policies & Actions
There are three kinds of policies on which the firewall authorizes the data:
User Control: Here, access to the requested data is based upon the requesting user's role. This kind of control is applied to users inside the parameter of the firewall. E.g., consider a university's network; the accounting department subnet here can access the financial database, but the systems on the faculty subnet don't have access to the same, etc.
Service Control: Controls access by the type of service the host offers. The rules are applied based on the network address, the protocol of the connection port, and the port numbers of services.
Direction Control: This ensures the direction in which the requests must be initiated and are allowed to flow through the firewall (inbound or outbound).
After applying the configured rules on the data packets, the firewall takes some predefined actions. These actions are:
Accept: Here, the data is allowed to enter through the firewall.
Drop: Here, the data is filtered outside and not permitted to flow through the firewall.
Reject: This action is the same as the Drop action. It only adds a rejection message to the source using an ICMP packet on top of the Drop action.
From the aspect of cybersecurity, one must keep in mind to use REJECT whenever the firewall must disallow the packet flow originating from a trusted source. But in the cases where the source is not trusted, one must always use the DROP action as it will not send back any message and lead to a timeout of the request from a potential attacker. E.g., if we send a REJECT message back to an attacker, it would get aware of the up-and-running status of the machine even after successful filtering of packets, whereas on using DROP, the attacker can not analyze anything to conclude the running status of a machine.
🔥Types of Firewalls
There are two kinds of filtering that are performed by a firewall:
Ingress Filtering
Inspection of the incoming traffic to safeguard an internal network.
Blocks out the suspicious packets that are coming from outside.
Egress Filtering
Inspection of the outgoing traffic to block the internal user's access to specific networks.
Blocks users from reaching out to the outside network. E.g., blocking social networking sites, etc.
Now, depending on the mode of operation, there are three types of firewalls:
1. Packet Filtering Firewall
This type of firewall controls the traffic based on the information stored in the network and transport layer headers of the data packet, without paying attention to the packet's payload data.
The firewall does not maintain the states of packets. Hence, it is also called the Stateless Firewall.
The firewall does not care if the packet is a part of an existing data stream. It verifies the header information, nevertheless.
2. Stateful Firewall
This type of firewall tracks traffic states by monitoring all the connection interactions until the connection is closed.
A table for states of connection of packets is maintained to keep track of information.
E.g., the firewall can be configured to allow packet flow in already open port connections.
3. Application/ Proxy Firewall
Such a firewall acts as a proxy between private and public networks. The client's connection terminates at the proxy, and a separate connection is initiated from the proxy to the destination server.
The data analysis is done up to the application layer in the application firewall.
Such a firewall controls inputs, outputs, and access to/from an application or a service.
The proxy behaves like an intermediary by impersonating the intended recipient.
Typically, an application firewall is set up to be used as a proxy. The general one is the web proxy (to control what the browsers can access). To set this up, we place them on a network bridge between the internal and external networks and configure all the web traffic to be routed through them.
A proxy can also be used to avoid egress filtering. E.g., if a firewall filters packets based on the destination IP address, then we can route our packets to the proxy's IP address configured to be accepted by the firewall, which then forwards our query to the desired destination.
Another use of a proxy is to anonymize the origin of a network request from servers. As the request will contain the IP of the proxy, the servers will have no clue about the actual origin of the request.
👀How to bypass firewalls?
1. SSH Tunneling
Let's say our firewall blocks traffic on port 23 (Telnet). To access a telnet server inside the firewall, we can set up an SSH connection on port 22 between a server outside the network and a client (with an open port 23) inside the network. Now the client can connect with the telnet server within the network and route its data through the ssh connection to the external server, thereby tunneling the blocked data through the firewall.
2. VPNs
We can encapsulate our data within IP packets and send it across the firewall through a tunnel between an internal and external system created using VPN. As the tunnel traffic is encrypted, the firewall is unaware of the contents, and the blocked content can bypass the firewall filtering.
3. Other methods to evade firewalls
There are many more methods of bypassing a firewall. Some of the most popular ones are:
Dynamic port forwarding
Banner Grabbing
Fragmenting Packets
Firewalking
Source Routing
IP address spoofing
MAC Address spoofing
Through XSS Attack
And many more
Follow Kubesimplify on Hashnode, Twitter, and LinkedIn. Join our Discord server to learn with us.
Like the explanation? Want to connect? You can find me here. Till then, happy learning!



Kubeflow Pipelines: Orchestrating Machine Learning Workflows - Part 3
Rishit Dagli — Tue, 27 Dec 2022 12:30:44 GMT
Kubeflow Pipelines is a great way to build and deploy end-to-end scalable and portable Machine Learning workloads. In this article, we take a look at how to use Kubeflow Pipelines for your own tasks and how Kubeflow Pipelines works under the hood.
Previous articles in the series
Kubeflow: Machine Learning on Kubernetes - Part 1
Kubeflow Notebooks: ML Experimentation Made Easier - Part 2
In the last article, we already took a look at Kubeflow Notebooks, when can they be used, customizations you could make, and how they work. The time around we will talk about Kubeflow Pipelines, another component of Kubeflow.
Kubeflow Pipelines
Building and deploying portable, scalable machine learning workflows is really important especially since you would have different stages in your machine learning workflow all of which use different tools: preparing data, training the model, evaluating performance, deployment, and more. This particularly motivates the need for an orchestrator and is also a way to foster reusability.
This is exactly what Kubeflow Pipelines aims to do. Kubeflow Pipelines is based on top of Argo Workflows, which is an open-source, container-native workflow engine for Kubernetes, we will talk more about this later. With Kubeflow Pipelines your machine learning pipeline is implemented as a graph, and each of the nodes in this graph forms different stages in a workflow.
A workflow in Kubeflow Pipelines
You can think of a pipeline as a description of your machine learning workflow including the inputs required to run the pipeline and all the pipeline components. A pipeline component is a self-contained set of code (a Docker image) that performs a single step in your pipeline, such as data preprocessing, model training, and so on. Multiple of these components and how you arrange them in a graph will make up your pipeline.
The main goals of Kubeflow Pipelines are:
End-to-end orchestration: enabling and simplifying the orchestration of machine learning pipelines.
Easy experimentation: making it easy for you to try numerous ideas and techniques and manage your various trials/experiments.
Easy re-use: enabling you to re-use components and pipelines to quickly create end-to-end solutions without having to rebuild each time.
Running a pre-built pipeline
We will start by exploring how we can run a pre-built Kubeflow pipeline, this will help you get familiar with Kubeflow Pipelines UI as well as set the background for the other sections in this article. Kubeflow comes installed with a few sample pipelines which you can notice under the Pipelines tab in the Kubeflow Central Dashboard.
Kubeflow comes with pre-packaged Pipelines
Clicking on a specific pipeline you can see its graph as well as the pipeline or pipeline component's compiled code, which is essentially an Argo YAML file.
When running a pipeline you must choose an experiment, an experiment is a workspace and you can use experiments to organize your runs into logical groups.
Start by clicking on the "Data passing in python components" pipeline and as you will notice, it is a quite simple pipeline that runs some Python commands. We will start by creating an experiment by clicking the "Create an Experiment" on the UI, give it a name, and then you should end up on a page to start a run.
The Start a Run page
Right now, we will just select our run to be a one-off run and not set up a recurring run which allows you to (as the name suggests) run the pipeline after some set time. Your pipeline run will now start, Your run should soon be over since it is a very small pipeline.
The run is completed
You just run your first Kubeflow pipeline and Before talking about building the pipeline using the Python SDK we will see the main components of Kubeflow Pipelines.
Components of Kubeflow Pipelines
The Kubeflow Pipelines platform consists of:
A UI for managing and tracking pipelines and their execution
An engine for scheduling a pipelines execution
An SDK for defining, building, and deploying pipelines in Python
Notebook support for using the SDK and pipeline execution
We already took a look at the UI and now we will take a better look at using the Python SDK and how you can create your own new pipelines.
The Python SDK
As you now know Kubeflow Pipelines are stored as Argo YAML files executed by Argo. Kubeflow also exposes a Python domain-specific language for creating new pipelines. The Kubeflow Pipelines SDK provides a set of Python packages that we can use to specify and run our machine-learning workflows as pipelines.
A pipeline is just a graph of container execution. In addition to specifying which containers should run in which order, it also allows us to pass arguments to the entire pipeline and between these containers.
What do you need?
For all of these containers, we need to make sure a couple of things are being done:
First off you of course want to create a container and this could be as simple as you writing a Python function and Kubeflow Pipelines packaging it up as a container or bringing your container as well
You then need to show Kubeflow Pipelines how it should run the container which could involve any command line arguments or data mounts you need to be able to run this container as desired
You also need to order these containers, which of these should run sequentially which of these should run in parallel, and so on
Finally, as you know Kubeflow Pipelines needs an Argo YAML file and not Python code so finally, you want to be able to compile your Python code into Argo YAML files
And all of this is how the Kubeflow Pipelines Python SDK helps us out.
Installing the SDK
You can install the Kubeflow Pipelines SDK through PyPI considering you already have Kubeflow with all of its components installed or just the standalone Kubeflow Pipelines installed:
pip install kfp --upgrade
I would recommend working through the demos in a Kubeflow Notebook which by default has kfp installed as well as gives you access to Kubeflow Pipelines by default since the notebook lives in the same cluster. I will be using Kubeflow Notebooks as we talked about in Kubeflow Notebooks: ML Experimentation Made Easier article.
However, You are not bound to do so you could most certainly try out these experiments outside Kubeflow Notebooks as well. You would need to connect to Kubeflow Pipelines with the SDK, an in-depth guide on doing so can be found here.
In this article, I will connect to the SDK simply using:
client = kfp.Client()
which works well since the notebook is in the same cluster as Kubeflow Pipelines.
Building new pipelines
We will first take a look at building components with just a Python function and allowing Kubeflow to package it.
Function based components
We will build a component in our pipeline which multiplies two numbers, this is a rather simple component and creating Python function-based components will be an easier way to go rather than building a container image for your component which we will soon see as well.
Here is a simple function that multiplies two numbers:
def multiply(a: float, b: float) -> float:  return a * b
Next up, we create a pipeline component just from this function using the create_component_from_func method. You can also see the underlying component yaml file created at multiply_component.yaml, if you notice under the hood this creates a container with a Python container image and runs our program while also adding some code for serialization and passing arguments. This yaml file is a reusable and shareable definition of your component.
The create_component_from_func also returns a factory function, were you to call multiply_op() it would create kfp.dsl.ContainerOp class instances which are how you represent an op implemented by a container image, we would use the multiply_op later while creating a pipeline.
def multiply(a: float, b: float) -> float:    return a * bimport kfpmultiply_op = kfp.components.create_component_from_func(    multiply, output_component_file="multiply_component.yaml")
We will now create a pipeline using this component. We first annotate the pipeline creation function with @dsl.pipeline which specifies that this function will be used to create a pipeline.
Notice something odd? The arguments to multiply_pipeline are strings and not floats, this is indeed expected and would be taken care of by the serializer and deserializer.
Finally, this piece of code also connects to the Kubeflow Pipelines using the SDK, you should also read this documentation which lists how you would do so for all kinds of platforms. After connecting to Kubeflow Pipelines we also create a run for this pipeline: if you remember from earlier we should now expect to see our pipeline running in Kubeflow dashboard.
def multiply(a: float, b: float) -> float:    return a * bimport kfpmultiply_op = kfp.components.create_component_from_func(    multiply, output_component_file="multiply_component.yaml")import kfp.dsl as dsl@dsl.pipeline(name="Multiply", description="An example pipeline.")def multiply_pipeline(    a="1",    b="5",):    multiply_task = multiply_op(a, b)arguments = {"a": "2", "b": "3"}client = kfp.Client()client.create_run_from_pipeline_func(multiply_pipeline, arguments=arguments)
Alternatively, you could also create a zipped yaml file for our pipeline and load it to Kubeflow Pipelines which works the same way.
compiler = kfp.compiler.Compiler()compiler.compile(multiply_pipeline, 'multiply-pipeline.zip')
The above code creates a file multiply-pipeline.zip which can be uploaded using the Kubeflow Pipeline UI.
You would also need to follow the steps we covered earlier in this article to run the pipeline we just uploaded however this time around when running the pipeline through the UI you see the option of run parameters which is the parameters our pipeline accepts, in our case a and b.
Specifying Base Images
The current approach we saw, was using Python functions as pipeline components. By default, this uses the Python image corresponding to the current Python environment. However, Kuebflow also allows explicitly specifying base images to use for your pipeline components.
Here is an example where I create a pipeline component from the same Python function however, I specify a different base image for running this:
def multiply(a: float, b: float) -> float:    return a * bimport kfpmultiply_op = kfp.components.create_component_from_func(    multiply,    output_component_file="multiply_component.yaml",    base_image="python:3.7",)import kfp.dsl as dsl@dsl.pipeline(name="Multiply", description="An example pipeline.")def multiply_pipeline(    a="1",    b="5",):    multiply_task = multiply_op(a, b)arguments = {"a": "2", "b": "3"}client = kfp.Client()client.create_run_from_pipeline_func(multiply_pipeline, arguments=arguments)
You can also specify a list of packages you want to install before the pipeline component is run, this is particularly helpful if your component just requires a few other libraries to be installed. Here is an example of the same function however using the default image and installing one new package.
def multiply(a: float, b: float) -> float:    return a * bimport kfpmultiply_op = kfp.components.create_component_from_func(    multiply,    output_component_file="multiply_component.yaml",    packages_to_install=['pandas==0.24'],)import kfp.dsl as dsl@dsl.pipeline(name="Multiply", description="An example pipeline.")def multiply_pipeline(    a="1",    b="5",):    multiply_task = multiply_op(a, b)
We can now run this just as we did earlier directly through the SDK using this piece of code:
arguments = {"a": "2", "b": "3"}client = kfp.Client()client.create_run_from_pipeline_func(multiply_pipeline, arguments=arguments)
Or we could also trigger a run from the UI by running the following to get the compiled pipeline:
compiler = kfp.compiler.Compiler()compiler.compile(multiply_pipeline, 'multiply-pipeline.zip')
Using container images
Building pipeline stages directly from Python provides a great way to do much with Kubeflow Pipeline. It does limit our implementation to Python, though. With Kubeflow Pipelines we can orchestrate the execution of container images thus allowing us to use any tool or language for your pipeline. For Kubeflow Pipelines to run your component, your component must be packaged as a Docker container image and published to a container registry that your Kubernetes cluster can access. This does not involve doing any changes to your container image for Kubeflow pipelines.
We can do this by using kfp.dsl.ConatinerOp, here is some simple code to load the Python image and then run some commands on the container image:
import kfpimport kfp.dsl as dsl@dsl.pipeline(name="cointoss", description="Example Pipeline.")def random_coin_toss():    random_step = dsl.ContainerOp(        name="Flip coin",        image="python:alpine3.7",        command=["sh", "-c"],        arguments=[            "python -c \"import random; result = 'heads' if random.randint(0,1) == 0 "            "else 'tails'; print(result)\" | tee /tmp/output"        ],        file_outputs={"output": "/tmp/output"},    )
We can also have environment variables while running this step, to do so we would need to use the Kubernetes Python Client:
from kubernetes import client as k8s_clientimport kfpimport kfp.dsl as dslsome_step = (    dsl.ContainerOp(        name="example", image=image    )    .add_env_variable(k8s_client.V1EnvVar(name=env_var_1, value=value_1))    .add_env_variable(k8s_client.V1EnvVar(name=env_var_2, value=value_2)))
However, ideally, you would want to be able to better reusable steps and it is often suggested to not directly use kfp.dsl.ContainerOp and rather use load_component_from_text. below is an example of the same step using load_component_from_text this time around and as you might observe the syntax is pretty similar:
create_step_coin_toss = kfp.components.load_component_from_text("""    name: Flip Coin    description: Example Pipeline.    inputs:    - {name: text, type: String}    outputs:    - {name: data, type: Data}    implementation:      container:        image: python:alpine3.7        command:        - sh        - -c        - |          python -c \"import random; result = 'heads' if random.randint(0,1) == 0          else 'tails'; print(result)\" | tee /tmp/output""")
Passing data between steps
The examples we saw earlier were pretty simple with a single function being run however as you start building complex pipelines, you most certainly would need to pass data between containers and maybe even pass the output of one step to the other step.
Under the hood, when Kubeflow Pipelines runs a component, a container image is started in a Kubernetes Pod and your components inputs are passed in as command-line arguments. When your component has finished, the components outputs are returned as files.
We can do this using .output on a dsl.ContainerOp object. Building a pipeline that reuses the outputs from other steps also tells Kubeflow Pipelines the order in which the components should be run. Here is an example of a pipeline that reuses the outputs from a previous step:
def multiply(a: float, b: float) -> float:    return a*bdef add(a: float, b: float) -> float:    return a + bimport kfpmultiply_op = kfp.components.create_component_from_func(    multiply, output_component_file="multiply_component.yaml")add_op = kfp.components.create_component_from_func(    add, output_component_file="multiply_component.yaml")import kfp.dsl as dsl@dsl.pipeline(name="Multiply and Add", description="An example pipeline.")def multiply_add_pipeline(    a="2",    b="5",    c="3"):    multiply_task = multiply_op(a, b)    # Calculate (a * b) + c    add_task = add_op(multiply_task.output, c)
However, it might be helpful to have multiple outputs and not just a single output from a function that you can use, we can use NamedTuples here. We would essentially still return a single value, a tuple however we would be able to reference the outputs we need from it. Here is an example of a pipeline that achieves the same goal but with NamedTuples:
from typing import NamedTupledef multiply(a: float, b: float) -> NamedTuple("MultiplyOutput",[("result", float)]):    from collections import namedtuple    output = namedtuple('MultiplyOutput', ['result'])    return output(a*b)def add(a: float, b: float) -> float:    return a + bimport kfpmultiply_op = kfp.components.create_component_from_func(    multiply, output_component_file="multiply_component.yaml")add_op = kfp.components.create_component_from_func(    add, output_component_file="multiply_component.yaml")import kfp.dsl as dsl@dsl.pipeline(name="Multiply and Add", description="An example pipeline.")def multiply_add_pipeline(    a="2",    b="5",    c="3"):    multiply_task = multiply_op(a, b)    # Calculate (a * b) + c    add_task = add_op(multiply_task.outputs["result"], c)
Until now, we took a look at passing simple data between containers, primitive types, or Python objects however that limits what we can do with Kubeflow Pipeline. We would often want to pass much larger data maybe blobs and not just objects. One particular example could be passing the entire dataset between steps. You would probably want to leverage Kubernetes Persistent Volumes for this.
From the Kubernetes documentation which accurately summarizes Persistent Volumes:
The PersistentVolume subsystem provides an API for users and administrators that abstracts details of how storage is provided from how it is consumed.
We can use Kubeflow Pipelines VolumeOp class to allow us to create an automatically managed persistent volume. This allows us to represent an op that will be translated into a resource template that creates a PersistentVolumeClaim. Let us now try to create a pipeline that is able to create a Persistent Volume and then have the next step write some data to the volume. We will start with some code and I will try to explain what is happening:
import kfpimport kfp.dsl as dsl@kfp.components.create_component_from_funcdef write_to_volume():    with open("/mnt/file.txt", "w") as file:        file.write("Hello world")@dsl.pipeline(    name="volumeop-basic",    description="A Basic Example on VolumeOp Usage.")def volumeop_basic(size: str="1Gi"):    vop = dsl.VolumeOp(        name="create-pvc",        resource_name="my-pvc",        modes=dsl.VOLUME_MODE_RWO,        size=size    )    write_to_volume().add_pvolumes({"/mnt": vop.volume})
The first step in the pipeline description tells it to use VolumeOp to create a Persistent Volume Claim and Persistent Volume and here we show a couple of options to customize the Volume creation particularly the resource_name, modes and the size of the volume. The modes argument allow us to specify the access modes for the Persistent Volume Claim and it can be any one of the:
ReadWriteOnce: The volume can be mounted as read-write by a single node.
ReadOnlyMany: The volume can be mounted read-only by many nodes.
ReadWriteMany: The volume can be mounted as read-write by many nodes.
Here we make a Persistent Volume Claim with the ReadWriteOnce access mode. Running this step should allow us to have a Persistent Volume created for us and well that does happen:
There are quite a few customizations you could add to making a Persistent Volume which can be quite helpful while designing larger stateful pipelines. Some things which might be of interest to you while using the VolumeOp could be adding Kubernetes Affinity, adding nodeSelector, or even Kubernetes tolerations which among others can be very useful and can be done easily with the VolumeOp.
Next up we have a component that simply writes "Hello World" to a new file. We want this to use the Persistent Volume we created which we can easily do using the .add_pvolumes() which as the name suggests allows you to sue this volume. It might be helpful for further customization to also check out .add_volume() which allows you to use a Kubernetes Volume you created, well we do that as well, but using .add_volume() you are not limited to volumes creating using the Kubeflow Pipelines VolumeOp.
Conditional execution
At the moment, all steps we define in the pipeline are run. One way to get around this would be to make a wrapper pipeline step and run some conditional in that however, this becomes difficult to implement for larger pipelines. With Kubeflow Pipelines we can make use of conditional executions via kfp.dsl.Condition.
Here is a very simple example taken from the samples showing conditional execution:
import kfpfrom kfp import dslfrom kfp.components import func_to_container_op, InputPath, OutputPath@func_to_container_opdef get_random_int_op(minimum: int, maximum: int) -> int:    """Generate a random number between minimum and maximum (inclusive)."""    import random    result = random.randint(minimum, maximum)    print(result)    return result@func_to_container_opdef process_small_op(data: int):    """Process small numbers."""    print("Processing small result", data)    return@func_to_container_opdef process_medium_op(data: int):    """Process medium numbers."""    print("Processing medium result", data)    return@func_to_container_opdef process_large_op(data: int):    """Process large numbers."""    print("Processing large result", data)    return@dsl.pipeline(    name="Conditional execution pipeline",    description="Shows how to use dsl.Condition().",)def conditional_pipeline():    number = get_random_int_op(0, 100).output    with dsl.Condition(number < 10):        process_small_op(number)    with dsl.Condition(number > 10 and number < 50):        process_medium_op(number)    with dsl.Condition(number > 50):        process_large_op(number)
We start by building all the steps we need in the same way as we did earlier and while running the pipeline we just use dsl.Condition to identify the steps we want to run.
Conclusion
Thank you for sticking with me until the end. I hope that you've taken away a thing or two about Kubeflow Pipelines, and how they work, and enjoyed reading this. If you learned something new or enjoyed reading this article, please share it so that others can see it. Until then, see you in the next post!
We will take forward what we talk about in this article in the next article in this series where we will take a deeper dive into Kubeflow Pipelines, until then, adieu!
You can also find me on Twitter @rishit_dagli, where I tweet about machine learning, and open-source.
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Automate repetitive tasks - Shell Scripting
Bhavya Sachdeva — Wed, 21 Dec 2022 12:30:42 GMT
Introduction to Shell
In simple terms, a shell is an interface that accepts user input in the form of commands and passes it on to an operating system and gives output.
It is a medium between the user and the operating system to communicate with each other.
The popular shells used on Linux are:
C Shell (csh)
Kron Shell (ksh)
Z Shell(zsh)
Prerequisites
Before discussing shell-scripting, we will discuss one real-life example i.e. just imagine you want to interact with someone in you should know the language. In a simple manner, if we want to learn shell scripting we should know some basic Linux commands.
To learn about Linux commands, there are many blogs you can refer like these two blogs on Linux Commands by Aayush and Bishal
Jump into the world of Shell Scripting
Now, we are done with learning some basic Linux commands using these referred blogs. So, we are fully ready to learn Shell Scripting.
Shell Scripting is basically a list of commands which are listed in the order of execution to do specific tasks.
Shell Scripting Shebang #!
This #! is called shebang or hashbang.
It is used to tell the kernel which interpreter should be used to run the commands present in the file.
For example #!/bin/bash It means the interpreter should be of the bash shell. #!/bin/zsh Here, this means the interpreter should be of z-shell.
First Script
In the below example, this is how our first script looks like
#!/bin/zsh# This is a comment!echo "Hey KubeSimplify!"
The first line tells Unix that the file is to be executed by /bin/zsh. It means the interpreter should be of z-shell.
The second line comprised of this # tells that it is a comment and it is completely ignored by the shell.
The third is comprised of this echo command that is used to display a line of text/string that is passed as an argument.
How we are going to execute this?
Firstly create any folder where we are going to write our script files. Here, we are going to create our first script named first.zsh using touch.
Now, use vi editor, we are going to create some scripts in our file first.zsh
Yay! Let's write our first script!
Now, run it with chmod u+x name_of_script in order to make the file executable.
You are done with your first script, and now we will learn about Variables.
Variables
Variables are like a box that is used to assign some values and that value can be of any type number, string or float.
Syntax of variable:
#!/bin/zshbest_community="Kubesimplify" echo $best_community
There should not be any space between the variable name and value. We can access the value of that particular variable by using this dollar $ sign.
Operators
Arithmetic Operators
Operator Define Usage
+ Addition a+b
- Subtraction a-b
* Multiplication a*b
/ Division a/b
% Modulus a%b
\= Assignment a=value
Relational Operators
Operator Define Usage
-eq It checks the value of two operands, and it will return true if both operands' value are equal. $a -eq $b
-ne It checks the value of two operands, and it will return true if both operands' values are not equal. $a -ne$b
-ge It checks the value of the left operands is greater than or equal to the value of the right operand, then it will return true. $a -ge$b
-le operand, $a -le$b
-gt operand, $a -gt$b
-lt operand, $a -lt$b
String Operators
Operator Define Usage
\= It checks if two strings are equal or not, if they equal to each other, it returns true. $string1=string2
!= It checks if two strings are not equal, if it will not equal to each other, it will return true. $string1!=$string2
-z It will check if a given string operand size is zero (0), and then it will return true. -z $String
-n It will check if a given string operand size is non-zero, and then it will return true. -n $String
File Operators
Operator Define Usage
-d It checks if the file is a directory, if it is a directory then it will return true. -d $filename
-f It checks if the file is an ordinary file or not, if it is an ordinary file then it will return true. -f $filename
-s It checks if the file exists and is not empty, if it is not empty it will return true. -s $filename
-x It checks if the file is executable or not, if it is executable it will return true. -x $filename
-w It checks if the file is writable or not, if it is writable it will return true. -w $filename
-r It checks if the file is in a readable format, if it is in readable form, it will return true. -r $filename
Conditionals
You can use conditional statements in your shell script to decide what to do in response to a condition or test.
If Statements
It is used to execute different instructions if the provided condition is true, otherwise, the task will not be carried out.
if [  ]then      <command>fi
If and else Statements
This is also similar to the if statements described above, but it additionally allows for a condition to be performed if the condition is not true. If the condition is false, the command or combination of commands () between else and fi will be run.
if [ test> ]then      else      fi
If elif else
This is used to work on different conditions statements.
if [  ]then      elif [  ]then      elif [  ]then      elif [  ]then      else      <command>fi
Loops
For
for item in LIST #Starting for loopdoCOMMANDS #Write commandsdone
Do while
while [ condition ]do   command1   command2   commandNdone
Functions
With the help of functions, we can divide a script's overall functionality into more manageable, logical sections that can be used separately as needed.
Let's look into "how to create functions". It is no way different rather than creating your normal functions.
#!/bin/zsh# Function definingKubeSimplify () {   echo "KubeSimplify is the best community for DevOps"}# Function CallingKubeSimplify
Uses
Resources
freeCodeCamp Article
Conclusion
In this blog, we have learned about Shell Scripting. Just try some hands-on shell scripting and create some amazing scripts just to automate your tasks. To learn more about these awesome topics, follow KubeSimplify Simplify DevOps Series. Don't forget to like and share this post if you liked this blog. Connect with me on Twitter. Follow me for more such blogs.
THANKS FOR READING 😄📖!!
Bhavya Sachdeva👩💻  
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Managing your Operating System with Package Managers.
Aayush Sharma — Fri, 16 Dec 2022 13:30:44 GMT
What is a Package?
Before actually learning about package managers, let us put some light on what is a package.
Almost every Linux and Unix-based software program is passed as packages, these are nothing but archives containing the pre-compiled binary software files, installation instructions, configuration information, and other software-related information.
If in today's world I ask you to download software, you would probably visit the website of the tools and click the download button to get it in your local system. But ever wondered what use to happen before this function was introduced?
Back then, software was installed from source code. Users have to jump to a file and check for the software requirements a particular tool needs to function completely, including stuff like binary locations, memory locations, and versions. To perform all these tasks, users used to create a configure script or makefile and then compile the software based on their needs by providing different conditions and handling all the dependencies on their own.
To overcome all this hard work and complex steps, Linux distributions started creating ready-to-use binary files for installing software together with certain information (version number, description, and dependencies).
Both Red Hat Linux and Debian invented the .deb or DEB packaging format and the .rpm or RPM packaging system, respectively. You require a package manager in order to interface with or use the packaging systems.
What are Package Managers?
Let's try to relate the term and understand it with a real-world example.
Consider Linux as a small boy, it is understood amongst us all that a small boy needs someone to help him learn various new traits such as walking, talking etc. That someone always ready to help are the parents of that small boy without him asking to do so.
In the same way, package managers act as parents of Linux to install, upgrade and remove software automatically.
Package Managers are those set of software tools that automates the installation, upgrade, configuration, and deletion of software packages within an operating system. The responsibility of a package manager is to provide an interface that helps the user manage the set of installed packages on their system.
Package Managers for different Operating Systems
For various operating systems, there are many types and styles of package managers, as we shall soon learn in this article.
Here are some of the common package managers for different operating systems:
- Linux:
Here are some of the Package Managers for various Linux Distributions:
- macOS:
There is no default package manager installed on macOS, but the user can install one of the most used package manager named Homebrew.
MacPorts and Fink are some alternatives.
- Windows:
Early windows used a third-party package manager named Chocolatey. But now Microsoft created its own package manager called Windows Package Manager(winget) which can be used virtually by anyone that uses Windows 10 or Windows 11.
Are Package Managers only specific to an Operating System?
Package Managers is not a generic concept, and its not exclusive to Linux. There are various software that has a package manager supporting their back to help them with various functions.
If you are someone who has been working on projects that involve coding languages such as Python or Java, you have probably used a package (dependency) manager by the name of pip or maven
List of various languages using package (dependency) manager:
Workings of a Package Manager
This is the overall visualization of how a package manager works:
Let's try to understand the process shown in the above picture
Let us assume that you are a developer using a package manager for doing various stuff. There are software repositories, which are essentially collections of software packages. Software packages of various types can be found in the repository. These repositories contain all the details related to a package such as - name, description, dependencies, version, etc.
Whenever you use a package manager in your local system, it first collects the data from the online repository. Before using the package manager to complete tasks, it is advised to run the command with the update flag to update the package manager with the latest version. After you run the command (package manager name) followed by the update keyword, the package manager fetches all the latest data from the repository which is uploaded to a provider and is ready for further use.
Moving ahead, when a developer starts using the package manager to fulfill the tasks, the package manager refers to the data that is being fetched by running the update keyword with the command to complete the required task.
Examples of Package Managers
1. Linux (Ubuntu):
Since I am using a playground by killercoda for running Ubuntu, I will be using apt as a package manager.
Before actually using apt to complete tasks, let us just update it:
apt update
We can easily see that apt has fetched the metadata from the repositories that are located somewhere on the official Ubuntu website.
Now, let us try to install another package manager named aptitude using apt.
Before that, let's check if the package manager is already present in the system by running aptitude help in our CLI:
aptitude help
As we see that bash is throwing the error and cannot find any file or directory of such name. So let us run the command apt install aptitude:
apt install aptitude
We see that the package along with the dependencies has been installed in our local system and to verify let us run the aptitude help again:
Now we can see that the package manager has been installed successfully and is working. Now if you want to uninstall the package you simply run apt remove aptitude and after running this command when you try to call aptitude again it will throw an error of no such file found:
apt remove aptitude
2. macOS:
As discussed, macOS does not have any default package manager installed, so will be using Homebrew as a package manager to run an example. To install Homebrew in your local system, you can refer to this website  brew.sh.
Let us just follow the old tradition of updating the package manager before using it for the actual task and to do that just hit brew update in your CLI:
brew update
Let us install minikube using brew:
brew install minikube
To check if minikube is installed, we can run minikube verison:
minikube version
It is seen in the above image that the minikube is successfully installed and now to remove it from the system just hit brew uninstall minikube:
brew install minikube
In the above image, it is confirmed that minikube is uninstalled as when you try to run minikube version it throws an error(minikube not found).
3. Windows:
winget is the package manager provided by windows. To check if winget is installed in your system, just open the command prompt and run winget --help:
winget --help
Let us try to install Mozilla Firefox using winget in Windows system by using winget install command:
winget install Mozilla.Firefox
It will ask for some administrator permissions:
In the same way, you can use winget to uninstall packages by running winget unistall command:
winget uninstall Mozilla.Firefox
Resources
Package Management | Ubuntu
Homebrew
Windows Package Manager
Conclusion
In this article, we learned about packages and various Package managers. There you are at the end of this blog post, I hope this blog helps you understand Package Managers. Don't forget to like and share this post if you liked this blog. Connect with me on Twitter. Follow me for more such blogs.
THANKS FOR READING 😄📖!!
#LEARNINPUBLIC #LEARNWHILEDOING
Aayush Sharma 👨🏻💻
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Kubernetes 1.26 - The electrifying release setup
Saloni Narang — Wed, 14 Dec 2022 12:30:42 GMT
Kubernetes 1.26 was released four days ago; a huge thanks to the release team for pulling off this awesome release!
This release includes:
37 Enhancement
11 graduating to stable
10 graduating to beta
16 coming into alpha
Some highlights
The new release images will now be under registry.k8s.io - This will provide faster downloads and also removes a single point of failure.
kubeadm init --image-repository=registry.k8s.io
CRI v1alpha2 removed - kubelet will not register the node if the container runtime doesn't support CRI v1. So to work with Kubernetes 1.26, containerd 1.6.0 is required.
Storage improvements -
1 - the vSphere and Azure in-tree driver migration to CSI have graduated to Stable.
2 - With 1.26 CSI drivers now have the option to apply the fsGroup settings during attach or mount time of the volumes.
3 - GlusterFs and OpenStack cinder in-tree storage is removed in this release.
Kuberentes release signing - graduates to beta and binaries now ship additional *.sig (signature) and *.cert (certificate) files side by side with the artifacts for verifying their integrity.
Not a Windows fan but I think this is also highlighted as a major feature in the release - Support for running privileged containers on windows nodes graduates to beta.
Kubernetes metrics include significant improvements: The framework moves to alpha with all the metrics documented here. /metrics/slis for Kubernetes binaries for better health dashboards of Kubernetes components.
PodSchedulingReadiness - You can now specify using the new schedulinggates feature in the pod to mark the pod as SchedulingGated.
nodeInclusionPolicy moves to beta - this is to indicate whether to take taints/tolerations into consideration when calculating Pod Topology Spread skew.
Loadbalancers can now use multiple protocols like UDP and TCP both.
Dynamic resource allocation will let the pods use external hardware resources
Many other features can be read here.
Now that we know some of the cool features, let's set up a Kubernetes cluster on Ubuntu 20.04 machines for version 1.26.
Prerequisites
4 Ubuntu 20.04 instances with ssh access to them, you can use any cloud provider to launch these instances
Each instance should have a minimum of 4GB of ram
Here I have 4 instances in place
controlplane 74.220.27.73
worker1 74.220.24.61
worker2 74.220.27.7
worker3 74.220.30.68
Let's being!!
Step 1 - Run this on all the machines
Kubeadm | kubectl | kubelet install
curl -s https://packages.cloud.google.com/apt/doc/apt-key.gpg | sudo apt-key add -echo "deb https://apt.kubernetes.io/ kubernetes-xenial main" | sudo tee /etc/apt/sources.list.d/kubernetes.listsudo apt update -ysudo apt -y install vim git curl wget kubelet=1.26.0-00 kubeadm=1.26.0-00 kubectl=1.26.0-00sudo apt-mark hold kubelet kubeadm kubectl
Load the br_netfilter module and let iptables see bridged traffic
sudo modprobe overlaysudo modprobe br_netfiltersudo tee /etc/sysctl.d/kubernetes.conf<
Setup Containerd
cat <# Setup required sysctl params, these persist across reboots.cat <# Apply sysctl params without rebootsudo sysctl --system#Install and configure containerd curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable"sudo apt update -ysudo apt install -y containerd.iosudo mkdir -p /etc/containerdcontainerd config default | sudo tee /etc/containerd/config.toml#Start containerdsudo systemctl restart containerdsudo systemctl enable containerd
Above will install contained version 1.6.12-1.
Pull the images, pulls the images for Kubernetes 1.26 version.
sudo kubeadm config images pull --image-repository=registry.k8s.io --cri-socket unix:///run/containerd/containerd.sock --kubernetes-version v1.26.0
Step2 - Run the kubeadm init command on the control plane node
Here the pod network CIDR is dependent on the CNI you will be installing later on, so in this case, I am using flannel, and --control-plane-endpoint will be the public IP for the instance (it can be private IP as well but if you want to access it from outside the node by using Kubeconfig then you need to give the public IP).
sudo kubeadm init --pod-network-cidr=10.244.0.0/16 --upload-certs --kubernetes-version=v1.26.0 --control-plane-endpoint=74.220.27.73 --cri-socket unix:///run/containerd/containerd.sock
The above command will give the following output
[init] Using Kubernetes version: v1.26.0[preflight] Running pre-flight checks[preflight] Pulling images required for setting up a Kubernetes cluster[preflight] This might take a minute or two, depending on the speed of your internet connection[preflight] You can also perform this action in beforehand using 'kubeadm config images pull'[certs] Using certificateDir folder "/etc/kubernetes/pki"[certs] Generating "ca" certificate and key[certs] Generating "apiserver" certificate and key[certs] apiserver serving cert is signed for DNS names [kube-1-1-1-26-1-5b02-7bcf18 kubernetes kubernetes.default kubernetes.default.svc kubernetes.default.svc.cluster.local] and IPs [10.96.0.1 192.168.1.21 74.220.27.73][certs] Generating "apiserver-kubelet-client" certificate and key[certs] Generating "front-proxy-ca" certificate and key[certs] Generating "front-proxy-client" certificate and key[certs] Generating "etcd/ca" certificate and key[certs] Generating "etcd/server" certificate and key[certs] etcd/server serving cert is signed for DNS names [kube-1-1-1-26-1-5b02-7bcf18 localhost] and IPs [192.168.1.21 127.0.0.1 ::1][certs] Generating "etcd/peer" certificate and key[certs] etcd/peer serving cert is signed for DNS names [kube-1-1-1-26-1-5b02-7bcf18 localhost] and IPs [192.168.1.21 127.0.0.1 ::1][certs] Generating "etcd/healthcheck-client" certificate and key[certs] Generating "apiserver-etcd-client" certificate and key[certs] Generating "sa" key and public key[kubeconfig] Using kubeconfig folder "/etc/kubernetes"[kubeconfig] Writing "admin.conf" kubeconfig file[kubeconfig] Writing "kubelet.conf" kubeconfig file[kubeconfig] Writing "controller-manager.conf" kubeconfig file[kubeconfig] Writing "scheduler.conf" kubeconfig file[kubelet-start] Writing kubelet environment file with flags to file "/var/lib/kubelet/kubeadm-flags.env"[kubelet-start] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml"[kubelet-start] Starting the kubelet[control-plane] Using manifest folder "/etc/kubernetes/manifests"[control-plane] Creating static Pod manifest for "kube-apiserver"[control-plane] Creating static Pod manifest for "kube-controller-manager"[control-plane] Creating static Pod manifest for "kube-scheduler"[etcd] Creating static Pod manifest for local etcd in "/etc/kubernetes/manifests"[wait-control-plane] Waiting for the kubelet to boot up the control plane as static Pods from directory "/etc/kubernetes/manifests". This can take up to 4m0s[apiclient] All control plane components are healthy after 7.507032 seconds[upload-config] Storing the configuration used in ConfigMap "kubeadm-config" in the "kube-system" Namespace[kubelet] Creating a ConfigMap "kubelet-config" in namespace kube-system with the configuration for the kubelets in the cluster[upload-certs] Storing the certificates in Secret "kubeadm-certs" in the "kube-system" Namespace[upload-certs] Using certificate key:74bfd9237ded9661ca3ee337057caba0be417c19b6493034ec0da3dbcffc8fff[mark-control-plane] Marking the node kube-1-1-1-26-1-5b02-7bcf18 as control-plane by adding the labels: [node-role.kubernetes.io/control-plane node.kubernetes.io/exclude-from-external-load-balancers][mark-control-plane] Marking the node kube-1-1-1-26-1-5b02-7bcf18 as control-plane by adding the taints [node-role.kubernetes.io/control-plane:NoSchedule][bootstrap-token] Using token: 3y24ca.kq73lohh99nzmcl5[bootstrap-token] Configuring bootstrap tokens, cluster-info ConfigMap, RBAC Roles[bootstrap-token] Configured RBAC rules to allow Node Bootstrap tokens to get nodes[bootstrap-token] Configured RBAC rules to allow Node Bootstrap tokens to post CSRs in order for nodes to get long term certificate credentials[bootstrap-token] Configured RBAC rules to allow the csrapprover controller automatically approve CSRs from a Node Bootstrap Token[bootstrap-token] Configured RBAC rules to allow certificate rotation for all node client certificates in the cluster[bootstrap-token] Creating the "cluster-info" ConfigMap in the "kube-public" namespace[kubelet-finalize] Updating "/etc/kubernetes/kubelet.conf" to point to a rotatable kubelet client certificate and key[addons] Applied essential addon: CoreDNS[addons] Applied essential addon: kube-proxyYour Kubernetes control-plane has initialized successfully!To start using your cluster, you need to run the following as a regular user:  mkdir -p $HOME/.kube  sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config  sudo chown $(id -u):$(id -g) $HOME/.kube/configAlternatively, if you are the root user, you can run:  export KUBECONFIG=/etc/kubernetes/admin.confYou should now deploy a pod network to the cluster.Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:  https://kubernetes.io/docs/concepts/cluster-administration/addons/You can now join any number of the control-plane node, running the following command on each as root:  kubeadm join 74.220.27.73:6443 --token 3y24ca.kq73lohh99nzmcl5 \    --discovery-token-ca-cert-hash sha256:f22dadb9c02bd9ac69b1819cbeaa11330ee70bb5fb6343f8b8a288b9ea83b00f \    --control-plane --certificate-key 74bfd9237ded9661ca3ee337057caba0be417c19b6493034ec0da3dbcffc8fffPlease note that the certificate-key gives access to cluster sensitive data, keep it secret!As a safeguard, uploaded-certs will be deleted in two hours; If necessary, you can use"kubeadm init phase upload-certs --upload-certs" to reload certs afterward.Then you can join any number of worker nodes by running the following on each as root:kubeadm join 74.220.27.73:6443 --token 3y24ca.kq73lohh99nzmcl5 \    --discovery-token-ca-cert-hash sha256:f22dadb9c02bd9ac69b1819cbeaa11330ee70bb5fb6343f8b8a288b9ea83b00f
Export KUBECONFIG and install CNI Flannel
mkdir -p $HOME/.kubesudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/configsudo chown $(id -u):$(id -g) $HOME/.kube/configexport KUBECONFIG=/etc/kubernetes/admin.confkubectl apply -f https://github.com/coreos/flannel/raw/master/Documentation/kube-flannel.yml
Step 3 - Run the join command on all the worker nodes
kubeadm join 74.220.27.73:6443 --token 3y24ca.kq73lohh99nzmcl5 \> --discovery-token-ca-cert-hash sha256:f22dadb9c02bd9ac69b1819cbeaa11330ee70bb5fb6343f8b8a288b9ea83b00f[preflight] Running pre-flight checks[preflight] Reading configuration from the cluster...[preflight] FYI: You can look at this config file with 'kubectl -n kube-system get cm kubeadm-config -o yaml'[kubelet-start] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml"[kubelet-start] Writing kubelet environment file with flags to file "/var/lib/kubelet/kubeadm-flags.env"[kubelet-start] Starting the kubelet[kubelet-start] Waiting for the kubelet to perform the TLS Bootstrap...This node has joined the cluster:* Certificate signing request was sent to apiserver and a response was received.* The Kubelet was informed of the new secure connection details.Run 'kubectl get nodes' on the control-plane to see this node join the cluster.
Step 4 - Nginx Test
You can copy the kubeconfig file from the controlplane node(~/.kube/config ) to local and export the KUBECONFIG variable or directly access the cluster from the controlplane node.
kubectl get nodesNAME                          STATUS   ROLES           AGE    VERSIONkube-1-1-1-26-1-5b02-7bcf18   Ready    control-plane   4m1s   v1.26.0kube-1-1-1-26-2-c673-7bcf18   Ready              59s    v1.26.0kube-1-1-1-26-3-be3b-7bcf18   Ready              54s    v1.26.0kube-1-1-1-26-4-dc16-7bcf18   Ready              52s    v1.26.0
The cluster is up and running with a single control plane and 3 worker nodes.
Now run nginx
kubectl run nginx --image=nginxpod/nginx createdkubectl expose pod nginx --type=NodePort --port 80service/nginx exposedkubectl get podsNAME    READY   STATUS    RESTARTS   AGEnginx   1/1     Running   0          10skubectl get svc nginxNAME    TYPE       CLUSTER-IP     EXTERNAL-IP   PORT(S)        AGEnginx   NodePort   10.109.33.40           80:32573/TCP   10s
Access the service using Node public IP:32573 (make sure your firewall rules are properly set to allow traffic to required ports)
YAY!! You have successfully set up a self-managed Kubernetes cluster, version 1.26.0 and containerd as the container runtime.
Saiyam Pathak created a Killercoda playground with Kubernetes 1.26. Give it a try -> K8s 1.26 playground
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.



Kubernetes Access Control with Authentication, Authorization & Admission Control
Bishal Das — Mon, 12 Dec 2022 12:30:42 GMT
Hey! are you using kubernetes? Have you ever wondered how a simple request you make, as a user gives you access to various Kubernetes objects like pods and deployments? "Access control", thats how everything works! 🤔
In this blog, lets learn what access control is and how K8s manages access permissions behind the scenes. So, let's get started ! 🔗
Suppose, you are travelling from your country to another country, and you don't have any permission to enter into that country. So firstly you have to authenticate yourself (who you are) to that country by your identity ex:- passport. Then you have to get authorization to enter into that country by your Visa and after that you still have to go through some customs checking which is admission control for k8s to successfully enter into that country. This is exactly what happens for kubernetes access control to get enter permission to the k8s cluster. Likewise, a user must be authenticated first, and then the particular user must be authorized to access the resources of Kubernetes. We will be using minikube cluster & kubectl cli in this blog for example.
All Kubernetes clusters have two categories of users:-
service accounts managed by Kubernetes
normal users account
Normal users account
Kubernetes does not have objects which represent normal user accounts. Normal users cannot be added to a cluster through an API call. They need to go through the access control processes to get added to the cluster. 
Though a normal user cannot be added via an API call, any user that presents a valid certificate (ex:- passport) signed by the cluster's certificate authority (CA) is considered authenticated. There is no username concept in the Kubernetes. In the configuration, Kubernetes determines the username from the common name field in the 'subject' of the cert (e.g., "/CN=Bob Killen"). From there, the role based access control (RBAC) sub-system would determine whether the user is authorized to perform a specific operation on a resource or not.
Service account
In contrast, service accounts are users managed by the Kubernetes API. Here, do not need the access control permission. They are bound to specific namespaces, and created automatically by the API server or manually through API calls. Each service account is associated with a secret, and each secret has a token. And that particular token is used for authentication.
Authentication Methods :-
Authentication step use to happen before reaching the request to api-server. Kubernetes uses client certificates, bearer tokens, or an authenticating proxy to authenticate API requests through authentication plugins. 
As HTTP requests are made to the API server, plugins attempt to associate the following attributes with the request:
Username: a string which identifies the end user. Common values might be kube-admin or jane@example.com.
UID: a string which identifies the end user and attempts to be more consistent and unique than username.
Groups: a set of strings, each of which indicates the user's membership in a named logical collection of users. Common values might be system:masters or devops-team.
Extra fields: a map of strings to list of strings which holds additional information authorizers may find useful.
So, Username, UID, Groups, and extra fields all will be added before the request goes to the authorization stage after the authentication is passed.
A lot of authentication methods are there. You can use one of them to authenticate a user.
  1. X509 Client Certs  2. Static Token File  3. Service Account Tokens  4. Bearer Tokens  5. OpenID Connect Tokens
we will be using X509 client certificate for authentication in this blog. For minikube you will see initially only one user minikube itself (admin) is there in the kubeconfig file for your cluster, and we can add another user who can access the cluster & perform some action and to add that user we have to go through these three admission control processes.
What is kubeconfig file?
kubeconfig is a yaml file located in ~./kube/config in your machine. This file will be available only after installing
 minikube. This is the main file which actually helps any user/client to access Kubernetes cluster and all its  resources. 
  This file contains all the user & corresponding cluster list itself. There are three sections in this yaml file -
  clusters, contexts & users. Just use this command in your terminal nano ~/.kube/config and you can   see that file. But make sure that your minikube cluster is running.
Authorization Methods :-
After successfully authenticating, that request will go to the authorization step. This authorization step actually happens in api-server. The Kubernetes API server may authorize a request using one of several authorization modes:-
Node - A special-purpose authorization mode that grants permissions to kubelets based on the pods they are scheduled to run.
ABAC - Attribute-based access control (ABAC) defines an access control paradigm whereby access rights are granted to users through the use of policies which combine attributes together. The policies can use any type of attributes (user attributes, resource attributes, object, environment attributes, etc.).
Webhook - A WebHook is an HTTP callback: an HTTP POST that occurs when something happens; a simple event-notification via HTTP POST. A web application implementing WebHooks will POST a message to a URL when certain things happen.
RBAC - Role-based access control (RBAC) is a method of regulating access to cluster or cluster resources based
 on the roles of individual users within an enterprise. In this context, access is the ability of an individual user to  perform a specific task, such as view, create, or modify a file. RBAC works typically for users and groups. 
 Suppose, there are three users called marketing, dev, prod. And you have created 3 roles READ, WRITE, DELETE.  You have to bind these roles by another k8s object called RoleBinding to the specific user. By those bound  role, they can perform the particular action which is bound to them.
Role and Role binding lives in a namespace level in Kubernetes. This means you could have a dev group who have access to the development namespace where all the development microservices (in pods, container) live for a development project. 
Or you have a marketing namespace where all the marketing system live & marketing users have access to that namespace. Roles & Role bindings will give those users access to that namespace because roles & role bindings lives in a namespace level. Also, to provide cluster level permissions meaning to all namespace to a user, you will have to use ClusterRole & ClusterRoleBinding. [ use these two carefully as it is giving whole cluster permission ]RBAC uses the rbac.authorization.k8s.io API group to drive authorization decisions, allowing admins to dynamically configure permission policies through the Kubernetes API.
Let's try with Hands-on
we will first authenticate a user named bob and then will give the authorization using RBAC
Authentication steps :-
At first, make sure that your minikube cluster is running & kubectl is installed. Check with this command minikube status.
#make a folder named RBAC mkdir RBAC cd RBAC # now install OpenSSL which will be used to generate key and cert (search on internet to get install command a/q your OS)
Now we will use ca.crt & ca.key of minikube which already exist in ~/.minikube folder. These two will be used to generate a certificate for user Bob Killen we are going to create in a moment. So copy these two files in your RBAC folder or whatever you gave your folder name.
User Certificates:-
First thing we need to do is create a certificate signed by our minikube CA (Certificate Authority). We have the CA, (ca.crt ca.key) Let's create a certificate for user Bob Killen:
#start with a private key (use this command)openssl genrsa -out bob.key 2048
So bob.key is generated. Now we have a key, we need a certificate signing request (CSR) which will be used to sign the cert for Bob using minikube CA. We also need to specify the groups that Bob belongs to. Let's pretend Bob is part of the Marketing group and will be developing applications for the Marketing.
# here we are requesting a csr by -output bob.csr and common name Bob Killenopenssl req -new -key bob.key -out bob.csr -subj "/CN=Bob Killen/O=Marketing"
Now bob.csr is generated in your RBAC folder. We will use this CSR to create a certificate named bob.crt. Use the minikube CA (which you copied ca.crt ca.key) to generate our certificate by signing our CSR. We may set an expiry on our certificate as well.
openssl x509 -req -in bob.csr -CA ca.crt -CAkey ca.key -CAcreateserial -out bob.crt -days 10
Here you can see we are using x509 client cert and passing bob.csr & also using ca.crt & ca.key as Certificate Authority (CA) to sign the CSR. And getting the output as bob.crt and expiry of this cert is 10 days. This is the main certificate which will be used for authentication.
Now we also have bob.crt along with bob.key. We will use this two to add user Bob in our ~/.kube/config file. We'll be trying to avoid messing with our current Kubernetes config. So let's tell kubectl to look at a new config that does not yet exist. Don't forget to run this command :
export KUBECONFIG=~/.kube/new-config
We are pointing KUBECONFIG environment variable from ~/.kube/config to ~/.kube/new-config. Otherwise, Bob user will be added in your main kubeconfig file which will be a little bit messed up.
Create a cluster entry which points to the cluster and contains the details of the CA certificate. Don't forget to copy api-server address from your ~/.kube/config file. Ex:- https://127.0.0.1:42323. It can be changed at any time, so keep your eye open to this address at ~/.kube/config file and change it in ~/.kube/new-config otherwise you will get this type of error below
The connection to the server 127.0.0.1:42323 was refused - did you specify the right host or port?
kubectl config set-cluster dev-cluster --server=https://127.0.0.1:42323 \--certificate-authority=ca.crt \--embed-certs=true#see changes that dev-cluster is added in cluster listnano ~/.kube/new-config
Here we are adding a new cluster named dev-cluster in your new kubeconfig file & also using the same ca.crt because this ca.crt is certificate of minikube cluster, and we are referencing this to get signed by its CA provided already.
Now add the user bob in the users section of your kube/new-config file
kubectl config set-credentials bob --client-certificate=bob.crt --client-key=bob.key --embed-certs=true
Now we will add context section into kube/new-config file. This context section is the important section which helps to connect a user with the cluster added in the list (dev-cluster with user bob).
kubectl config set-context dev --cluster=dev-cluster --namespace=marketing --user=bob
Here we are setting the namespace marketing as default in dev-cluster and user bob can access this namespace. We have set the context as named dev context. you can switch cluster by changing this current context. As in your kube/new-config, only one context is present named "dev", so you can't switch to another context.
Now just use this command, and you will be switched into your dev-cluster and context named dev & namespace is marketing
kubectl config use-context dev
Now your user authentication is successfully completed. Now bob can only access the dev-cluster not its resources like pods, deployment, service or whatever object in marketing namespace. If you run the command :
# commandkubectl get pods# outputError from server (Forbidden): pods is forbidden: User "Bob Killen" cannot list resource "pods" in API group "" in the namespace "marketing"
To access those objects, you have to authorize the user bob. We will use RBAC authorization here and will create a role and bind that role to user bob so that bob can access the resources of dev-cluster.
Authorization steps :-
Now go back to your minikube cluster from dev-cluster by this command :
export KUBECONFIG=~/.kube/config
As ~/.kube/config file is for minikube cluster, so changing env variable to pointing config not new-config. Now in your minikube cluster create a namespace named marketing and Bob will access this namespace from dev-cluster.
kubectl create namespace marketing
Now in the marketing namespace we will create a role and bind that role by RoleBinding so that user bob can access the resources like pod, deployment etc.
Create the role by this role.yaml file
apiVersion: rbac.authorization.k8s.io/v1kind: Rolemetadata:  namespace: marketing  name: manage-podsrules:- apiGroups: [""]  resources: ["pods", "pods/exec"]  verbs: ["get", "watch", "list", "create", "delete"]- apiGroups: ["apps"]  resources: ["deployments"]  verbs: ["get", "watch", "list", "delete", "create"]
Here we are using rbac.authorization.k8s.io/v1 apigroup and kind is Role & namespace: marketing & role-name is manage-pods. We are defining some rules to access the resources. From this above error
output
Error from server (Forbidden): pods is forbidden: User "Bob Killen" cannot list resource "pods" in API group  ""  in the namespace "marketing"
So to access pods it is telling that we should use "pods" in the resources list you can see in the yaml file we are using & also for pod apiGroup is "" which also are using. Also, we are using some verbs you can see which is the actual verbal command we will use like kubectl get pods. Actually kubectl get pods this command use list verb that's we have added list in the verbs array. Likewise, for deployment apiGroups "apps" is used and some verbs also added. So you can add more rules like this to access another resources like nodes, secret, service, namespace. For this, you have to write these 3 lines under rules section in your role.yaml for each resource.
Now create a rolebinding.yaml will be used to bind the role.yaml
apiVersion: rbac.authorization.k8s.io/v1kind: RoleBindingmetadata:  name: manage-pods  namespace: marketingsubjects:- kind: User  name: "Bob Killen"  apiGroup: rbac.authorization.k8s.ioroleRef:  kind: Role  name: manage-pods  apiGroup: rbac.authorization.k8s.io
Here kind is RoleBinding and in the metadata section, role binding name is manage-pods. In the subjects section you can see we are binding Role by roleRef section to the User Bob Killen and this name is exactly what we have set in /CN=Bob Killen. As I mentioned before that no username concept is there in k8s. Username always fetched from the common name from your provided certificate (bob.crt) for user, ex:- bob.
Now apply your role.yaml & rolebinding.yaml file to marketing namespace
 kubectl -n marketing apply -f role.yaml kubectl -n marketing apply -f rolebinding.yaml
Now check that whether role & role binding has been successfully created or not by this command:
kubectl get rolekubectl get rolebinding
You will see manage-pods is there. Now we have successfully bounded the role to user bob. So go back to your dev-cluster and try to access the resources pods, deployment, as we did not mention more than these two in role.yaml. Go to the dev-cluster by again changing KUBECONFIG env var pointer
export KUBECONFIG=~/.kube/new-config
Now you are on your dev-cluster. Test this command :
kubectl get pods
You will see this because you haven't created any pod in your marketing namespace in dev-cluster.
No resources found in marketing namespace.
Now you can create, delete, list your pods, and also you can exec your container running inside your pod as we have added pods/exec in the verbs array in role.yaml. So create a pod -
kubectl run nginx --image=nginx
kubectl get pods
Wohoo! Your pod is running, and user bob has been successfully authenticated & authorized 🎉
Admission Controllers :-
An admission controller is a piece of code that intercepts requests to the Kubernetes API server prior to persistence of the object, but after the request is successfully authenticated and authorized. And it is the last checking like customs checking. Admission controllers may be validating, mutating, or both. Mutating controllers may modify related objects to the requests they admit; validating controllers may not.
There are two special controllers: MutatingAdmissionWebhook and ValidatingAdmissionWebhook. These execute the mutating and validating (respectively) admission control webhooks which are configured in the API.
The admission control process proceeds in two phases. In the first phase, mutating admission controllers are run. In the second phase, validating admission controllers are run. Note again that some of the controllers are both.
If any of the controllers in either phase reject the request, the entire request is rejected immediately and an error is returned to the end-user. So if your request successfully passed the admission control process, you can access any resources what you want. One thing is that you can enable or disable this admission controller checking in your command by passing --enable-admission-plugins & --disable-admission-plugins.
So these are the three steps to reach to k8s api-server successfully!!
Kubernetes Service Accounts
So we've seen how to give permission to users, but what about applications or services running in our cluster ? Most business apps will not need to connect to the Kubernetes API unless you are building something that integrates with your cluster, like a CI/CD tool, an autoscaler or a custom webhook.
Generally, applications (not human user) will use a service account to connect with your cluster.
Let's deploy a service account  Go to your minikube cluster by changing KUBECONFIG env var. At first create a serviceaccount.yaml file
apiVersion: v1kind: ServiceAccountmetadata:  name: marketing-api
Name of the service account is marketing-api. Then apply your serviceaccount.yaml in marketing namespace.
kubectl -n marketing apply -f serviceaccount.yaml
Now we can deploy a pod that uses the service account. So create pod.yaml first
apiVersion: v1kind: Podmetadata:  name: shopping-apispec:  containers:  - image: nginx    name: shopping-api  serviceAccountName: marketing-api
Here pod name is shopping-api but we are using recently created serviceAccount named marketing-api. Assume that this pod is the actual application which is using the service account created recently, and suppose this application (shopping-api pod) is actually trying to get the all pod list in the marketing namespace. Like bob (human user) was trying to get all the pod list in marketing namespace but failed for the first time because we didn't create any role or role binding at that time for bob. Don't be confused with application and pod, we are using shopping-api pod as an application which is not a human user and this application could be anything which wants permission from cluster.
Then apply the pod.yaml in marketing namespace. Note:- We are still on minikube cluster
kubectl -n marketing apply -f pod.yaml
Now let's go inside the running shopping-api pod by this command:
kubectl -n marketing exec -it shopping-api -- bash
Then run this command :
ls -l /var/run/secrets/kubernetes.io/serviceaccount
Now suppose shopping-api pod is trying to get all the pod's list. So firstly we have to set all the necessary variable by fetching all the three values from service account which is namespace, token, ca.crt. Follow this below and run these commands :
# Point to the internal API server hostnameAPISERVER=https://kubernetes.default.svc# Path to ServiceAccount tokenSERVICEACCOUNT=/var/run/secrets/kubernetes.io/serviceaccount# Read this Pod's namespaceNAMESPACE=$(cat ${SERVICEACCOUNT}/namespace)# Read the ServiceAccount bearer tokenTOKEN=$(cat ${SERVICEACCOUNT}/token)# Reference the internal certificate authority (CA)CACERT=${SERVICEACCOUNT}/ca.crt# List pods through the API# Here we are using all the above set variable to get the list of pods by the shopping-api application (pod)curl --cacert ${CACERT} --header "Authorization: Bearer $TOKEN" -s ${APISERVER}/api/v1/namespaces/marketing/pods/ # we should see an error not having access
This error is coming because we haven't created any role or rolebinding for the serviceaccount yet. So we will create serviceaccount-role.yaml and serviceaccount-rolebinding.yaml in the minikube cluster so that shopping-api pod/application can get the permission from cluster-admin. So again go back to minikube cluster
Let's create serviceaccount-role.yaml
apiVersion: rbac.authorization.k8s.io/v1kind: Rolemetadata:  namespace: marketing  name: shopping-api-rolerules:- apiGroups: [""]  resources: ["pods"]  verbs: ["get", "watch", "list"]
Then create serviceaccount-rolebinding.yaml
apiVersion: rbac.authorization.k8s.io/v1kind: RoleBindingmetadata:  name: shopping-api  namespace: marketingsubjects:- kind: ServiceAccount  name: marketing-apiroleRef:  kind: Role  name: shopping-api-role  apiGroup: rbac.authorization.k8s.io
Here we are just changing the kind: from user to ServiceAcount as we are binding this for serviceaccount. Now apply those two yaml file in your minikube cluster.
kubectl -n marketing apply -f serviceaccount-role.yamlkubectl -n marketing apply -f serviceaccount-rolebinding.yaml
Now go to your dev-cluster and exec to your shopping-api application/pod by same command above and then set all the variable again which we did set before like APISERVER, SERVICEACCOUNT, TOKEN etc. Then again run this command:
curl --cacert ${CACERT} --header "Authorization: Bearer $TOKEN" -s ${APISERVER}/api/v1/namespaces/marketing/pods/
Now your application will successfully get the pod list as we have set role and role binding!! So we have seen how a user can get cluster permission by admission control and how an application (not human user) can get cluster permission via service account.
So, I hope this blog helped you to understand how access control works for a user and how service account works for applications in Kubernetes cluster.
Thanks for reading !!💖
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


12 Practical Grep Command Examples In Linux
sysxplore — Tue, 06 Dec 2022 12:30:42 GMT
In Linux command line, the grep command is a powerful tool for searching text within files.
In this article, I will go over 12 examples of grep command usage that every Linux user, sysadmin, and developer should be aware of.
What it is the Grep command on Linux?
If you're wondering what grep stands for, it stands for global regular expression print. You can use grep to search through files, or use it in conjunction with pipes or other commands to filter the output of another command. You'll find the grep command very useful in your day-to-day work as a sysadmin or Linux user once you've mastered it.
The syntax of the grep command is very simple:
$ grep [OPTION...] PATTERNS [FILE...]
In this article, I will use a file called 'linuxquotes.txt' to demonstrate how to use the commands. If you want to follow along, you can find the contents of the file here :
Grep command examples
Now that you know what grep is, let's look at some examples.
1. Finding all occurrences of a string in the given file.
In this example, we tell grep to look for the word "Linux" and provide the file to look into as an argument. If the string you want to search contains spaces, you must surround it with quotes:
(trawkali)-[~/articles]$ grep "Linux" linuxqoutes.txt
2. Finding all occurrences of a string in multiple files.
We can also instruct grep to search multiple files for a specific string. The following example demonstrates this clearly:
(trawkali)-[~/articles]$ grep "Linux" linuxqoutes.txt learnlinux.txt
Notice, the grep command does a good job of specifying the file in which it finds a specific match. This is why the grep command is so useful for searching for strings in files.
3. Filtering or Searching output of another command
Grep can, as previously stated, be used to filter or search the output of another command. This is made possible by making use of the command line chaining operator pipe (|).
(trawkali)-[~/articles]$ head -n 12 linuxqoutes.txt | grep "Linux"
4. Display Line Numbers Containing Matches
Using the -n option in conjunction with grep, it will display the line numbers containing matches as well as their respective matches:
(trawkali)-[~/articles]$ grep -n "Linux" linuxqoutes.txt
5. Making grep search case insensitive
Grep search is case sensitive by default. If you want the search to be  case insensitive you can use -i option.
(trawkali)-[~/articles]$ grep -i "LINUX" linuxqoutes.txt
In the preceding example, we searched for the word "LINUX" and grep returns the words Linux and LINUX as matching.
6. Using regular expressions with grep
The search string can be a regular expression, which makes grep quite strong. However, I will not go into detail on how to use regexp with grep in this article.
The following example will search for lines which contains any digits from 0 up to 9.
(trawkali)-[~/articles]$ grep "[0-9]" linuxqoutes.txt
You can boost the power of your search by employing a regex pattern. There are special grep options that allow you to use a regex pattern.
e - enables the use of regex patterns.
E - enables the use of extended regex patterns.
G - enables the use of basic regex patterns.
P - enables the use of perl regex patterns.
Here, we used extended regular expressions to search for a word which starts with any characters (*) and ends with (sh). 
(trawkali)-[~/articles]$ cut -d ":" -f 7 /etc/passwd | grep -E "*sh$"
Notice, if we don't use the "-E" option grep won't display anything, this shows that without the "-E" parameter grep command won't recognize the provided pattern.
7. Displaying all the lines that DO NOT match a given pattern.
Another thing you might find useful is to use the "-v" option to reverse the result, eliminating all the lines that match a specific search string:
Here we eliminated lines which do not contain the word "Linus Torvalds":
8. Combining grep options
Grep, like any other Linux command, can combine several options to perform multiple tasks at once. In this case, we combined the -v and -i options to instruct the grep command to be case-insensitive with the pattern and to only display lines that DON'T match that given pattern:
(trawkali)-[~/articles]$ grep -vi "Linux" linuxqoutes.txt
9. Find Exact Match Words.
If you search for the phrase 'Lin,' grep will also return lines containing the words 'Linux' or 'Linus.' 
Fortunately, grep has option "-w"  which allows it search and match the exact whole words only. In this case grep didn't find any match because the file "linuxqoutes.txt" doesn't contains the word "Lin".
(trawkali)-[~/articles]$ grep -w "Lin" linuxqoutes.txt
10. Searching in all files recursively
With the grep option "-r", you can execute a recursive search. It will look for the specified pattern in all files in the current directory and its sub-directories. Notice, I tacked "-i"  with "-r option" to make the search case insensitive:
(trawkali)-[~/articles]$ grep -ir "linux"
11. Print only names of FILEs with matching lines
Grep displays the matching lines by default. If you only want to know which files contain the string, use the following grep options:
r - recursively search every file in the current dir
l - print only names of FILEs with matches
i - this option is optional, I have added it since I want my search to be case-insensitive:
(trawkali)-[~/articles]$ grep -irl "linux"
12. Print only names of FILEs with no matching lines
If you only want to know which files do contain the string, use the following command:
-r - recursively search every file in the current directory.-L - print only names of FILEs without the matching lines.
(trawkali)-[~/articles]$ grep -rL "Ubuntu"
Bonus
To help you remember grep commands while using Linux, I have made this cheatsheet. So feel free to download and save it for quick reference.
Conclusion
Those were some straightforward grep examples. If you read the man page for this command, you'll notice that it has a plethora of additional parameters and uses. This information should be sufficient to help you understand the Linux grep command and how to use it.
That's all! Thank you for getting this far. I hope you find this article useful. If you did found this article valuable: 
Toss us a follow for more amazing articles on Linux, sysadmin and security 
And be sure to share with other Linux folks who you think it might be useful to them.
Like the blog? Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


Introduction to CRI
Dipankar Das — Tue, 29 Nov 2022 12:30:42 GMT
Prerequisites
Docker
Kubernetes
What is CRI?
The CRI is a plugin interface which enables the kubelet to use a wide variety of container runtimes, without having a need to recompile the cluster components.
What is the Open container initiative (OCI)?
The Open Container Initiative is an open governance structure for the express purpose of creating open industry standards around container formats and runtimes.
Containerd
It is a CNCF Graduated project which is OCI Standard. Its is a container manager tool. All other container management tools make use of containerd to run containers Like Kubernetes, Docker, Nerdctl
Why learn it?
However, since it's the closest thing to the actual containerd API, it can serve as a great exploration means - by examining the available commands, you can get a rough idea of what containerd can and cannot do. You can also replace the other tools with containerd + nerdctl in your local environment. It is also used in kubernetes as default container runtime for debugging
Nerdctl, Crictl, ctr
These are CLI tools for running and managing the containers being run by containerdAmong these ctr is pre shipped when containerd is installed, and youve to install crictl and nerdctl manually. 
Local Setup
Arkade
One of the best hassle-free way to install containerd along with nerdctl is to use arkade. If youre hearing about arkade for the first time, checkout the blog here
https://blog.kubesimplify.com/arkade
arkade system install containerd arkade get nerdctl arkade system install cni -p /usr/libexec/cni
Package managers
The containerd.io packages in DEB and RPM formats are distributed by Docker (not by the containerd project). See the Docker documentation for how to set up apt-get or dnf to install containerd.io packages:
CentOS, Debian, Fedora, UbuntuThe containerd.io package contains runc too, but does not contain CNI plugins.
sudo yum install -y containerd.ioSudo apt install -y containerd.io
We need CNI (Container networking interface)Installing CNI pluginsDownload the cni-plugins---.tgz archive from Tar File to download, verify its sha256sum, and extract it under /opt/cni/bin
$ mkdir -p /opt/cni/bin$ tar Cxzvf /opt/cni/bin cni-plugins-linux-amd64-.tgz
Install Nerdctl via brew
$ brew install nerdctl# do this when you need access to nerdctl in root user (Root install)$ sudo cp -v /home/linuxbrew/.linuxbrew/bin/nerdctl /usr/local/bin/$ sudo nerdctl version
What are the options and why in ctr
ctr run  
So all setup is done, let's go through some concepts
What is namespace, tasks in containerd terms
Task is the runtime state of the container.
Namespace in the context of containerd is the logical separation between tools which use containerd like docker has namespace of moby and nerdctl uses default.
NOTE:  Instead of building images with ctr, you can import existing images built with docker build or other OCI-compatible software. Surprisingly, containerd doesn't provide out-of-the-box image building support. However, containerd itself is often used to build images by higher-level tools.
Start and stop container
# When pulling images, the fully-qualified reference seems to be required, so you cannot omit the registry or the tag part$ ctr images pull docker.io/library/hello-world:latest$ ctr run docker.io/library/hello-world:latest helloHello from Docker!..$ ctr -namespace default container ls$ ctr container rm hello
When you do ctr create container, then task are not automatically created
When using ctr run task is created
Create Namespace
$ ctr ns create $ctr ns rm 
ctr run command is actually a shortcut for ctr container create + ctr task start
Lets exec into a container which is running in detached mode
Now to stop the container
Got error
# first stop the taskctr task kill nginxctr task ls# will see that  the taks is now stopper statectr c rm nginx
Now you understand that it's not very practical to use ctr in day to day use only for debugging, so there is another alternative to docker cli which is nerdctl which is almost compatible with most of the docker commands and is easy to use
Translate docker learnings to nerdctl
Almost all the docker commands are there in nerdctl (Very flat learning curve) with some additional as namespace create and delete .
If you want, you can also set an alias using the command alias docker=nerdctl and keep using the docker command, and it will use nerdctl under the hood. 
Playground Link
Some more resourceLive stream on containerd and nerdctl
Authors
Anurag
Dipankar
Like what you read? Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.


How to setup your ftp server in Linux
sysxplore — Mon, 21 Nov 2022 12:30:42 GMT
In this tutorial I'm going to show you how you can setup your own ftp server on linux. But before we begin I'm going to give you a brief description of what ftp is.
What is FTP(File Transfer Protocol)?
FTP is an acronym for File Transfer Protocol. As the name suggests, FTP is used to transfer files between computers on a network. You can use FTP to exchange files between computer accounts, transfer files between an account and a desktop computer, or access online software archives. Keep in mind, however, that many FTP sites are heavily used and require several attempts before connecting.
An FTP address looks a lot like an HTTP or website address, except it uses the prefix ftp:// instead of http://.
Typically, a computer with an FTP address is dedicated to receive an FTP connection and a computer dedicated to receive an FTP connection is referred to as an FTP server or FTP site.
How to set it up
Now you know what FTP is, lets begin a special adventure. We will make FTP server to share files with friends and family. I will use vsftpd for this purpose.
VSFTPD is an FTP server for Unix-like systems, including Linux. It is the default FTP server in the Ubuntu, CentOS, Fedora, NimbleX, Slackware and RHEL Linux distributions. In fact, the first two letters in VSFTPD, stand for very secure. The software was built around the vulnerabilities of the FTP protocol.
Nevertheless, you should always remember that there are better solutions for secure transfer and management of files such as SFTP (uses OpenSSH). The FTP protocol is particularly useful for sharing non-sensitive data and is very reliable for that.
Step 1: Installing VSFTPD in Linux
You can quickly install VSFTPD on your Fedora/Red Hat/SUSE servers/pc by issuing the following command on your terminal
0xtraw@xtremepentest# dnf -y install vsftpd
If you are using Ubuntu/Debian-based distributions, you can install VSFTPD using this command:
0xtraw@xtremepentest# sudo apt-get install vsftpd
If you are using Arch-based distributions, try this command for installing VSFTPD.
0xtraw@xtremepentest# sudo pacman -S vsftpd
That's basically it for the installation, now let's quickly jump into setting it up
Step 1: Installing VSFTPD on Linux
You can quickly install VSFTPD on your Fedora/Red Hat/SUSE servers through the command line interface with:
dnf -y install vsftpd
If you are using Ubuntu/Debian-based distributions, you can install VSFTPD using this command:
sudo apt-get install vsftpd
If you are using Arch-based distributions, try this command for installing VSFTPD.
sudo pacman -S vsftpd
Step 2: Configuring FTP server
The vsftp config file is usually located in /etc/vsftpd.conf. The config file itself is well-documented, so this section will only highlight some important changes you may want to make. For all available options and basic documentation, see the man pages by simply issuing the following command on your terminal.
man vsftpd.conf
And files are served by default from /srv/ftp directory as per the Filesystem Hierarchy Standard. You can use an available text editor  of your choice for editing the ftp config file (/etc/vsftpd.conf). If in my case I will be using nano editor which comes pre-installed on most linux distros. If you also want to use nano issue the following command.
0xtraw@xtremepentest# nano /etc/vsftpd.conf
Enable Uploading to the FTP server:
The write_enable flag must be set to YES in order to allow changes to the filesystem, such as uploading: If this entry is comment, uncomment it by simply removing the leading # sign
write_enable=YES
Allow Local Users to Login:
In order to allow users in /etc/passwd to login, the local_enable directive must look like this:
local_enable=YES
Anonymous Login
The following lines control whether anonymous users can log in:
# Allow anonymous loginanonymous_enable=YES# No password is required for an anonymous login (Optional)no_anon_password=YES# Maximum transfer rate for an anonymous client in Bytes/second (Optional)anon_max_rate=30000# Directory to be used for an anonymous login (Optional)anon_root=/example/directory/
Chroot Jail
It is possible to set up a chroot environment, which prevents the user from leaving his/her home directory. To enable this, add/change the following lines in the configuration file:
chroot_list_enable=YES chroot_list_file=/etc/vsftpd.chroot_list
The chroot_list_file variable specifies the file in which the jailed users are contained to.
Now done setting our ftp server, it's time to get it up and running!
Step 4: Restart your FTP server
To get your ftp server up and running with the new configurations, type the following command on your terminal and hit enter
0xtraw@xtremepentest# sudo systemctl restart vsftpd
Congrats if you have reached this far.  If you have any problem setting up the ftp server feel free to dm on Twitter xtreme pentesting
Follow Kubesimplify on Hashnode, Twitter and LinkedIn. Join our Discord server to learn with us.

Hostname	Role	Private IP	public IP
lb-0	LoadBalancer	192.168.1.8	74.220.22.92
-	-	-	-
db-0	Etcd-0	192.168.1.2	-
db-1	Etcd-1	192.168.1.3	-
db-2	Etcd-2	192.168.1.4	-
-	-	-	-
cp-0	Control-Plane-0	192.168.1.9	-
cp-1	Control-Plane-1	192.168.1.10	-
cp-2	Control-Plane-2	192.168.1.11	-
-	-	-	-
wp-0	Worker-Plane-0	192.168.1.12	-
wp-1	Worker-Plane-1	192.168.1.13	-

Option	Description
if	Specifies the input file (source).
of	Specifies the output file (destination).
bs	Defines the block size to read from the input file and write to the output file.
count	Specifies the number of blocks to copy.
skip	Skips a specific number of blocks or bytes while reading the input file.
seek	Skips a specific number of blocks or bytes while writing to the output file.
status	Shows the progress of the dd command.
conv	Specifies conversion options for the input or output file.

Operator	Define	Usage
+	Addition	a+b
-	Subtraction	a-b
*	Multiplication	a*b
/	Division	a/b
%	Modulus	a%b
\=	Assignment	a=value

controlplane	74.220.27.73
worker1	74.220.24.61
worker2	74.220.27.7
worker3	74.220.30.68

Kubesimplify

Perform CRUD Operations on Kubernetes Using Golang

Getting Started - Understanding the Basics

Familiarity with Kubernetes API Concepts

Importance of Using Client Libraries

Exploring Client-go

Demo - CRUD Operations on Pod

Prerequisites

Step 1 - Creating a Kubernetes Cluster

Step 2 - Initial Project Setup

Step 3 - Create a new Kubernetes Client

Step 4 - Retrieving All the Current Pods

Step 5 - Create a Pod

Step 6 - Update an Existing Pod

Step 7 - Delete an Existing Pod

Additional Configurations Options In Client-go

Alternate Way to Kubeconfig Setup

Alternate Way to Create a New Client

Conclusion

Resources

Optimizing Scalability: A Deep Dive into Load Testing with Locust on EKS

Introduction

Prerequisites

Understanding Horizontal Pod Autoscaler

Introduction to Locust

Create VPC and EKS using the Terraform module

Install monitoring components on the cluster

Deploy sample app

Deploy Locust

Demo

Observation: Cluster Scaling

HPA

Pending State

Automatic Node Creation by Cluster Autoscaler

Transition to Running State for Pods

Conclusion

Introducing Unikraft - Lightweight Virtualization Using Unikernels

Understanding Unikernels

Evolution from VMs and Containers

Unikernels v/s Traditional OSes

Why Unikraft?

Key Features

Performance

Security

Efficiency

Compatibility

Potential Use Cases and Applications

Get Started Using Unikraft

Step 1 - Install the kraft CLI

Step 2 - Using the Application Catalog

Step 3 - Starting an Nginx Server

Step 4 - Verify the Nginx Unikernel

Conclusion

Resources

Kubernetes on Apple MacBooks (M Series)

Pre-requisites

Provision the VMs

Provisioning the controlplane instance (kubemaster)

Provisioning the first worker node (kubeworker01)

Provisioning the second worker node (kubeworker02)

Configure the local DNS

Install Kubernetes

Versions

Install and configure prerequisites

Forwarding IPv4 and letting iptables see bridged traffic

Install a Container Runtime

Step 1: Install containerd

Step 2: Install runc

Step 3: Install CNI plugins

Install kubeadm, kubelet and kubectl

Configure crictl to work with containerd

Initializing the controlplane node

Install a Pod network add-on

Join the worker nodes to the cluster

Validation

Backup and Restore

Backup

Restore

Cleanup

Resources

Exploring `Client-go`

Additional Configurations Options In `Client-go`

Step 1 - Install the `kraft` CLI

Provisioning the controlplane instance (`kubemaster`)

Provisioning the first worker node (`kubeworker01`)

Provisioning the second worker node (`kubeworker02`)

`Install a Container Runtime`

`Step 1: Install containerd`

`Step 2: Install runc`

`Step 3: Install CNI plugins`

`Install kubeadm, kubelet and kubectl`

`Configure crictl to work with containerd`

`Initializing the controlplane node`

`Install a Pod network add-on`

`Join the worker nodes to the cluster`

`Validation`

`Backup and Restore`

`Backup`

`Restore`

`Cleanup`

`Resources`

Step 4 - Making the HTTP request Using `curl`