CI/CD Pipeline

Deploy a CI/CD pipeline into your home lab

All files are available here - https://github.com/arbaaz29/pipeline/tree/main

Docker:

Docker is an open-source platform that simplifies application development, deployment, and management using containerization. Containers package applications and their dependencies into a lightweight, portable unit that runs consistently across different environments. Unlike traditional virtual machines, Docker containers share the host OS kernel, making them more efficient and faster to start.

With Docker, developers can:

Ensure consistency across development, testing, and production environments.
Easily scale applications by deploying multiple container instances.
Improve resource utilization compared to traditional virtualization.

Key Docker components include Docker Engine, Docker Images, Docker Containers, and Docker Compose. It is widely used in DevOps, microservices architectures, and cloud-native development.

For installation, follow the steps from the official website:

https://docs.docker.com/engine/install/

Containerize the applications

# go to respective directory and build images
# command
docker build -t arbaazij/back_jnks App-files/backend/.
docker build -t arbaazij/front_jnks App-files/frontend/.

Test using docker-compose

# got to App-files there is a docker-compose manifest
docker compose up # if you are using it freshly

docker compose build --no-cahce # build from scratch
docker compose up #will use the newly built images

Push the built images to docker repo

# you need to login to docker via terminal to push it to your repository
docker login

docker push arbaazij/back_jnks
docker push arbaazij/front_jnks

docker push {your docker username}/{name you want to gave to your image}

Kubernetes (Minikube):

https://minikube.sigs.k8s.io/docs/start/?arch=%2Fwindows%2Fx86-64%2Fstable%2F.exe+download

Kubernetes (K8s) is an open-source container orchestration platform that automates containerized applications deployment, scaling, and management. Kubernetes enables organizations to run applications reliably across clusters of machines.

Key features of Kubernetes include:

Automated Scaling – Adjusts application instances based on demand.
Self-healing – Restarts failed containers and reschedules them as needed.
Load Balancing – Distributes traffic efficiently across containers.
Service Discovery – Provides built-in mechanisms to find and communicate with services.
Declarative Configuration – Uses YAML manifests to define desired application states.

Kubernetes is widely used for managing microservices, hybrid cloud deployments, and large-scale distributed applications.

Start minikube ( ubuntu ):

# ubuntu
minikube start --memory=8192 # starting minikube with 2 Cpu cores and 8 GB of memory (RAM)
# mac
minikube start --driver=hyperkit
# windows
minikube start --driver=hyperv

Install add-ons:

minikube addons enable ingress
minikube addons enable metrics-server

Jenkins:

Jenkins is an open-source automation server used for continuous integration (CI) and continuous delivery (CD) in software development. It allows developers to automate building, testing, and deploying applications, ensuring faster and more reliable software releases.

Key Features of Jenkins:

Pipeline Automation – Automates the software delivery process using declarative or scripted pipelines.
Extensibility – It supports hundreds of plugins that can be integrated with various tools like Git, Docker, Kubernetes, and AWS.
Distributed Builds – Runs tasks on multiple machines to improve efficiency.
Version Control Integration – Works seamlessly with Git, SVN, and other version control systems.
Customizable Workflows – Enables teams to define workflows tailored to their development needs.

Jenkins is widely used in DevOps to streamline CI/CD pipelines, making software development more efficient and scalable.

Install Jenkins: https://www.jenkins.io/doc/book/installing/linux/
Add Jenkins to the Docker group:
```
usermod -aG Jenkins docker
```

Install plugins (Manage Jenkins → Plugins):

jdk
nodejs
docker pipeline
docker
sonarqube quality gates
OWASP dependency check
SonarQube Scanner

Restart the Jenkins service:
```
http://localhost:8080/restart
```
Go to Manage Jenkins→ System and configure SonarQube installations
1. Keep a note of the Name field (The scm scripts environment will refer these names)
Go to Manage Jenkins → Tools configure jdk, SonarQube Scanner installations, NodeJS Installations, and Docker Installations
1. Keep a note of the Name field (The scm scripts environment will refer these names)

Setup credentials:

# add git token as a secret text as well
git token # (username and token) # repo read and write
# use the same git token to create a secret text credential

# save the Sonarqube Token as secret text
sonarqube token # global 

# save the docker username and docker token as username and passwords
docker token # (username and token)

# all the tokens should be added to global groups as you will be using it in frontend and backend scm

Create a project:

# create a project give it a name and select cateogry pipeline
# Use gitscm polling 
# Pipeline Script from SCM
    ## select git for scm
# Paste your git repository's link 
# select github username and password
# specify branch path specify the path for jenkinsfile (case sensitive)

Deploying a pipeline:

# Comment out everything and try building one stage at a time for debugging purpose
# BUILD_ID is a in-built function in jenkins

Pipeline builds:

SonarQube:

SonarQube is an open-source platform for continuous code quality and security analysis. It helps developers identify bugs, vulnerabilities, and code smells in applications by performing static code analysis on various programming languages. SonarQube integrates seamlessly into CI/CD pipelines to ensure high-quality and maintainable code.

Key Features of SonarQube:

Code Quality Analysis – Detects code smells, duplicated code, and maintainability issues.
Security Scanning – Identifies vulnerabilities and security hotspots.
Technical Debt Measurement – Provides insights into the effort needed to fix code issues.
Multi-language Support – Analyzes over 25 programming languages, including Java, Python, and JavaScript.
Integration with DevOps Tools – Works with Jenkins, GitHub, GitLab, Bitbucket, and more.

SonarQube helps teams enforce coding standards, improve security, and maintain clean, reliable code throughout the software development lifecycle.

This is a docker-compose file to create a sonar qube instance with persistent storage so that you don’t need to create a new api key and user for every time you restart sonarqube

version: "3"
services:
  db:
    image: postgres:12-alpine
    environment:
      - POSTGRES_USER=sonar
      - POSTGRES_PASSWORD=sonar
      - POSTGRES_DB=sonar
    volumes:
      - postgres_data:/var/lib/postgresql/data
    networks:
      - bridge

  sonarqube:
    image: sonarqube:community
    depends_on:
      - db
    environment:
      - sonar.jdbc.username=sonar
      - sonar.jdbc.url=jdbc:postgresql://db/sonar
      - sonar.jdbc.password=sonar
    ports:
      - "9000:9000"
    volumes:
      - sonar_conf:/opt/sonarqube/conf
      - sonar_data:/opt/sonarqube/data
      - sonar_extensions:/opt/sonarqube/extensions
      - sonar_plugins:/opt/sonarqube/lib/bundled-plugins
    networks:
      - bridge

networks:
  bridge:

volumes:
  sonar_conf:
  sonar_data:
  sonar_extensions:
  sonar_plugins:
  postgres_data:

Setup →
1. Access the instance at port 9000, i.e., http://localhost:9000
2. Create a local project
3. The name you give to the project is projecKey; in this case, it is npm_back for backend and npm_front for frontend
4. After creating a project, on the top right corner, you will find Security under My Account. There is an option to generate tokens. Give it a name and select the type of global analysis token if you want to grant admin access or select project analysis token so you grant only access to the respective project. Set an expiration date in click on generate now copy paste the token somewhere safe for the time being, once generated you cannot view it again after closing the tab

SonarQube uses code scores and quality gates to evaluate and enforce the quality of your code

Code Scores: These are numerical ratings given to different aspects of your code, like code complexity, duplication, and potential bugs. Higher scores usually indicate better quality, with the focus being on improving maintainability, security, and reliability
Quality Gates: A quality gate is a set of conditions that your code must pass to be considered acceptable for deployment. It typically includes criteria like no new critical bugs, no security vulnerabilities, and sufficient test coverage. If the code fails the quality gate, it won’t be promoted to production
These tools help ensure that the code is both high-quality and secure, reducing the risk of defects in production

Trivy:

Trivy is an open-source vulnerability scanner designed to detect security issues in container images, file systems, and code repositories. It helps developers and security teams identify vulnerabilities, misconfigurations, and exposed secrets early in the development cycle.

Key Features of Trivy:

Container Image Scanning – Detects vulnerabilities in Docker and OCI images.
Infrastructure as Code (IaC) Scanning – Identifies misconfigurations in Kubernetes, Terraform, and more.
Code Repository Scanning – Finds exposed secrets and security risks in source code.
Database & OS Scanning – Supports scanning for package vulnerabilities in Linux distributions.
Fast & Lightweight – Minimal setup with quick scanning capabilities.

Trivy integrates seamlessly into CI/CD pipelines and security workflows, making it a powerful tool for DevSecOps and cloud security.

Install trivy:

https://trivy.dev/v0.18.3/installation/

sudo apt-get install wget apt-transport-https gnupg lsb-release
wget -qO - https://aquasecurity.github.io/trivy-repo/deb/public.key | sudo apt-key add -
echo deb https://aquasecurity.github.io/trivy-repo/deb $(lsb_release -sc) main | sudo tee -a /etc/apt/sources.list.d/trivy.list
sudo apt-get update
sudo apt-get install trivy

OLM (Operator Lifecycle Management):

Operator Lifecycle Manager (OLM) is a framework in Kubernetes that helps manage the installation, upgrade, and lifecycle of Operators—specialized controllers that extend Kubernetes functionalities. OLM simplifies Operator deployment by handling dependencies, updates, and permission management, ensuring seamless application management within a cluster.

Key Features of OLM:

Automated Operator Installation & Upgrades
Dependency Management for Operators
RBAC (Role-Based Access Control) Handling
Operator Catalog for Discoverability

OLM is widely used in Red Hat OpenShift and Kubernetes environments to streamline the deployment and maintenance of Operators, making application management more efficient.

Manual Install:

kubectl create -f https://github.com/operator-framework/operator-lifecycle-manager/releases/download/v0.31.0/crds.yaml
kubectl wait --for=Established -f https://github.com/operator-framework/operator-lifecycle-manager/releases/download/v0.31.0/crds.yaml
kubectl create -f https://github.com/operator-framework/operator-lifecycle-manager/releases/download/v0.31.0/olm.yaml
kubectl rollout status -w deployment/olm-operator --namespace=olm
retries=30
until [[ $retries == 0 ]]; do
    new_csv_phase=$(kubectl get csv -n olm packageserver -o jsonpath='{.status.phase}' 2>/dev/null || echo "Waiting for CSV to appear")
    if [[ $new_csv_phase != "$csv_phase" ]]; then
        csv_phase=$new_csv_phase
        echo "Package server phase: $csv_phase"
    fi
    if [[ "$new_csv_phase" == "Succeeded" ]]; then
        break
    fi
    sleep 10
    retries=$((retries - 1))
done

if [ $retries == 0 ]; then
    echo "CSV \"packageserver\" failed to reach phase succeeded"
    exit 1
fi

ArgoCD:

ArgoCD is a declarative, GitOps-based continuous delivery (CD) tool for Kubernetes. It ensures that applications deployed in a cluster match the desired state defined in a Git repository. ArgoCD continuously monitors Kubernetes configurations and automatically syncs changes, making deployments more reliable and automated.

Key Features of ArgoCD:

GitOps Workflow – Uses Git as the single source of truth for deployments.
Automated Syncing – Ensures Kubernetes resources match the desired state.
Declarative UI & CLI – Provides a user-friendly interface for managing applications.
RBAC & Multi-Tenancy – Supports secure access control and multi-team collaboration.
Integration with CI/CD Pipelines – Works alongside Jenkins, GitHub Actions, and other CI tools.

ArgoCD simplifies Kubernetes application management, making it an essential tool for modern DevOps and cloud-native deployments.

Install ArgoCD CRDs through https://operatorhub.io/operator/argocd-operator
Create a deployment file and apply it in your k8s environment, these deployment files refers the custom api that was deployed using OLM. This means deploying new pods, services, updating and maintatining the controllers everything is taken care of by the OLM :
```
apiVersion: argoproj.io/v1alpha1
kind: ArgoCD
metadata:
  name: argocd
  labels:
    app: argocd
spec: {}
```
In the image below can see how ArgoCD implements self-healing measure. In this instance, the back image was not configured properly and was not able to work in a k8s environment. ArgoCD detected that the new pod images are not healthy which resulted in ArgoCD deploying the last healthy image that was deployed into the k8s environment

In the following image ArgoCD detects a new version in the manifests and implenets the new image, detecting that the new pods are healthy it automatically deletes the old stable pod from the environment

Prometheus:

Prometheus is an open-source monitoring and alerting toolkit designed for cloud-native environments. It collects and stores time-series data, making it ideal for tracking system performance and detecting anomalies in real time. Originally developed at SoundCloud and now part of the Cloud Native Computing Foundation (CNCF), Prometheus is widely used in Kubernetes-based infrastructures.

Key Features of Prometheus:

Time-Series Data Collection – Stores metrics with timestamps for easy analysis.
Powerful Query Language (PromQL) – Enables flexible data exploration and visualization.
Multi-Dimensional Data Model – Uses labels to categorize and filter metrics efficiently.
Built-in Alerting (Alertmanager) – Triggers alerts based on predefined rules.
Kubernetes & Cloud-Native Integration – Works seamlessly with Kubernetes and service discovery.

Prometheus is a critical tool in observability, helping DevOps and SRE teams monitor applications, infrastructure, and microservices efficiently.

Install prometheus using Operator:
1. https://operatorhub.io/operator/prometheus#

Perform clusterrolesbinds and rolebinds:

#rbac:
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: prometheus-monitoring-reader
  namespace: monitoring
rules:
- apiGroups: [""]
  resources: ["services", "endpoints", "pods"]
  verbs: ["get", "list", "watch"]
  ---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: prometheus-reader
rules:
- apiGroups: [""]
  resources: ["services", "endpoints", "pods"]
  verbs: ["get", "list", "watch"]

kubectl create -f https://operatorhub.io/install/prometheus.yaml

# create cluster roles and bind them to the service account:
kubectl create clusterrolebinding prometheus-cluster-view   --clusterrole=view   --serviceaccount=monitoring:prometheus-k8s
kubectl create rolebinding prometheus-view-binding   --role=view   --serviceaccount=monitoring:prometheus-k8s   --namespace=default
kubectl create rolebinding prometheus-monitoring-view   --role=view   --serviceaccount=monitoring:prometheus-k8s   --namespace=monitoring

kubectl create rolebinding prometheus-monitoring-reader-binding   --role=prometheus-monitoring-reader   --serviceaccount=monitoring:prometheus-k8s   --namespace=monitoring
kubectl create clusterrolebinding prometheus-reader-binding   --clusterrole=prometheus-reader   --serviceaccount=monitoring:prometheus-k8s

# Verify if the roles are bound successfully
kubectl auth can-i get services --as=system:serviceaccount:monitoring:prometheus-k8s -n monitoring

Apply the manifests from the Prometheus folder:
```
kubectl apply -f prometheus
```

Verify if the deployment and service monitor has been created

kubectl get servicemonitor -n monitoring # you should see a backend-monitor service monitor
kubectl get po -n monitoring # you should see 2 instances of highly avaiable pods

Start port forwarding:

kubectl port-forward -n monitoring svc/prometheus-operated 9090:9090
# Status>targets you should be able to see your monitor and you should be able to query your custom query
# if not check if servicemonitor is working properly, your pods are online and try to get some traffic on your sites so that your pod will register some of the metrics

Exposing Custom metrics for better visibility

Metrics:

Metrics type:

Counter → always incrementing, will never decrement
1. e.g., number of http requests, number of users signed up (even if the user deleted his account, he did create it beforehand so it will be held as a counter metric), etc
gauge →it will show variations like incrementing and decrementing at the instance
1. e.g. - configmap, CPU util, memory util
histogram → specific records of data. That is, buckets of specific conditions of metrics will be created, and if the metric matches the condition, it will be added to the bucket
1. e.g, imagine the conditions are for latency, buckets of specific time duration will be created, for instance, bucket1-5ms, bucket2-10ms. Now, when the request for response meets these latency conditions, they will be added to their respective buckets
summary → similar to histogram

Creating custom metrics for your application:

If you want to use any other metric collector except Prometheus, you can use open-telemetry (it is a generic module that helps aggregate metrics that can be interpreted by the respective implementation)

If using Prometheus (best for k8s):

npm specific implementation:
```
npm install prom-client
```

Add the promclient to your code:

const promClient = require('prom-client');

//following are the basic templates you can use for metrics
//adding a http counter:
const httpRequestCounter = new promClient.Counter({
    name: "http_requests_total",
    help: "Total number of http requests",
    labelNames: ['method', 'path', 'status_code'], 
});

//adding a duration histogram
const requestDuration = new promClient.Histogram({
    name: "http_requests_duration_seconds",
    help: "duration of http requests in seconds",
    labelNames: ['method', 'path', 'status_code'],
    buckets: [0.1, 0.5, 1, 5, 10], //buckets for histograms in seconds 
});

//adding a duration summary
const requestDurationSummary = new promClient.Summary({
    name: "http_requests_summary_seconds",
    help: "Summary of http requests in seconds",
    labelNames: ['method', 'path', 'status_code'],
    buckets: [0.5, 0.9, 0.99], //Percenties
});

// gauge metric
const gauge = new promClient.Gauge({
    name: "node_gauge",
    help: "gauge tracking async task duration",
    labelNames: ['method', 'status_code'], 
});

These metrics are specific to my implementation, you will need to customize them according to your needs

//Middleware to track metrics
app.use((req, res, next) => {
    const start = Date.now();
    res.on('finish', () => {
        const duration = (Date.now() - start) / 1000; // Duration in seconds
        const { method, url } = req;
        const statusCode = res.statusCode; // Get the actual HTTP status code
        httpRequestCounter.labels({ method, path: url, status_code: statusCode }).inc();
        requestDuration.labels({ method, path: url, status_code: statusCode }).observe(duration);
        requestDurationSummary.labels({ method, path: url, status_code: statusCode }).observe(duration);
    });
    next();
});

// to expose metrics 
//you will need to add this api route
app.get('/metrics', async (req, res) => {
    res.set('Content-Type', promClient.register.contentType);
    res.end(await promClient.register.metrics());
});

Exposing metrics using service discovery:

# the prometheus stack should be running in namespace montoring
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  labels:
    backmon: backend-monitor
    release: monitoring
  name: backmon
  namespace: monitoring
spec:
  jobLabel: job_back
  endpoints:
    - interval: 2s
      targetPort: 5000
      path: /metrics
  selector:
    matchLabels:
      app: back
  namespaceSelector:
    matchNames:
      - default

Grafana:

Grafana is an open-source data visualization and monitoring tool used to analyze and display real-time metrics from various sources. It provides interactive dashboards with customizable panels, making it a popular choice for observability in cloud and on-premise environments.

Key Features of Grafana:

Multi-Source Data Integration – Supports Prometheus, InfluxDB, Elasticsearch, Loki, MySQL, and more.
Customizable Dashboards – Interactive graphs, charts, and alerts for real-time monitoring.
Alerting System – Notifies users of anomalies via email, Slack, or other channels.
Role-Based Access Control (RBAC) – Ensures secure access to dashboards.
Kubernetes & Cloud-Native Support – Works seamlessly in DevOps environments.

Grafana is widely used in monitoring, observability, and analytics, helping teams track system performance, detect issues, and optimize infrastructure.

Install grafana using operator:

Create CRDs:

kubectl create -f https://operatorhub.io/install/prometheus.yaml

Apply manifests from the grafana folder:
```
kubectl apply -f grafana
```

Create a dashboard:

rate(http_requests_total{namespace!="",pod!="",path!=""}[5m])

Final results:

K8s deployment:

Prometheus:

Grafana:

References:

https://faun.pub/using-the-operator-lifecycle-manager-to-deploy-prometheus-on-openshift-cd2f3abb3511

https://operatorhub.io/operator/prometheus#

https://operatorhub.io/operator/grafana-operator

CI/CD Pipeline#

Docker:#

Kubernetes (Minikube):#

Jenkins:#

Key Features of Jenkins:#

SonarQube:#

Key Features of SonarQube:#

Trivy:#

Key Features of Trivy:#

OLM (Operator Lifecycle Management):#

Key Features of OLM:#

ArgoCD:#

Key Features of ArgoCD:#

Prometheus:#

Key Features of Prometheus:#

Exposing Custom metrics for better visibility#

Metrics:#

Creating custom metrics for your application:#

Grafana:#

Key Features of Grafana:#

Final results:#

CI/CD Pipeline

Docker:

Kubernetes (Minikube):

Jenkins:

Key Features of Jenkins:

SonarQube:

Key Features of SonarQube:

Trivy:

Key Features of Trivy:

OLM (Operator Lifecycle Management):

Key Features of OLM:

ArgoCD:

Key Features of ArgoCD:

Prometheus:

Key Features of Prometheus:

Exposing Custom metrics for better visibility

Metrics:

Creating custom metrics for your application:

Grafana:

Key Features of Grafana:

Final results: