This page highlights initiatives I manage or have recently executed across DevOps, SRE, and cloud operations.

Agentic AI Operation

Studied and implemented AI Agents to accelerate DevOps and SRE team operations, with a focus on SRE troubleshooting, task refinement, management reports, and auto code review.

Role: DevOps & SRE Manager

  • AI
  • Python
  • Kubernetes
  • Agentic Frameworks

FinOps - Multiple AWS Accounts

Analyzed monthly AWS costs and proposed improvements to reduce the monthly bill while maintaining recovery capabilities and compliance.

Role: DevOps & SRE Manager

  • AWS
  • Amazon S3
  • EC2
  • Kubernetes

IDLC - Infrastructure as Code Development Lifecycle

Created the infrastructure development framework, enabling reorganization and optimization of infrastructure delivery processes. The solution reduced cognitive load during development, eliminated rollout risks, and improved cross-team collaboration.

Role: DevOps & SRE Manager

  • AWS
  • Terraform
  • Terragrunt
  • Atlantis
  • IaC

Incident Management tool (Incident IO) Implementation

Implemented a new incident management tool, integrating it with existing infrastructure components to streamline incident response and reduce costs.

Role: DevOps & SRE Leadership

  • Prometheus
  • Cloud Watch
  • Opensearch
  • Backstage
  • GitOps

Service Mesh (Istio) implementation

Implemented Istio as the company's Service Mesh solution, enhancing observability, internal application routing, and security.

Role: DevOps & SRE Manager

  • Kubernetes
  • AWS
  • Istio
  • AWS EKS

Canary rollout Kubernetes cluster

Led a canary rollout of a new Kubernetes cluster on AWS EKS, implementing best practices for security, scalability, and cost optimization while ensuring compliance with regulatory standards.

Role: Staff DevOps Engineer

  • ArgoCD
  • Karpenter
  • Bottlerocket
  • Terraform
  • AWS EKS