Key Responsibilities:
- Architect and Implement: Design and deploy scalable, high-availability Kubernetes clusters using OpenShift and Rancher Kubernetes Engine (RKE).
- Automation & Orchestration: Develop and manage infrastructure-as-code (IaC) solutions using Terraform, Helm, or Ansible.
- CI/CD Integration: Implement and optimize CI/CD pipelines using Jenkins, GitLab CI/CD, ArgoCD, or Tekton for automated deployment and testing.
- Security & Compliance: Enforce security best practices across Kubernetes clusters, including RBAC policies, service mesh configurations, and container image scanning.
- Monitoring & Logging: Set up observability solutions using Prometheus, Grafana, ELK/EFK Stack, or OpenTelemetry for proactive monitoring and alerting.
- Multi-Cloud & Hybrid Cloud Deployments: Design hybrid and multi-cloud strategies using AWS, Azure, GCP, or on-premises solutions integrated with OpenShift and Rancher.
- SRE & Performance Optimization: Implement SRE best practices for high availability, auto-scaling, and performance tuning of microservices architectures.
- Collaboration: Work closely with development, security, and operations teams to streamline DevOps processes and enable faster deployments.
- Disaster Recovery & Backup: Implement disaster recovery strategies, backup automation, and cluster failover solutions.
Required Skills & Experience:
- Kubernetes & Containerization: Deep understanding of Kubernetes orchestration, OpenShift, and Rancher Kubernetes Engine (RKE2/RKE).
- Containerization & Service Mesh: Experience with Docker, Istio, Linkerd, or Envoy.
- Infrastructure as Code (IaC): Hands-on expertise with Terraform, Helm, and Ansible.
- CI/CD Pipelines: Strong knowledge of Jenkins, GitOps (ArgoCD, FluxCD), and Tekton.
- Cloud Platforms: Experience with AWS, Azure, GCP, and on-premises Kubernetes clusters.
- Monitoring & Logging: Experience with Prometheus, Grafana, ELK/EFK Stack, and OpenTelemetry.
- Security & Compliance: Kubernetes RBAC, Pod Security Policies, image scanning, and network policies.
- Scripting & Automation: Proficiency in Bash, Python, or Go for automation and scripting.
- Networking & Load Balancing: Expertise in Kubernetes networking, Ingress controllers (NGINX, Traefik), and service discovery.
- Backup & DR: Experience with Velero, Longhorn, or Kasten for Kubernetes backup and recovery.
Skills Required
Scripting, Kubernetes, Azure, Automation, Python, DevOps, Technical Architecture