Key Responsibilities:
- Design, build, and maintain cloud infrastructure using Terraform and GitOps methodologies (ArgoCD / FluxCD).
- Automate infrastructure provisioning, configuration, and deployment using Python and CI/CD pipelines.
- Implement GitOps workflows to ensure declarative, version-controlled infrastructure management.
- Collaborate with development and security teams to integrate DevSecOps best practices.
- Optimize CI/CD processes using Jenkins, GitHub Actions, or GitLab CI for faster and more reliable delivery.
- Manage Kubernetes clusters, including scaling, monitoring, and troubleshooting workloads.
- Develop and maintain monitoring and alerting systems using tools like Prometheus, Grafana, and ELK Stack.
- Implement and enforce best practices for infrastructure security, cost optimization, and disaster recovery.
- Support incident response and root cause analysis for production issues.
Required Skills & Qualifications:
- 8+ years of experience in DevOps, Site Reliability, or Cloud Engineering.
- Strong hands-on experience with Terraform (modules, workspaces, remote state, etc.).
- Proficiency in Python scripting for automation, tooling, and integrations.
- Solid understanding of GitOps principles and experience with ArgoCD or FluxCD.
- Experience with cloud platforms (AWS, Azure, or GCP).
- Knowledge of Kubernetes, Docker, and container orchestration best practices.
- Experience with CI/CD tools (Jenkins, GitLab CI/CD, GitHub Actions).
- Familiarity with Linux administration, networking, and security concepts.
- Strong understanding of monitoring, observability, and logging tools.