At Nebula Tech Solutions, we're expanding our global reliability engineering team to support mission-critical systems for our US-based enterprise clients. This role works night shifts only.
We're looking for experienced DevOps/SRE professionals (5+ years) who bring hands-on depth in Kubernetes, monitoring/metrics, and coding, not just infrastructure management.
This is a role for engineers who thrive on troubleshooting, automation, and continuous improvement in high-availability environments.
What You'll Do
- Build, optimize, and maintain Kubernetes clusters (EKS/GKE/AKS) for scalability and resilience
- Design and improve CI/CD pipelines (Jenkins, ArgoCD, FluxCD, Harness, GitHub Actions)
- Implement and extend observability using Prometheus, Grafana, OpenTelemetry, and custom metrics
- Develop and maintain internal tools and automations using Python, Go, or similar programming languages
- Drive incident response, RCA, and reliability improvements across services
- Collaborate with global teams to ensure continuous uptime and performance
What We're Looking For
- 5+ years of DevOps / SRE / Platform Engineering experience
- Deep, hands-on knowledge of Kubernetes architecture, deployments, debugging, and scaling
- Strong programming or scripting skills in Python, Go, Java, or Node.js (beyond shell scripting)
- Proven experience with monitoring and telemetry systems (Prometheus, Grafana, ELK, OpenTelemetry)
- Understanding of web services, REST APIs, and distributed systems troubleshooting
- Familiarity with Terraform, Helm, and GitOps workflows (FluxCD/ArgoCD)
Bonus Points
- Experience with .NET, Java, or JavaScript application stacks
- Exposure to message brokers (Kafka, RabbitMQ) or databases (PostgreSQL, MongoDB, MSSQL)
- Knowledge of service meshes (Istio, Linkerd) and network observability
Location: Remote (India)
Shift: US Night Shift (Continuous)
Client: US-based Enterprise (Global Scale)
If you love solving complex reliability challenges, enjoy scripting and building automation, and want to work with globally distributed systems, we'd love to hear from you.