Job Description
<p><p><b>Role : AWS DevOps Engineer</b> (7+ years relevant experience in DevOps)<br/><br/><b>Skills : </b><br/><br/><b>- Required Knowledge Level</b><br/><br/><b>- AWS Services, AWS Networking, Windows Server, PowerShell scripting - Advanced</b><br/><br/><b>- GitHub, Terraform, Python, Azure DevOps - Intermediate</b><br/><br/><b>Work Mode : Remote</b><br/><br/><b>Work Timing : 12:30PM to 9:30PM</b><br/><br/><b>About the Role :</b><br/><br/>We are seeking a Senior Cloud Engineer to join our Infrastructure & Platform Engineering team.
This role is critical to managing secure, scalable, and high-performance cloud infrastructure while embedding automation, AI-driven service optimization, and regulatory compliance.
</p><p><br/></p><p>Youll work across production and non-production environments, collaborating with development, QA, and service teams to ensure smooth, secure, and efficient platform operations.<br/><br/>This is an exciting opportunity to work at the intersection of cloud engineering, AI enhanced service management, and DevOps best practices in a mission-critical environment.<br/><br/><b>What Youll Do :</b><br/><br/>Cloud Infrastructure & Automation :<br/><br/>- Design, build, and manage AWS/Azure/GCP-based infrastructure using Terraform, CloudFormation, and IaC pipelines.<br/><br/>- Manage compute, networking, load balancing, identity, and scaling for enterprise grade workloads.<br/><br/>- Deploy and manage virtualized services to ensure uptime, high availability, and cost-efficiency.<br/><br/>DevOps & CI/CD Integration :<br/><br/>- Build and maintain CI/CD pipelines using GitHub Actions, Jenkins, GitLab CI.<br/><br/>- Support serverless, containerized (Docker, Kubernetes, EKS/ECS) and event-driven platforms.<br/><br/>- Automate environment provisioning, application deployments, monitoring, and alerts.<br/><br/>IT Service Management (ITSM) & Incident Handling :<br/><br/>- Manage and streamline incident, problem, and change workflows using tools like ServiceNow, Azure DevOps, or Jira Service Management.<br/><br/>- Lead incident response and root cause analysis (RCA); define preventive measures and SLAs to maintain system reliability.<br/><br/>- Maintain detailed operational documentation and runbooks for high-availability support models.<br/><br/>AI-Driven Service Optimization & Analytics :<br/><br/>- Build and integrate AI-enhanced tooling (e.g., ChatGPT, predictive analytics, or self-healing bots) to accelerate service delivery and reduce resolution time.<br/><br/>- Use AI/ML for intelligent alerting, auto-remediation, and anomaly detection across observability tools.<br/><br/>- Analyze system performance and ITSM data to generate actionable insights using data visualization platforms (e.g., Power BI, Tableau, QuickSight).<br/><br/>Security, Risk, and Compliance :<br/><br/>- Implement secure coding, encryption, network segmentation, IAM best practices, and Zero Trust principles.<br/><br/>- Ensure compliance with SOC2, GDPR, ISO 27001, DORA, and internal InfoSec standards.<br/><br/>- Conduct regular system audits, SAST/SCA scans, and support regulatory reporting.<br/><br/>Collaboration & Continuous Improvement :<br/><br/>- Act as a liaison between infra, dev, and QA teams to ensure seamless integration and deployment.<br/><br/>- Promote a continuous improvement culture, identifying automation opportunities and driving operational efficiencies.<br/><br/>- Mentor junior team members on best practices in infrastructure-as-code, observability, and secure operations.<br/><br/><b>What You Bring :</b><br/><br/>- 7+ years of experience managing cloud-based infrastructure and enterprise IT environments.<br/><br/>- Strong skills in automation, scripting, and CI/CD pipelines.<br/><br/>- Hands-on experience with ITSM tools and AI-powered service delivery enhancements.<br/><br/>- Working knowledge of regulatory compliance frameworks (SOC2, GDPR, ISO 27001, DORA).<br/><br/>- Demonstrated ability in incident response, RCA, and preventive measures.<br/><br/>- Strong analytical and diagnostic abilities to troubleshoot complex cloud issues.<br/><br/>- Excellent cross-functional communication and collaboration skills.<br/><br/>- Bachelors degree in computer science, Engineering, or a related field<br/><br/>Nice to Have :<br/><br/>- Certifications: AWS Certified Solutions Architect / DevOps Engineer / Security Specialist<br/><br/>- Exposure to self-healing architectures, anomaly detection, or AI-based automation agents<br/><br/>- Experience integrating observability with ITSM workflows for automated post incident insights<br/><br/>- Familiarity with edge computing, serverless frameworks, and VPC/networking optimization<br/><br/></p><br/></p> (ref:hirist.tech)