Job Description
<p><p><b>Role : Platform Engineering Lead (Cloud /DevOps)</b><br/><br/><b>Job Type : </b> Full Time<br/><br/><b>Job Positions : </b> 1<br/><br/><b>Location : </b> Bangalore / Chennai (Work from Office)<br/><br/><b>Job Description : </b><br/><br/>- 12+ years in Linux/Cloud/DevOps/Platform roles.<br/><br/>- 5+ years with automation tools (Ansible, Terraform, AWX/Ansible Tower).<br/><br/>- 5+ years in Python scripting and cloud platforms (AWS, OpenStack).<br/><br/>- Strong experience with CI/CD pipelines and release engineering.<br/><br/>- Expertise in container orchestration (Kubernetes, Docker).<br/><br/>- Proven track record with monitoring & observability tools (Nagios, Prometheus, Splunk, ELK).<br/><br/>- Strong leadership skills with experience mentoring engineers and leading platform initiatives.<br/><br/><b>Key Responsibilities : </b><br/><br/><b>Platform & Infrastructure Leadership</b><br/><br/>- Lead the design, implementation, and optimization of scalable cloud-native platforms (AWS, OpenStack).<br/><br/>- Define the platform engineering roadmap, ensuring alignment with business and product goals.<br/><br/>- Drive adoption of infrastructure as code (IaC), configuration management, and automated provisioning.<br/><br/><b>DevOps, CI/CD & Release Engineering</b><br/><br/>- Own and optimize CI/CD pipelines (Concourse, Jenkins, GitLab CI, or similar).<br/><br/>- Ensure highly available, secure, and reliable build/release processes for complex enterprise systems.<br/><br/>- Collaborate with developers, QA, and operations to reduce release cycles and improve quality.<br/><br/><b>Automation & Scripting :</b></p><p><br/></p><p>- Architect and maintain automation using Ansible, Terraform, AWX/Ansible Tower.<br/><br/>- Develop and enhance automation frameworks with Python scripting.<br/><br/>- Build self-service platforms for internal teams to streamline operations.<br/><br/><b>Containerization & Orchestration :</b></p><p><br/></p><p>- Lead large-scale container platform deployments using Docker & Kubernetes.<br/><br/>- Implement strategies for scalability, observability, and resilience of microservices.<br/><br/><b>Monitoring, Logging & Observability :</b></p><p><br/></p><p>- Implement end-to-end observability frameworks (Nagios, Op5, Prometheus, Splunk, ELK).<br/><br/>- Define proactive monitoring, logging, and alerting practices to ensure system reliability.<br/><br/>- Partner with InfoSec teams to embed compliance and security controls.<br/><br/><b>Team Leadership & Collaboration :</b></p><p><br/></p><p>- Mentor DevOps/Platform engineers and foster a culture of automation and reliability.<br/><br/>- Collaborate with cross-functional stakeholders (engineering, product, operations, leadership).<br/><br/>- Influence adoption of platform engineering standards across the organization.<br/><br/><b>Must-Have- Skills :</b></p><p><br/></p><p>- AWS , OpenStack, Ansible / Terraform / Ansible Tower and AWX, Concourse, Jenkins, GitLab CI/CD<br/><br/>- Docker, Kubernetes, Python Scripting,Nagios, Op5,Prometheus, Splunk, ELK<br/><br/><b>Academic : </b>Post Graduate /Graduate in Engineering /Technology /MBA</p><br/></p> (ref:hirist.tech)