Job Description
<p><p><b>Job Description : </b><br/><br/> Location : Pune, India.<br/><br/><b>About The Role : </b><br/><br/> We are looking for a Senior DevOps Engineer to join our high-impact team in Pune, India.<br/><br/> You will lead the design and implementation of scalable, secure, and highly available infrastructure across both cloud and on-premise environments.<br/><br/> This role demands a deep understanding of Linux systems, infrastructure automation, and performance tuning, especially in high-performance computing (HPC) setups.<br/><br/> As a technical leader, youll collaborate closely with development, QA, and operations teams to drive DevOps best practices, tool adoption, and overall infrastructure reliability.<br/><br/><b>Key Responsibilities : </b><br/><br/> - Design, build, and maintain Linux-based infrastructure across cloud (primarily AWS) and physical data centers.<br/><br/> - Implement and manage Infrastructure as Code (IaC) using tools such as CloudFormation, Terraform, Ansible, and Chef.<br/><br/> - Develop and manage CI/CD pipelines using Jenkins, Git, and Gerrit to support continuous delivery.<br/><br/> - Automate provisioning, configuration, and software deployments with Bash, Python, Ansible, etc.<br/><br/> - Set up and manage monitoring/logging systems like Prometheus, Grafana, and ELK stack.<br/><br/> - Optimize system performance and troubleshoot critical infrastructure issues related to networking, filesystems, and services.<br/><br/> - Configure and maintain storage and filesystems including ext4, xfs, LVM, NFS, iSCSI, and potentially Lustre.<br/><br/> - Manage PXE boot infrastructure using Cobbler/Kickstart, and create/maintain custom ISO images.<br/><br/> - Implement infrastructure security best practices, including IAM, encryption, and firewall policies.<br/><br/> - Act as a DevOps thought leader, mentor junior engineers, and recommend tooling and process improvements.<br/><br/> - Maintain clear and concise documentation of systems, processes, and best practices.<br/><br/> - Collaborate with cross-functional teams to ensure reliable and scalable application delivery.<br/><br/><b>Required Skills & Experience : </b><br/><br/> - 5+ years of experience in DevOps, SRE, or Infrastructure Engineering.<br/><br/> - Deep expertise in Linux system administration, especially around storage, networking, and process control.<br/><br/> - Strong proficiency in scripting (e.g., Bash, Python) and configuration management tools (Chef, Ansible).<br/><br/> - Proven experience in managing on-premise data center infrastructure, including provisioning and PXE boot tools.<br/><br/> - Familiar with CI/CD systems, Agile workflows, and Git-based source control (Gerrit/GitHub).<br/><br/> - Experience with cloud services, preferably AWS, and hybrid cloud models.<br/><br/> - Knowledge of virtualization (e.g., KVM, Vagrant) and containerization (Docker, Podman, Kubernetes).<br/><br/> - Excellent communication, collaboration, and documentation skills.<br/><br/><b>Nice to Have : </b><br/><br/> - Hands-on with Lustre or other distributed/parallel filesystems.<br/><br/> - Experience in HPC (High-Performance Computing) environments.<br/><br/> - Familiarity with Kubernetes deployments in hybrid clusters.<br/></p><br/></p> (ref:hirist.tech)