Job Overview
Company
Wits Innovation Lab
Location
Sahibzada Ajit Singh Nagar
Category
Computer Occupations
Job Description
Site Reliability Engineer (SRE) Senior Role

Location: Mohali
Experience: 4+ years

We are looking for an experienced Site Reliability Engineer (SRE) to strengthen our cloud and infrastructure team. The role involves owning reliability, availability, and scalability of distributed platforms, while driving automation and observability best practices.

Key Responsibilities:
- Build, implement, and maintain monitoring, logging, and alerting systems across production and non-production environments.
- Lead incident management, root cause analysis, and drive improvements for faster recovery.
- Define and test disaster recovery and backup strategies.
- Partner with development and product teams to establish and enforce SLAs, SLOs, and SLIs.
- Optimize AWS cloud environments for performance, resilience, and cost-efficiency.
- Develop automation for provisioning, scaling, deployment, and recovery.
- Manage infrastructure using Terraform, GitLab CI/CD, Kubernetes, and related tooling.
- Participate in on-call rotations and incident handling.

Skills & Experience Required:
- 4+ years in SRE, DevOps, or cloud infrastructure roles.
- Hands-on with AWS services (EC2, EKS, RDS, Cognito, CloudWatch).
- Proficient in Kubernetes administration in production environments.
- Experience with Infrastructure as Code (Terraform, CloudFormation).
- Scripting in Python, Bash, or Shell.
- Familiarity with automation tools (Chef, Ansible).
- Strong observability background: Prometheus, Grafana, ELK, distributed tracing.
- Experience managing relational databases (PostgreSQL or similar, with replication).
- Solid understanding of networking, load balancing, and security practices.
- CI/CD exposure (GitOps, pipelines).
- Worked with tools like Splunk, Datadog, or Dynatrace.

Preferred Qualifications:
- AWS Certified Solutions Architect / DevOps Engineer.
- Certified Kubernetes Administrator (CKA).

(ref:hirist.tech)
About Wits Innovation Lab
Wits Innovation Lab is actively hiring for this Senior Site Reliability Engineer - Cloud Infrastructure position