System Reliability Design, build, and maintain reliable and scalable infrastructure solutions to support our applications and services.
Automation: Develop automation tools and processes to improve efficiency, streamline operations, and reduce manual intervention.
Monitoring and Alerting: Implement robust monitoring and alerting systems to proactively identify and resolve issues before they impact customers.
Incident Management: Participate in incident response, troubleshooting, and resolution to ensure minimal downtime and optimal performance.
Performance Optimization: Continuously optimize system performance, capacity, and resource utilization.
Reliability Engineering: Apply engineering principles to design resilient systems, implement fault-tolerant solutions, and conduct post incident reviews.
Collaboration: Work closely with cross functional teams including software engineers, DevOps, and product managers to ensure seamless integration and deployment of new 5 PLUS years of experience in a Site Reliability Engineering or similar role.Communication: Excellent communication skills with the ability to collaborate effectively with technical and non technical stakeholders.
Education and Experience:
Bachelors degree in Computer Science, Engineering, or a related field (or equivalent experience).
5 PLUS years of experience in a Site Reliability Engineering or similar role.
System Reliability Design, build, and maintain reliable and scalable infrastructure solutions to support our applications and services.
Automation: Develop automation tools and processes to improve efficiency, streamline operations, and reduce manual intervention.
Monitoring and Alerting: Implement robust monitoring and alerting systems to proactively identify and resolve issues before they impact customers.
Incident Management: Participate in incident response, troubleshooting, and resolution to ensure minimal downtime and optimal performance.
Performance Optimization: Continuously optimize system performance, capacity, and resource utilization.
Reliability Engineering: Apply engineering principles to design resilient systems, implement fault-tolerant solutions, and conduct post incident reviews.
Collaboration: Work closely with cross functional teams including software engineers, DevOps, and product managers to ensure seamless integration and deployment of new 5 PLUS years of experience in a Site Reliability Engineering or similar role.Communication: Excellent communication skills with the ability to collaborate effectively with technical and non technical stakeholders.
Education and Experience:
Bachelors degree in Computer Science, Engineering, or a related field (or equivalent experience).
5 PLUS years of experience in a Site Reliability Engineering or similar role.