Description
We are seeking an experienced SRE DevOps Engineer to join our team in India.
The ideal candidate will have a strong background in system reliability and automation, with a passion for improving system performance and ensuring high availability.
Responsibilities
- Design, implement, and maintain highly available systems and infrastructure.
- Monitor system performance and troubleshoot issues as they arise.
- Automate deployment processes and improve CI/CD pipelines.
- Collaborate with development teams to enhance application performance and reliability.
- Implement security best practices and manage access controls.
- Perform capacity planning and forecasting for system resources.
- Document system configurations and operational procedures.
Skills and Qualifications
- 7-15 years of experience in Site Reliability Engineering or DevOps roles.
- Strong knowledge of Linux/Unix administration.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud.
- Proficiency in scripting languages like Python, Bash, or Ruby.
- Familiarity with containerization technologies such as Docker and orchestration tools like Kubernetes.
- Experience with configuration management tools such as Ansible, Puppet, or Chef.
- Knowledge of monitoring tools like Prometheus, Grafana, or Nagios.
- Understanding of networking concepts and protocols.
Skills Required
Kubernetes, Terraform, Docker, Prometheus, Grafana, Python, Linux Administration, Monitoring Tools, Networking, Gcp