Job Description
Designation: DevOps Engineer/Lead DevOps Engineer
Experience: 8-15 years
Location: Remote
Summary
Work with talented DevOps and Cloud operations engineers and architects to deliver Sycamore SaaS product offerings to our Bio-Pharma customers using exciting, cutting-edge technologies.
Develop, execute, maintain, and improve procedures, automation scripts, and infrastructure implementations to support Sycamore SaaS Operations.
Roles and Responsibility
Specific roles and responsibilities include:
- Provide technical expertise and leadership when needed to SaaS Operations Production Operations teams.
- Help Implement the Cloud Operations team's goals and deliverables as determined by Sycamore Leadership - Ensure smooth operations of Sycamore SaaS products - Take Complete ownership of Customer Implementations, including SLA and SLO.
- Automate, enhance and maintain critical processes in Cloud Operations, such as Change Control, Monitoring & Alerting - Drive critical processes in SaaS Operations such as Change Control, Problem & Incident Management, and Reporting, as well as key tools for Monitoring & Alerting - Drive Disaster Recovery and failover procedures, training, testing, and team readiness - Coordinate focus groups across all teams on process improvements and technical improvements that lead to better stability and reliability - Contribute to process improvements and technical improvements that lead to increased stability and reliability - Support continuous improvements in SaaS Operations by - developing platform services and tooling for modern cloud operations, including metrics monitoring, CI/CD pipelines, etc.
- improving automation of provisioning, deployment, monitoring, alerting, and escalation - Support Secure operations by - implementing best-in-class recommendations for secure operations - Carry out ongoing Production Ops activities with precision and quality - Define, build, and deliver a high-quality SaaS Platform for Work with third-party vendors and partners to help develop a complete solution set on the SaaS platform - Representing Cloud Operations in InfoSec meetings and developing and driving secure procedures - Help obtain and maintain various certifications - Being a good team player & a leader when needed for a high-performance Cloud/SaaS delivery team by - Reviewing personal/team performance, quality reviews, - Manage operations and operational issues.
- Establish a culture of high performance, ownership, delivery focus, and continuous improvement.
Excellence in Operations
- Implement and carry out procedures and policies to ensure high-quality SaaS operations with appropriate levels of management controls.
- Act as an internal contact for platform services issues for a customer - Work with cross-functional departments: Sales, Professional Service, Customer Support, Engineering, and QA
Desired Experience
- Has experience in implementing, managing, maintaining, and decommissioning complex cloud-based Information system components in a secure and controlled manner.
- Must be experienced in coordinating cross-functional teams such as support, escalation, and engineering software teams to address product issues successfully.
- Strong understanding of how to build, scale, and manage complex multi-product/service environments - Record of building lean, automated, scalable support structures versus labor-intensive environments.
- Strong innovation mindset, analytical skills, excellent oral and written communication skills, and experience effectively communicating project/program mission and objectives.
- Must exhibit a practical customer service attitude and lead a team in resolving difficult customer situations.
Skills Required
- Very Strong Linux Knowledge & Troubleshooting Skills - Scripting using – Bash, Python, PowerShell, etc - Kubernetes, helm Charts - Terraform, Ansible - Windows Terminal Services, AD, LDAP - Hands-on experience in cloud technology – AWS, Azure – AWS preferred - Change, Problem & Incident Management - Implementation awareness of Vulnerability/Penetration Testing, Security - Strong Networking Skills - Tools and frameworks used for monitoring, performance management, logging - CI/CD pipeline - SRE – Including Datadog.
- Datadog
Certification
- RHEL - AWS - Kubernetes