Know ATS Score
CV/Résumé Score
  • Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: Manager, Cloud Operations Engineering.
India Jobs Expertini

Urgent! Manager, Cloud Operations Engineering Job Opening In Bengaluru – Now Hiring MongoDB

Manager, Cloud Operations Engineering



Job description

MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data.

We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI.

Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure.

Atlas allows customers to build and run applications anywhere—on premises, or across cloud providers.

With offices worldwide and over 175,000 new developers signing up to use MongoDB every month, it’s no wonder that leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications.

Cloud Operations Engineers are responsible for building internal tools and process automation.

Day-to-day duties are creating and monitoring systems alert dashboards, reviewing critical event and system logs, accessing customer instances that underpin their production databases, and performing server administration duties including performance troubleshooting.

Applicants must be critical thinkers who are quick to detect, resolve, or escalate issues that are sometimes broad in scope and difficult to trace.


We are looking to speak to candidates who are based in Bengaluru for our hybrid working model.


Responsibilities

  • Help scale the Cloud Operations Engineering team with the strategic implementation and refinement of processes and tools

  • Provide career development feedback and advice to direct reports

  • Identify and measure team health indicators and performance metrics

  • Ensure proper team focus on priorities, objectives, and related deliverables

  • Collaborate with technical and non-technical teams across the company

  • Balance your time between leading your team, working on customer incidents and being involved in projects

  • Be a source of guidance and advice to your own team members and other teams within MongoDB

  • Build a relationship with your team around trust

  • Successfully coordinate with a global team of Cloud Operations Engineers who are tasked with ensuring our uptime guarantees to the MongoDB Atlas customer base

  • Participate in designing and building internal tools

  • Assist in scoping, designing and deploying systems that reduce Mean Time to Resolve for customer incidents

  • Monitor and detect emerging customer-facing incidents on the Atlas platform; assist in their proactive resolution

  • Automate internal processes, routine monitoring and troubleshooting tasks

  • Diagnose live incidents, differentiate between platform issues versus usage issues, and take the next steps toward resolution

  • Cooperate with our Product Management and Cloud Engineering organizations by identifying areas for improvement in the management applications powering the Atlas infrastructure

  • Coordinate and participate in a weekly on-call rotation, where you will handle short term customer incidents (from direct surveillance or through alerts via our Technical Services Engineers)
  • Requirements

  • Management skills, with hands-on experience running small to mid sized Engineering Teams in a rapid-growth environment 

  • Strong diagnostic/troubleshooting process, with significant experience troubleshooting end-to-end technical issues in production environments

  • Experience supervising, leading and monitoring progress of Software Development projects.

  • Patience, empathy, and a genuine desire to help others

  • Excellent communication skills, both written and verbal

  • Ability to think on your feet, remain calm under pressure, and find solutions to challenges in real-time

  • Experience with being an oncall DevOps, SRE, or Cloud Operations engineer

  • Expertise with Linux system administration and networking technologies

  • Knowledge of database and distributed system operations and concepts

  • Knowledgeable about a wide range of web and internet technologies

  • Familiarity with Amazon Web Services and other Cloud infrastructure platforms (e.g. GCP, Azure)

  • Experience in monitoring, system performance data collection and analysis, and reporting

  • Capability to write programs/scripts to solve both short-term systems problems and long term strategic objectives for the Atlas product

  • A CS/CE degree or equivalent experience

  • At least 2 of the following programming languages: Java, Go, Python, Typescript

  • A keen interest in learning new skills and competencies

  • Required Skill Profession

    Operations Specialties Managers



    Your Complete Job Search Toolkit

    ✨ Smart • Intelligent • Private • Secure

    Start Using Our Tools

    Join thousands of professionals who've advanced their careers with our platform

    Rate or Report This Job
    If you feel this job is inaccurate or spam kindly report to us using below form.
    Please Note: This is NOT a job application form.


      Unlock Your Manager Cloud Potential: Insight & Career Growth Guide