Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives.
The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best.
Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities.
Come make an impact on the communities we serve as you help us advance health optimization on a global scale.
Join us to start Caring.
Connecting.
Growing together.
Primary Responsibilities:
Drive reliability, scalability, and performance across our systems, with a solid focus on leveraging AI and automationImplement AI/ML models for predictive alerting, anomaly detection, and capacity planningIntegrate AI tools into incident management workflows to reduce MTTR and improve root cause analysisDrive adoption of AI-powered observability platformsDesign and implement cloud-native solutions using AWS and GCP servicesArchitect scalable, resilient, and secure infrastructure using Infrastructure as Code (IaC) tools like Terraform or CloudFormationCollaborate with development, DevOps, and security teams to integrate cloud solutions into CI/CD pipelinesArchitectural experience on performing SRE activities on their ownDevelop and enforce security policies, standards, and proceduresMonitor cloud environments and optimize performance, cost, and reliabilityTriage and RCA of production incidents and managementProvide technical leadership and mentorship to junior engineersStay current with cloud trends and recommend best practices and new technologies
Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment).
The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do soRequired Qualifications:
Undergraduate degree or equivalent experience8+ years of experience in SRE, DevOps, or infrastructure engineering2+ years in a leadership role managing SRE or platform teams4+ years of solid understanding of cloud security (AWS, GCP, Azure), network security, and application securityExperience with monitoring and alerting tools, especially those with AI capabilitiesExperience implementing AI/ML models for operational intelligence, observability and automationHands-on experience with security tools and platformsGood experience on infrastructure architectureKnowledge of AIOps platforms and frameworksProven excellent communication and stakeholder management skillsPreferred Qualifications:
Knowledge of cloud principlesKnowledge of software security principlesAt UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone.
We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life.
Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes.
We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.