SRE III,
Reliability Engineering
India
Technology
Center
WHO YOU’LL
WORK WITH
You
will be a part of a team of talented Site Reliability Engineers
focused on delivering reliabile and observable software used by
millions of athletes* around the world.
You will be a part of
the Resilience Engineering organization which includes Reliability
Engineering, Live Site Support Engineering, Peak Event Management,
Insights & Efficiency Engineering, and Enterprise Systems
Engineering.
While a variety of engagement
methods exist, SREs are primarily embedded with product delivery
teams across Global Technology.
These teams span all of
Nike’s most critical digital properties: Nike.com, Nike App, SNKRS,
brick & mortar retail, wholesale platforms, and supply chain
technologies.
WHO WE
ARE LOOKING FOR
The
ideal candidate will have a strong software engineering background,
a demonstrated ability to influence and partner, and show a passion
for learning and mentoring.
This engineer will have a track
record of delivering reliable and observable digital experiences
through the application of concepts from Site Reliability
Engineering, DevOps, and other relevant
disciplines.
- Bachelor’s degree in
Computer Science, Information Systems, or other relevant subject
areas - 4-7 years of professional experience in
software engineering
- Deep
understanding of how to deliver large scale software with modern
reliability and resilience concepts (multi-region, multi-cloud,
active/active, canary deploys, synthetic testing, containers,
etc.) - Hands-on experience building, deploying,
and operating software using modern cloud-based distributed system
techniques and micro-service architecture patterns.
AWS
experience
preferred
- Ability to build
strong relationships with partners/stakeholders and use technical
credibility and influence to drive positive
outcomes - Demonstrated experience implementing
Service Level Objectives, error budgets, and the associated
cultural change - A history of finding and
reducing toil within complex systems and
processes
- Experience with
modern observability tooling, processes, and mindset – Splunk,
SignalFx, New Relic, CatchPoint, etc.
Bonus points for
experience with Open Source observability stacks.
Extra bonus
points for experience with AI Ops,
AI/ML
WHAT
YOU’LL WORK ON
As a
Site Reliability Engineer, you will be focused on maximum
availability, observability, reliability, security, and performance
for Nike Digital Experiences.
SREs perform deep problem analysis,
detect infrastructure or code defects, define, report, and create
observability processes for Key Performance Indicators (KPIs), and
work with product delivery teams to provide long-term solutions to
production
issues.
- Partner
with leaders in product, engineering, business, and operations to
identify and address risks, vulnerabilities, and limits in our
end-to-end systems - As an embedded engineer
deliver product roadmaps for key Nike initiatives like Nike.com,
Nike App, Retail systems, Wholesale platforms, Supply Chain
systems, and more - Influence systems design
decisions and patterns across business-value engineering teams,
infrastructure teams, and architecture - Make the
life of on-call engineers safe by delivering deep observability,
actionable alerts and runbooks, and iterative Service Level
Objectives that truly align with consumer
experience - Identify, curate, implement, and
adapt key metrics for end-to-end system health and
performance