Job description
**Introduction**
A career in IBM Software means you’ll be part of a team that transforms our customer’s challenges into solutions.
Seeking new possibilities and always staying curious, we are a team dedicated to creating the world’s leading AI-powered, cloud-native software solutions for our customers.
Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.
IBM’s product and technology landscape includes Research, Software, and Infrastructure.
Entering this domain positions you at the heart of IBM, where growth and innovation thrive.
**Your role and responsibilities**
Databases and event streams are complementary infrastructure in modern software architecture.
You will be highly involved in the design, implementation, and operation of Astra Streaming (Pulsar) and our mission to enable the world’s leading enterprises as we scale up and deliver an amazing developer experience.
You will also be responsible for helping us ensure high uptimes and satisfied customers across our various production and non-production environments.
What you will do:
* Ensure production stability and high-uptimes and assist debugging and root causing user-facing issues
* Contribute to open-source and proprietary projects that interface with Pulsar
* Perform software upgrades and configuration updates in a production environment
* Perform security analysis and apply changes to comply with security policies
* Maintain monitoring systems, configure alerting and log collection.
* Work in a fast-moving environment to rapidly prototype, iterate and evolve solutions for real-world developer need
* Perform regular code reviews among peers
**Required technical and professional expertise**
* 4 - 6 years of relevant experience
* Systems level proficiency in Java, Golang, or another popular language.
* Experience working on and operating large scale distributed production systems
* Kubernetes (EKS, AKS, GKE), Helm, and CRD’s (Operators)
* Infrastructure as Code, CI/CD (ArgoCD), Jenkins or similar
* Metrics, Alerting and Logging, Grafana, Prometheus, Splunk
* Knowledge of highly scalable services that achieve massive scalability and availability
* Cloud Infrastructure Providers, GCP, Azure, AWS, or similar.
* Experience in SDLC having contributed at each step: Plan, Track, Code, Build, Test, Deploy and Monitor
**Preferred technical and professional experience**
* Experience maintaining a production Apache Pulsar or Kafka cluster is a plus.
* Experience with Prometheus and either Thanos or another metrics aggregation is a plus.
* Experience with Terraform is a plus
* Experience with Apache Cassandra is a plus.
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics.
IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
Required Skill Profession
Other General