Know ATS Score
CV/Résumé Score
  • Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: Senior cloudops & ml engineer.
India Jobs Expertini

Urgent! Senior cloudops & ml engineer Position in Pune - Rapid7

Senior cloudops & ml engineer



Job description

⚙️ Senior Cloud Ops, MLops Infrastructure Engineer Job Description ⚙️
Join Rapid7: Secure the Future with AI
Overview
We are seeking an experienced and highly specialised Senior MLops Infrastructure Engineer to manage, automate, and secure our production cloud infrastructure and Machine Learning (ML)/Large Language Model (LLM) operational pipelines.

This role is strictly focused on the operations and infrastructure that support our data science and engineering teams—it is not a data science or core LLM development position.
Key Responsibilities and Required Expertise
The successful candidate will be an expert in all the following areas, driving high availability, scalability, and security.
I.

Cloud Infrastructure & Automation
- Infrastructure as Code (Ia C): Deep expertise in managing and provisioning infrastructure using Terraform.
- Containerization & Orchestration: Advanced deployment, scaling, and management of services using Docker/Kubernetes.
- Networking & Services: Architecting and maintaining high-performance API Layers & Microservices.
- AWS Cloud Ops: Expert proficiency in AWS operational services, including Event Bridge and Step Functions, for building robust automation flows.
- Data Storage: Managing and optimizing critical AWS data services, including S3, Dynamo DB, Redshift, and Kinesis.
II.

MLOps Tooling & Monitoring
- ML/LLM Tooling Support: Provide and maintain the operational infrastructure for ML/LLM systems, including Model Registry/Versioning tools like MLflow/Sage Maker.
- Pipeline Automation (CI/CD): Designing and implementing robust CI/CD pipelines for ML/LLM deployments using tools like Git Hub Actions/Jenkins.
- Model Operations: Building the infrastructure to support Drift Detection & Retraining capabilities.
- Monitoring & Alerting: Implementing comprehensive observability stacks using Prometheus/Grafana/Cloud Watch.
- Incident Management: Leading resolution efforts for production issues, including expertise with Pager Duty and On-call responsibilities.
III.

Security & Compliance (Fin Ops)
- Cloud Security: Establishing and enforcing strong security policies and best practices across the cloud environment (IAM, VPC, Secrets).
- AWS Security Services: Expert knowledge and application of specific AWS security tools like IAM, KMS, and Secrets Manager.
- Cost Optimization: Leading initiatives for Cost Optimization (Fin Ops), balancing performance and efficiency across all cloud resources.


Required Skill Profession

Other General



Your Complete Job Search Toolkit

✨ Smart • Intelligent • Private • Secure

Start Using Our Tools

Join thousands of professionals who've advanced their careers with our platform

Rate or Report This Job
If you feel this job is inaccurate or spam kindly report to us using below form.
Please Note: This is NOT a job application form.


    Unlock Your Senior cloudops Potential: Insight & Career Growth Guide