Job description
Data Platform Engineer DevOps
Bengaluru, KA
Work from office (WFO)
We’re hiring a Software Engineer with deep expertise in distributed systems who can work independently to deliver platform features and provide DevOps support for existing services.
This role combines data platform engineering—designing and evolving distributed systems that power ingestion, streaming, storage, and governance—with hands-on operational responsibilities.
In addition to building and shipping new features, you will support incident response, on-call coverage, system maintenance, and operational health monitoring (including coverage during Indian business hours).
Your work will directly improve the reliability, scalability, and efficiency of our core data platforms.
Key Responsibilities
Platform Engineering
- Independently design, build, and deliver distributed services and features end-to-end.
- Enhance platform capabilities across ingestion, streaming, lakehouse/warehouse, catalog, and governance.
- Write production-grade code with observability (metrics, logs, traces) and SLOs.
- Debug, optimize, and scale services for reliability, cost, and performance.
- Contribute to design docs and reviews, ensuring best practices in security and efficiency.
DevOps & Support
- Provide DevOps coverage for platform services during Indian business hours.
- Participate in incident response and on-call support for Java platform services.
- Diagnose and resolve application-level issues (memory leaks, GC pauses, thread deadlocks, CPU bottlenecks).
- Apply short-term fixes (service restarts, config changes, rollbacks) and escalate critical issues as needed.
- Conduct operational maintenance: upgrades, patching, configuration validation, and health monitoring.
- Maintain runbooks, response procedures, and documentation.
- Collaborate during deployments, analyze logs/metrics/traces, and partner with infra/DB/observability teams to tune systems.
Required Skills & Experience
- 6+ years of professional software engineering experience.
- Strong foundation in distributed systems (replication, consensus, partitioning, fault tolerance).
- Strong proficiency in Java (a must), optionally alongside Python, with experience in CI/CD workflows.
- Hands-on experience with Kafka/Flink, Spark, Delta/Iceberg, Kubernetes, or NoSQL/columnar stores.
- Proven ability to work independently with minimal supervision.
- Excellent debugging, system design, and performance optimization skills.
 
                    
                    
Required Skill Profession
Computer Occupations