DevOps Engineer
Location: Bengaluru, India (On-site)
Experience: 4–6 years | Type: Full-Time
Industry: Fintech / High-Performance Systems
Role Overview
We are seeking a DevOps Engineer with proven experience in managing on-premise, large-scale, and low-latency infrastructure.
The ideal candidate will have strong hands-on expertise in systems performance optimisation, infrastructure automation, and reliability engineering across high-traffic production environments.
This role demands deep technical ownership of infrastructure — from compute and networking to CI/CD and monitoring — ensuring speed, scalability, and uptime at all times.
Key Responsibilities
- Infrastructure Ownership: Design, deploy, and manage on-premise data centre environments — compute, networking, and storage.
- Performance Engineering: Optimise system configurations for low latency, high throughput, and resilience under scale.
- Automation & CI/CD: Build and maintain automated CI/CD pipelines (Jenkins, GitLab CI/CD, GitHub Actions), enabling fast and reliable releases.
- Monitoring & Observability: Implement end-to-end observability using Prometheus, Grafana, ELK, or Datadog for proactive system insights.
- Security & Reliability: Enforce security, access control, and compliance within high-availability environments.
- Collaboration: Partner with backend, platform, and data teams to design scalable and fault-tolerant systems.
Required Skills and Qualifications
- 4–6 years of hands-on experience in DevOps, Infrastructure, or SRE roles.
- On-premise infrastructure experience is mandatory — including setup, scaling, and maintenance.
- Strong expertise in Linux systems administration, networking, and system performance tuning.
- Proficiency with Infrastructure as Code tools (Terraform, Ansible, or similar).
- Hands-on experience with containerization and orchestration (Docker, Kubernetes).
- Deep understanding of monitoring and logging tools (Prometheus, Grafana, Datadog, ELK).
- Strong scripting skills in Python, Bash, or Go for automation and tool development.
- Experience managing high-traffic, low-latency systems in production.
Preferred Skills
- Experience with distributed systems and message brokers (Kafka, RabbitMQ).
- Understanding of load balancing, disaster recovery, and high availability strategies.
- Familiarity with hardware provisioning, network latency tuning, and resource optimisation.
- Exposure to performance-critical fintech, trading, or real-time systems.