Position Description:
Job Title: Production Support Engineer – Level 3 (Steady State Support)
Location: Mumbai /Chennai / Bangalore /Hyderabad
Work Schedule:
24/7 Production Support | Night Shifts | Rotational On-Call | Weekend/Holiday Support Required
Job Summary:
We are seeking a highly motivated and detail-oriented Production Support Engineer to join our 24/7 Production Operations Support team.
The ideal candidate will be responsible for providing day-to-day engineering and technical support for steady state clients, ensuring the stability, reliability, and performance of infrastructure and applications in a non-traditional, dynamic environment.
This role requires an individual who is independent, proactive, and able to multitask, with strong communication skills and a collaborative approach to problem-solving.
Key Responsibilities:
.
Provide Level 3 technical support for infrastructure and application incidents in client environments.
.
Liaise between Client Teams and internal Technical Teams for steady state support issues.
.
Participate in night shifts, on-call rotation, and maintenance windows, including weekends and holidays.
.
Troubleshoot and resolve infrastructure and application issues while meeting SLA metrics.
.
Collaborate with internal planning and oversight teams for escalation and incident resolution.
.
Maintain accurate incident documentation and communication with stakeholders.
.
Perform regular vulnerability assessments, work with internal teams for remediation, and maintain monthly tracking records.
.
Provide support across multi-tiered architectures, including databases, microservices, containers, and network infrastructure.
Required Skills & Experience:
.
Experience in Infrastructure/Operations in a UNIX/LINUX/Wintel environment.
.
Knowledge of automated job scheduling tools (e.g., Job Scheduler, Autosys).
.
Understanding of multi-tiered application architecture and network connectivity.
.
Familiarity with microservices, containers, and container orchestration tools (e.g., Rancher).
.
Basic to intermediate skills in Power BI and Java.
.
Knowledge of certificate management and SSL/TLS fundamentals.
.
Solid understanding of databases such as Oracle, MS-SQL, and PostgreSQL.
.
Proficiency in shell scripting or similar programming languages.
.
Strong foundation in networking and security principles.
.
Experience with incident management and root cause analysis.
.
Excellent problem-solving and troubleshooting abilities.
.
Strong written and oral communication skills with the ability to explain technical issues clearly to stakeholders.
.
Ability to take ownership and accountability for technical decisions.
.
Flexible to collaborate with global support teams across multiple time zones.
Preferred/Good-to-Have Skills:
.
Jenkins (CI/CD tools)
.
Snowflake (cloud data platform)
.
Apache Kafka (event streaming platform)
.
Azure Fundamentals, with hands-on experience in Azure Portal
.
Exposure to Security Vulnerability Management practices and tools
Skills: