Job Description
            
                <p><p>We are seeking a skilled Datadog Observability & Automation Specialist with hands-on experience in building observability practices and implementing end-to-end automation, including AI and GenAI capabilities.
</p><p><br/></p><p>The ideal candidate will be responsible for configuring and optimizing observability platforms to deliver actionable insights into system performance and reliability across various industry use cases.</p><br/><p><b>Key Responsibilities :</b></p><p><br/></p><p>- Design, implement, and maintain heterogeneous observability solutions using infrastructure, logs, synthetic monitoring, automation, AI, and GenAI.</p><p><br/></p><p>- Create and manage dashboards, monitors, alerts, service maps, and user interfaces.</p><p><br/></p><p>- Collaborate with DevOps, Development, and Security teams to define and maintain SLIs, SLOs, and SLAs.</p><p><br/></p><p>- Develop integrations between observability platforms and other systems (e.g., hybrid cloud, on-prem data centers, end-user assets, Kubernetes, Terraform, CI/CD tools).</p><p><br/></p><p>- Optimize alerting mechanisms to reduce false positives and improve incident response.</p><p><br/></p><p>- Provide support during incidents, including root cause analysis and post-mortem reviews.</p><p><br/></p><p>- Conduct training sessions for internal teams on effective platform usage.</p><br/><p><b>Required Skills and Qualifications :</b></p><p><br/></p><p>- 6+ years of experience in development, automation, system monitoring, and DevOps.</p><p><br/></p><p>- 3+ years of hands-on experience with advanced automation and observability platforms such as Dynatrace, Datadog, AppDynamics, New Relic, Zabbix, ELK(Elasticsearch, Logstash, and Kibana.), AI/GenAI, and Machine Learning.</p><p><br/></p><p>- Strong understanding of infrastructure components including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), networking, and operating systems.</p><p><br/></p><p>- Proficiency in scripting languages such as Python, Bash, or Shell.</p><p><br/></p><p>- Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitHub Actions, Terraform, Packer).</p><p><br/></p><p>- Familiarity with log collection, parsing, and automation using observability platforms.</p><p><br/></p><p>- Strong analytical and problem-solving skills with a product-oriented mindset.</p><br/><p><b>Preferred Qualifications :</b></p><p><br/></p><p>- Certifications in observability platforms (e.g., Datadog Certified Monitoring Professional, Dynatrace, AppDynamics, ELK).</p><p><br/></p><p>- Experience with additional monitoring tools (e.g., Prometheus, Grafana, New Relic, Nagios, ManageEngine).</p><p><br/></p><p>- Familiarity with ITIL processes and incident management tools (e.g., PagerDuty, ServiceNow).</p><br/></p> (ref:hirist.tech)