Know ATS Score
CV/Résumé Score
  • Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: HPC Infrastructure Engineer.
India Jobs Expertini

Urgent! HPC Infrastructure Engineer Job Opening In Gurugram – Now Hiring Confidential

HPC Infrastructure Engineer



Job description

Roles & Responsibilities

  • Provide enterprise-level operational support to Managed Services customers for incident, problem, and change management activities
  • Plan and perform maintenance activities
  • Assess customer environments for performance and design issues and propose resolutions
  • Work across technical teams to troubleshoot complex infrastructure issues
  • Create and maintain detailed documentation
  • Serve as a subject matter expert and escalation point for storage technologies
  • Work with vendors to resolve storage issues
  • Communicate with customers and internal team with transparency
  • Participate in on-call rotation
  • Completion of training and certification as assigned to further skills and knowledge

Skills Required

  • Bachelors degree or equivalent Information Systems or related field.

    Unique education, specialized experience, skills, knowledge, training, or certification may be substituted for education
  • 5+ years of expert level experience managing infrastructure in high-performance computing environments including configuration, troubleshooting, and best practice
  • 1+ years of experience with Nvidia DGX preferred
  • Experience with high-performance computing (HPC) schedulers (e.g., SLURM, PBS, Torque) required
  • Experience configuring, maintaining and troubleshooting Kubernetes
  • Experience with storage technology (e.g., Ceph, Vast Data Platform) and distributed file systems (e.g., Lustre, GPFS, NFS, GlusterFS)
  • Experience with machine learning or data science workflows in HPC/AI environments
  • Advances experience with Linux operating systems
  • Experience configuring, maintaining and troubleshooting Nvidia/Mellanox (Cumulus OS) switches a plus
  • Experience with both ethernet and InfiniBand networking a plus
  • 1+ years working with monitoring platforms (e.g., Prometheus, Grafana); Elastic Observability experience is a bonus
  • 1+ years working with an enterprise ITSM system Service Now is a bonus
  • Previous experience with automation tools such as Ansible, Puppet, or Chef a plus
  • Managed Services or consulting experience is required
  • Strong background with customer service
  • High level problem-solving and communication skills
  • Strong oral and written communications skills
  • Related network certifications are a bonus

Why AHEAD:

Through our daily work and internal groups like Moving Women AHEAD and RISE AHEAD, we value and benefit from diversity of people, ideas, experience, and everything in between.

We fuel growth by stacking our office with top-notch technologies in a multi-million-dollar lab, by encouraging cross department training and development, sponsoring certifications and credentials for continued learning.

USA Employment Benefits include

  • Medical, Dental, and Vision Insurance
  • 401(k)
  • Paid company holidays
  • Paid time off
  • Paid parental and caregiver leave
  • Plus more! See benefits https://www.aheadbenefits.com/ for additional details

The compensation range indicated in this posting reflects the On-Target Earnings (OTE) for this role, which includes a base salary and any applicable target bonus amount.

This OTE range may vary based on the candidates relevant experience, qualifications, and geographic location.


Skills Required
Linux, Ceph, Hpc, Kubernetes


Required Skill Profession

Computer Occupations



Your Complete Job Search Toolkit

✨ Smart • Intelligent • Private • Secure

Start Using Our Tools

Join thousands of professionals who've advanced their careers with our platform

Rate or Report This Job
If you feel this job is inaccurate or spam kindly report to us using below form.
Please Note: This is NOT a job application form.


    Unlock Your HPC Infrastructure Potential: Insight & Career Growth Guide