Job Title: DevOps Engineer (Onsite – EST Shift)
Location: Kharadi, Pune (Onsite only)
Work Hours: EST Shift (6:30 PM IST – 3:30 AM IST)
About the Role
We are looking for a hands-on DevOps Engineer who can own our complete server and infrastructure ecosystem.
This is a one-person army role where you’ll manage, monitor, optimize, and document all On-Prem and Cloud servers across Windows and Linux.
If you love being fully responsible for infrastructure and solving problems end-to-end, this role is for you.
Key Responsibilities
- Server & Infrastructure Management
- Manage Windows, Linux, Virtual Servers, Hyper-V, and Proxmox environments.
- Consolidate servers from multiple sources into streamlined setups.
- Handle On-Prem and Cloud-hosted servers with business continuity in mind.
- Cloud & GPU Hosting
- Expertise in AWS and GPU hosting platforms (Vast.ai, RunPod, Lambda Labs).
- Create/manage GPU usage tokens; predict busy vs.
idle times. - Optimize and reduce GPU and cloud infrastructure costs.
- Backup & Recovery
- Perform full backup and restore of entire websites, web applications, and databases.
- Manage snapshots and disaster recovery workflows.
- Ensure business continuity and near 100% server availability.
- Containerization & Orchestration
- Strong in VMs, Docker, Kubernetes.
- Build and maintain scalable clusters and deployments.
- Database Management
- Backup and restore expertise in MySQL, PostgreSQL, MongoDB.
- Ensure high data availability and recovery strategies.
- Monitoring & Dashboards
- Monitor cloud/GPU servers, traffic, and thresholds.
- Build a single dashboard for On-Prem and Cloud visibility.
- Provide network/server diagrams and proper documentation.
- Security & Compliance
- Knowledge of SOPHOS or equivalent security solutions is a plus.
- Enforce server hardening and compliance best practices.
Requirements
- 5+ years of hands-on experience in DevOps/Infrastructure Engineering.
- Strong expertise in Windows, Linux, Proxmox, Hyper-V.
- Experience with AWS and GPU hosting platforms (Vast, RunPod, Lambda).
- Strong in VMs, Docker, Kubernetes.
- Database experience: MySQL, PostgreSQL, MongoDB (backup & restore).
- Proven ability to optimize servers for performance and cost reduction.
- Experience with website/web app full backup & restore.
- Strong skills in network design, monitoring, and disaster recovery planning.
- Excellent problem-solving, documentation, and communication skills.
- Ability to work independently in a fast-paced, mission-critical environment.
Good to Have
- Familiarity with SOPHOS security tools.
- Deep knowledge of Proxmox clustering and management.
- Experience in capacity planning and multi-cloud optimization.
- Knowledge of VOIP and FreeSwitch
What We Offer
- Ownership of infrastructure—your playground to design, scale, and optimize.
- Exposure to cutting-edge GPU and Cloud hosting environments.
- Onsite role with a dynamic team building AI-driven recruitment solutions.
👉 Apply now with your updated resume and examples of server/web app backup-restore projects or GPU/cloud optimizations you’ve delivered.