Job description
Location: Remote (India) • Type: Full-time • Seniority: SDE II–III
Eligibility: IIT candidates only (B.Tech in progress or completed ).
Self-learners who love to upskill are welcome—but IIT status is mandatory .
Why this role
Help build resilient back-end services that power real-time communications analytics and automation at scale.
You’ll design APIs, process streaming data, and integrate with voice/telephony and cloud services to deliver low-latency, high-reliability experiences.
What you’ll do
Design and ship backend APIs (Django/DRF), background/async jobs (Celery/RQ), and secure integrations with external services.
Build and optimize real-time data pipelines for call/quality metrics; own observability, tracing, and SLOs (p95/p99).
Troubleshoot SIP/VoIP issues using packet captures and logs; automate analysis where possible.
Containerize workloads with Docker , contribute to CI/CD , and collaborate on cloud deployment (Azure/AWS).
Apply distributed-systems patterns (idempotency, retries, backoff, circuit breakers) for scale and fault tolerance.
Model data in PostgreSQL and MongoDB ; tune indexes/queries; maintain migration discipline.
(Good-to-have) Prototype AI/ML features for analytics, anomaly detection, and conversational agents.
What you’ll bring (must-have)
Python & Django : RESTful APIs, authentication/authorization, background tasks, testing, and performance tuning.
Networking fundamentals : TCP/IP, DNS, TLS, HTTP/2; practical debugging with Wireshark ; familiarity with NetFlow/SNMP .
Telephony domain : SIP signaling , RTP/RTCP, call flows, PSTN/VoIP basics; exposure to SBCs .
Cloud & DevOps : Working knowledge of Azure/AWS , Docker; Kubernetes exposure is a plus; comfort with CI/CD and secrets management.
Distributed systems : Microservices, event-driven design (queues/streams), horizontal scaling, resilience patterns.
Databases : Hands-on with PostgreSQL and MongoDB (schema design, indexing, migrations, backups/restore).
Mindset : Ownership, bias to automate, clear writing, and a product quality bar that values reliability and latency.
Nice to have
Kubernetes (Helm), IaC (Terraform/Bicep), metrics/log stacks (Prometheus/Grafana/ELK), OpenTelemetry.
Experience with Microsoft Graph/365 ecosystems, webhooks, or real-time comms SDKs.
Practical AI/ML : time-series anomaly detection, clustering, or LLM-powered workflows.
90-Day success profile
30 days: Ship a well-tested Django service with one external integration; add dashboards/alerts for core SLOs.
60 days: Deliver an automated SIP call-flow parser or PCAP → insights tool; reduce a p99 latency or error budget hot spot.
90 days: Lead a small initiative end-to-end (design → deploy → postmortem), document runbooks, and mentor a junior dev.
What we offer
Impact on critical, high-scale real-time systems.
Autonomy, fast feedback loops, and a builder culture.
Competitive compensation; remote-first with flexible hours aligned to IST.
Required Skill Profession
Computer Occupations