Location: Remote (India)
- Type: Full-time
- Seniority: SDE II–III
Eligibility: IIT candidates only (B.Tech in progress or completed).
Self-learners who love to upskill are welcome—but IIT status is mandatory.
Why this role
Help build resilient back-end services that power real-time communications analytics and automation at scale.
You’ll design APIs, process streaming data, and integrate with voice/telephony and cloud services to deliver low-latency, high-reliability experiences.
What you’ll do
- Design and ship backend APIs (Django/DRF), background/async jobs (Celery/RQ), and secure integrations with external services.
- Build and optimize real-time data pipelines for call/quality metrics;
own observability, tracing, and SLOs (p95/p99). - Troubleshoot SIP/VoIP issues using packet captures and logs;
automate analysis wherepossible. - Containerize workloads with Docker, contribute to CI/CD, and collaborate on cloud deployment (Azure/AWS).
- Apply distributed-systems patterns (idempotency, retries, backoff, circuit breakers) for scale and fault tolerance.
- Model data in PostgreSQL and MongoDB;
tune indexes/queries;
maintain migration discipline. - (Good-to-have) Prototype AI/ML features for analytics, anomaly detection, and conversational agents.
What you’ll bring (must-have)
- Python & Django: RESTful APIs, authentication/authorization, background tasks, testing, and performance tuning.
- Networking fundamentals: TCP/IP, DNS, TLS, HTTP/2;
practical debugging with Wireshark;
familiarity with NetFlow/SNMP. - Telephony domain: SIP signaling, RTP/RTCP, call flows, PSTN/VoIP basics;
exposure to SBCs. - Cloud & DevOps: Working knowledge of Azure/AWS, Docker;
Kubernetes exposure is a plus;
comfort with CI/CD and secrets management. - Distributed systems: Microservices, event-driven design (queues/streams), horizontal scaling, resilience patterns.
- Databases: Hands-on with PostgreSQL and MongoDB (schema design, indexing, migrations, backups/restore).
- Mindset: Ownership, bias to automate, clear writing, and a product quality bar that values reliability and latency.
Nice to have
- Kubernetes (Helm), IaC (Terraform/Bicep), metrics/log stacks (Prometheus/Grafana/ELK), OpenTelemetry.
- Experience with Microsoft Graph/365 ecosystems, webhooks, or real-time comms SDKs.
- Practical AI/ML: time-series anomaly detection, clustering, or LLM-powered workflows.
90-Day success profile
- 30 days: Ship a well-tested Django service with one external integration;
add dashboards/alerts for core SLOs. - 60 days: Deliver an automated SIP call-flow parser or PCAP → insights tool;
reduce a p99 latency orerror budget hot spot. - 90 days: Lead a small initiative end-to-end (design → deploy → postmortem), document runbooks, and mentor a junior dev.
What we offer
- Impact on critical, high-scale real-time systems.
- Autonomy, fast feedback loops, and a builder culture.
- Competitive compensation;
remote-first with flexible hours aligned to IST.