Job description
Senior Manager | Platform Engineering & API Ecosystems
Why This Role Matters
At Foodhub, every order and payment flows through our Order & Transaction Platform.
It’s a high-throughput backbone moving millions of requests daily — where reliability, speed, and financial accuracy cannot be compromised.
We are continuously evolving this core into a cloud-native, event-driven, AI-ready platform that scales globally.
This role is about shaping that platform: part architect, part hands-on engineer, part mentor.
You’ll be building systems that define what “real-time commerce at scale” looks like.
What You’ll Drive
Architecture & Platform Evolution
Design and scale event-driven microservices that power ordering, payments, reconciliation, and partner integrations.
Implement CQRS, event sourcing, and ledger-based models for auditability and financial correctness.
Build a modern API ecosystem — REST, GraphQL, gRPC — with schema governance and backward compatibility.
Create real-time data and event streams to power fraud detection, recommendations, and operational analytics.
Performance & Reliability at Scale
Ensure five-nines reliability for high-volume order and transaction systems.
Solve concurrency, partitioning, and eventual consistency challenges across distributed services.
Deploy fault-tolerance strategies — circuit breakers, idempotency, backpressure handling — to protect systems under extreme load.
Advance SRE practices with chaos engineering, observability-first design, and automated failover.
Leadership & Mentorship
Lead globally distributed teams of platform engineers.
Balance 50–60% IC work (coding, design reviews, prototyping) with strategic leadership.
Create a culture of ownership, resilience, and continuous learning.
Act as a technical mentor for engineers across the company, raising the bar in distributed system design.
Must-Have Skills
10+ years in backend/platform engineering with leadership in transaction-heavy platforms.
Deep expertise in distributed systems: ordering, payments, reconciliation, data integrity.
Hands-on mastery of Node.js/TypeScript in cloud-native and serverless-first architectures.
Proven experience running systems at high scale and high reliability (millions of daily transactions).
A “player-coach” mindset: comfortable diving into production issues, leading design sessions, and mentoring teams.
Desirable Experience
Languages & Frameworks: TypeScript, Node.js, Express.js, NestJS
Cloud & Data: AWS (Lambda, DynamoDB, RDS), PostgreSQL, Redis
Event Streaming: Kafka, Pulsar, Kinesis — designing high-throughput, low-latency pipelines
Architecture Patterns: Microservices, event-driven systems, CQRS, event sourcing, domain-driven design
APIs: REST, GraphQL, gRPC with schema-first governance and strong versioning discipline
Platform & Ops: k8s (EKS or equivalent), service mesh, distributed tracing, full observability stacks
AI Readiness: Building real-time fraud detection signals, anomaly detection, recommendation feeds
Domain Knowledge: High-scale commerce, payments, or financial transaction platforms
Certifications (Preferred)
- AWS Certification
- HashiCorp Certification
Additional Qualities
- Ability to adapt and thrive in a fast-changing, highly collaborative Agile environment.
- Strong aptitude for learning new languages and frameworks.
- Solid experience in supporting applications, ensuring stability, and improving operational performance.
Required Skill Profession
Other Management Occupations