Role Overview:    
  
We are looking for a skilled Senior Java Backend Engineer   to join our team focused on building scalable and high-performance backend systems for Generative AI applications.
You’ll play a key role in designing APIs, orchestrating AI agents, and integrating Large Language Models (LLMs) into production-ready systems.
This role is ideal for backend developers with a passion for modern AI technologies and distributed systems.
 
  
Key Responsibilities:    
- Design, build, and maintain scalable backend services using Java   and the Spring Boot   framework  
- Develop APIs to enable LLM integration   and support AI agent orchestration workflows  
- Architect microservices to power RAG (Retrieval-Augmented Generation)   and other LLM-driven systems  
- Optimize performance through efficient caching strategies   and vector database interactions    
- Integrate and manage connections with multiple LLM providers   (e.G., OpenAI, Gemini, Claude), including rate limiting   and failover handling    
- Implement real-time streaming   features for conversational AI systems using WebSockets   or similar technologies  
- Ensure robust system observability with logging  , monitoring  , and tracing    
  
Required Skills & Qualifications:    
3–7 years total, with a minimum of 1 year working on Generative AI projects  
  
Backend Development:    
- Minimum 3 years of hands-on experience with Java   and Spring Boot    
- Strong grasp of RESTful API   design principles and microservices architecture    
- Proficiency with core Spring modules: Spring Security  , Spring Data JPA  , Spring Cloud    
- Experience working with relational and NoSQL databases: PostgreSQL  , MongoDB  , Redis    
- Familiarity with message brokers   such as RabbitMQ   or Apache Kafka    
- Expertise in caching mechanisms   and system performance tuning  
  
Generative AI Integration:    
- Experience integrating LLM APIs   (OpenAI, Gemini, Claude, etc.) into backend services  
- Knowledge of vector databases   and semantic search   technologies  
- Familiarity with AI agent orchestration frameworks   (e.G., LangGraph  )  
- Understanding of RAG systems   and how to implement them effectively  
- Experience developing streaming responses   using WebSockets or server-sent events  
- Working knowledge of prompt templating and management systems    
  
Nice to Have:    
- Experience in fine-tuning LLMs   and managing model deployment pipelines    
- Knowledge of self-hosted LLM environments   and infrastructure management  
- Exposure to observability tools   like LangSmith or custom monitoring setups  
- Familiarity with natural language to SQL systems   or BI applications powered by LLMs  
  
Note:   If you are matching on to the above job description & skills, please feel free to fill in your details on the below Form enclosed (