Role Overview:  
We are looking for a skilled Senior Java Backend Engineer  to join our team focused on building scalable and high-performance backend systems for Generative AI applications.
You’ll play a key role in designing APIs, orchestrating AI agents, and integrating Large Language Models (LLMs) into production-ready systems.
This role is ideal for backend developers with a passion for modern AI technologies and distributed systems.
Key Responsibilities:  
- Design, build, and maintain scalable backend services using Java  and the Spring Boot  framework 
- Develop APIs to enable LLM integration  and support AI agent orchestration workflows 
- Architect microservices to power RAG (Retrieval-Augmented Generation)  and other LLM-driven systems 
- Optimize performance through efficient caching strategies  and vector database interactions  
- Integrate and manage connections with multiple LLM providers  (e.g., OpenAI, Gemini, Claude), including rate limiting  and failover handling  
- Implement real-time streaming  features for conversational AI systems using WebSockets  or similar technologies 
- Ensure robust system observability with logging , monitoring , and tracing  
Required Skills & Qualifications:  
3–7 years total, with a minimum of 1 year working on Generative AI projects 
Backend Development:  
- Minimum 3 years of hands-on experience with Java  and Spring Boot  
- Strong grasp of RESTful API  design principles and microservices architecture  
- Proficiency with core Spring modules: Spring Security , Spring Data JPA , Spring Cloud  
- Experience working with relational and NoSQL databases: PostgreSQL , MongoDB , Redis  
- Familiarity with message brokers  such as RabbitMQ  or Apache Kafka  
- Expertise in caching mechanisms  and system performance tuning 
Generative AI Integration:  
- Experience integrating LLM APIs  (OpenAI, Gemini, Claude, etc.) into backend services 
- Knowledge of vector databases  and semantic search  technologies 
- Familiarity with AI agent orchestration frameworks  (e.g., LangGraph ) 
- Understanding of RAG systems  and how to implement them effectively 
- Experience developing streaming responses  using WebSockets or server-sent events 
- Working knowledge of prompt templating and management systems  
Nice to Have:  
- Experience in fine-tuning LLMs  and managing model deployment pipelines  
- Knowledge of self-hosted LLM environments  and infrastructure management 
- Exposure to observability tools  like LangSmith or custom monitoring setups 
- Familiarity with natural language to SQL systems  or BI applications powered by LLMs 
Note:  If you are matching on to the above job description & skills, please feel free to fill in your details on the below Form enclosed (