Know ATS Score
CV/Résumé Score
  • Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: Generative AI Data Engineer.
India Jobs Expertini

Urgent! Generative AI Data Engineer Job Opening In India, India – Now Hiring IT Firm

Generative AI Data Engineer



Job description

<p><p><b>About the Role :<br/></b><br/>We are seeking a GenAI Data Engineer to design, build, and optimize data pipelines for unstructured and semi-structured content, integrating advanced AI/ML capabilities.

This role combines modern ETL expertise with Vector Database & GenAI integration to support intelligent document processing and semantic search applications.<br/><br/><b>Key Responsibilities :</b><br/><br/>- Develop and maintain data ingestion pipelines using Azure Data Factory (ADF) and Databricks for structured and unstructured data.<br/><br/>- Create notebooks to process PDF and Word documents, including extracting text, tables, charts, graphs, and images.<br/><br/>- Apply NLP / Embedding Models (e.g., OpenAI, Hugging Face, sentence-transformers) to convert extracted content into embeddings.<br/><br/>- Store embeddings and metadata into Vector Databases (e.g., FAISS, Pinecone, Milvus, Weaviate, ChromaDB).<br/><br/>- Design and implement semantic search and retrieval workflows to enable prompt-based query capabilities.<br/><br/>- Optimize ETL pipelines for scalability, reliability, and performance.<br/><br/>- Collaborate with data scientists and solution architects to integrate GenAI capabilities into enterprise applications.<br/><br/>- Follow best practices for code quality, modularity, and documentation.<br/><br/><b>Required Skills & Experience :</b><br/><br/>- Proven experience in Azure Data Factory (ADF) and Databricks for building ETL/ELT workflows.<br/><br/>- Strong programming experience in Python (pandas, PySpark, PyPDF, python-docx, OCR libraries, etc.).<br/><br/>- Hands-on experience with Vector Databases and semantic search implementation.<br/><br/>- Understanding of embedding models, LLM-based retrieval, and prompt engineering.<br/><br/>- Familiarity with handling multi-modal data (text, tables, images, charts).<br/><br/>- Strong knowledge of data modeling, indexing, and query optimization.<br/><br/>- Experience with cloud platforms (Azure preferred).<br/><br/>- Strong problem-solving, debugging, and communication skills.<br/><br/><b>Nice to Have :</b><br/><br/>- Experience with knowledge graphs or RAG (Retrieval-Augmented Generation) pipelines.<br/><br/>- Exposure to MLOps practices and LLM fine-tuning.<br/><br/>- Familiarity with enterprise-scale document management systems.</p><br/></p> (ref:hirist.tech)


Required Skill Profession

Computer Occupations



Your Complete Job Search Toolkit

✨ Smart • Intelligent • Private • Secure

Start Using Our Tools

Join thousands of professionals who've advanced their careers with our platform

Rate or Report This Job
If you feel this job is inaccurate or spam kindly report to us using below form.
Please Note: This is NOT a job application form.


    Unlock Your Generative AI Potential: Insight & Career Growth Guide