- Expertini Resume Scoring: See how well your CV/Résumé matches this job: Forage AI Data Pipeline Engineer Python/ETL/SQL.
Urgent! Forage AI - Data Pipeline Engineer - Python/ETL/SQL Vacancy | Forage AI
<p><p><b>Description : </b> Data Pipeline Engineer Web Services, WebCrawling, ETL, NLP(spaCy/LLM), AWS.
Experience Level : 5-7 years of relevant experience in data engineering.</p><br/><br/><p><b>About Forage AI : </b> </p><p><br/></p><p>Forage AI is a pioneering AI-powered data extraction and automation company that transforms complex, unstructured web and document data into clean, structured intelligence.
Our platform combines web crawling, NLP, LLMs, and agentic AI to deliver highly accurate firmographic and enterprise insights across numerous domains.
Trusted by global clients in finance, real estate, and healthcare, Forage AI enables businesses to automate workflows, reduce manual rework, and access high-quality data at scale.</p><br/><br/><p><b>About the Role : </b></p><p><br/></p><p>We are seeking a Data Pipeline Engineer to develop, optimize, and maintain production-grade data pipelines focused on web data extraction and ETL workflows.
This is a hands-on role requiring strong experience with Python (as the primary programming language), spaCy, LLMs, webcrawling, and cloud deployment in containerized environments.
</p><p><br/></p><p>Youll have opportunities to propose, experiment with, and implement GenAI-driven approaches, innovative automations, and new strategies as part of our product and pipeline evolution.
Candidates should have 5-8 years of relevant experience in data engineering, software engineering, or related fields.</p><br/><br/><p><b>Key Responsibilities : </b></p><br/><p><br/></p><p>- Design, build, and manage scalable pipelines for ingesting, processing, and storing web and API data.</p><br/><p><br/></p><p>- Develop robust web crawlers and scrapers in Python (Scrapy, lxml, Playwright) for structured and unstructured data.</p><br/><p><br/></p><p>- Create and monitor ETL workflows for data cleansing, transformation, and loading into PostgreSQL and MongoDB.</p><br/><p><br/></p><p>- Apply spaCy for NLP tasks and integrate/fine-tune modern LLMs for analytics.</p><br/><p><br/></p><p>- DriveGenAI-based innovation and automation in core data workflows.</p><br/><p><br/></p><p>- Develop and deploy secure REST APIs and web services for data access and Integrate RabbitMQ,Kafka, SQS(for distributed queueing), and Redis (for caching) into data workflows; also proficient with distributed queue tools such as Celery, TaskIQ.</p><br/><p><br/></p><p>- Containerize and deploy solutions using Docker on AWS(EC2, ECS, Lambda).</p><br/><p><br/></p><p>- Collaborate with data teams, maintain pipeline documentation, and enforce data quality standards.</p><br/><p><br/></p><p>- Maintain and enhance legacy in-house applications as required.</p><br/><br/><p><b>Technical Skills & Requirements : </b></p><br/><p><br/></p><p>- Primary programming language is Python; must have experience writing independent Python packages.</p><br/><p><br/></p><p>- Experience with multithreading and asynchronous programming in Python.</p><br/><p><br/></p><p>- Advanced Python skills, including web crawling (Scrapy, lxml, Playwright) and strong SQL/data handling abilities.</p><br/><p><br/></p><p>- Experience with PostgreSQL (SQL) and MongoDB (NoSQL).</p><br/><p><br/></p><p>- Proficient with workflow orchestration tools such as Airflow.</p><br/><p><br/></p><p>- Hands-on experience with RabbitMQ, Kafka, SQS(for queueing/distributed processing), and Redis (for caching).</p><br/><p><br/></p><p>- Practical experience with spaCy for NLP and integration of at least one LLM platform (OpenAI, HuggingFace, etc.).</p><br/><p><br/></p><p>- Experience with GenAI/LLMs, prompt engineering, or integrating GenAI features into data products.</p><br/><p><br/></p><p>- Proficiency with Docker and AWS services (EC2, ECS, Lambda).</p><br/><p><br/></p><p>- Experienced in developing secure, scalable REST APIs using FastAPI and/or Flask.</p><br/><p><br/></p><p>- Familiarity with third-party APIs integration, including authentication, data handling, and rate limiting.</p><br/><p><br/></p><p>- Proficient in using Git for version control and collaboration.</p><br/><p><br/></p><p>- Strong analytical, problem-solving, and documentation skills.</p><br/><p><br/></p><p>- Bachelors or Masters degree in Computer Science or related field.</p><br/><br/><p><b>What We Offer : </b></p><br/><p><br/></p><p>- High ownership and autonomy in shaping technical solutions and system architecture.</p><br/><p><br/></p><p>- Opportunities to learn modern technologies and propose technical initiatives including GenAI-based approaches.</p><br/><p><br/></p><p>- Collaborative, supportive, and growth-oriented engineering culture.</p><br/><p><br/></p><p>- Exposure to a broad set of business and technical problems.</p><br/><p><br/></p><p>- Structured onboarding and domain training.</p><br/><p><br/></p><p>- Work-from-Home : </b></p><br/><p><br/></p><p>- Business-grade computer (modern processor i7, i9 , 16 GB+ RAM) with no performance obstacles.</p><br/><p><br/></p><p>- Reliable high-speed internet for video calls and remote work.</p><br/><p><br/></p><p>- Quality headphones & camera for clear audio and video.</p><br/><p><br/></p><p>- Stable power supply and backup options in case of outages.</p><br/></p> (ref:hirist.tech)
✨ Smart • Intelligent • Private • Secure
Practice for Any Interview Q&A (AI Enabled)
Predict interview Q&A (AI Supported)
Mock interview trainer (AI Supported)
Ace behavioral interviews (AI Powered)
Record interview questions (Confidential)
Master your interviews
Track your answers (Confidential)
Schedule your applications (Confidential)
Create perfect cover letters (AI Supported)
Analyze your resume (NLP Supported)
ATS compatibility check (AI Supported)
Optimize your applications (AI Supported)
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
European Union Recommended
Institution Recommended
Institution Recommended
Researcher Recommended
IT Savvy Recommended
Trades Recommended
O*NET Supported
Artist Recommended
Researchers Recommended
Create your account
Access your account
Create your professional profile
Preview your profile
Your saved opportunities
Reviews you've given
Companies you follow
Discover employers
O*NET Supported
Common questions answered
Help for job seekers
How matching works
Customized job suggestions
Fast application process
Manage alert settings
Understanding alerts
How we match resumes
Professional branding guide
Increase your visibility
Get verified status
Learn about our AI
How ATS ranks you
AI-powered matching
Join thousands of professionals who've advanced their careers with our platform
Unlock Your Forage AI Potential: Insight & Career Growth Guide
Real-time Forage AI Jobs Trends in India, India (Graphical Representation)
Explore profound insights with Expertini's real-time, in-depth analysis, showcased through the graph below. This graph displays the job market trends for Forage AI in India, India using a bar chart to represent the number of jobs available and a trend line to illustrate the trend over time. Specifically, the graph shows 110097 jobs in India and 6616 jobs in India. This comprehensive analysis highlights market share and opportunities for professionals in Forage AI roles. These dynamic trends provide a better understanding of the job market landscape in these regions.
Great news! Forage AI is currently hiring and seeking a Forage AI Data Pipeline Engineer Python/ETL/SQL to join their team. Feel free to download the job details.
Wait no longer! Are you also interested in exploring similar jobs? Search now: Forage AI Data Pipeline Engineer Python/ETL/SQL Jobs India.
An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at Forage AI adheres to the cultural norms as outlined by Expertini.
The fundamental ethical values are:The average salary range for a Forage AI Data Pipeline Engineer Python/ETL/SQL Jobs India varies, but the pay scale is rated "Standard" in India. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.
Key qualifications for Forage AI Data Pipeline Engineer Python/ETL/SQL typically include Computer Occupations and a list of qualifications and expertise as mentioned in the job specification. Be sure to check the specific job listing for detailed requirements and qualifications.
To improve your chances of getting hired for Forage AI Data Pipeline Engineer Python/ETL/SQL, consider enhancing your skills. Check your CV/Résumé Score with our free Resume Scoring Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.
Here are some tips to help you prepare for and ace your job interview:
Before the Interview:To prepare for your Forage AI Data Pipeline Engineer Python/ETL/SQL interview at Forage AI, research the company, understand the job requirements, and practice common interview questions.
Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review the Forage AI's products or services and be prepared to discuss how you can contribute to their success.
By following these tips, you can increase your chances of making a positive impression and landing the job!
Setting up job alerts for Forage AI Data Pipeline Engineer Python/ETL/SQL is easy with Expertini. Simply visit our job alerts page here, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!