Job Title: VLM Research Engineer
Location: Vapi, Gujarat
Employment Type: Full-Time
Overview
We are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms.
The role involves developing advanced models that bridge perception and language understanding for autonomous systems.
Key Responsibilities
Must-Haves
Nice-to-Haves
Success Metrics
Domain Notes
Humanoids:
- Language-guided manipulation and tool use.
AGVs (Autonomous Ground Vehicles):
- Natural language tasking for warehouse operations;
semantic maps.
Cars:
- Gesture and sign interpretation;
driver interaction.
Drones:
- Natural language mission specification;
target search and inspection.
Application Instructions
Interested candidates may apply by sending their resume and cover letter to parijat.patel@merai.co with the subject line: “VLM Research Engineer Application”.