- Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: Senior AI Research Engineer, Model Inference (100% Remote).
 
  
  
    
    
  
      Urgent! Senior AI Research Engineer, Model Inference (100% Remote) Job Opening In New Delhi – Now Hiring Tether Operations Limited
Join Tether and Shape the Future of Digital Finance
At Tether, we’re not just building products, we’re pioneering a global financial revolution.
Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains.
By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost.
Transparency is the bedrock of everything we do, ensuring trust in every transaction.
Innovate with Tether
Tether Finance: Our innovative product suite features the world’s most trusted stablecoin, USDT, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services.
But that’s just the beginning:
Tether Power: Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities.
Tether Data: Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like KEET, our flagship app that redefines secure and private data sharing.
Tether Education: Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity.
Tether Evolution: At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways.
Why Join Us?
Our team is a global talent powerhouse, working remotely from every corner of the world.
If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards.
We’ve grown fast, stayed lean, and secured our place as a leader in the industry.
If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.
Are you ready to be part of the future?
About the job:
We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration.
The engineer will extend the inference framework to support inference and fine-tuning for Language models with a strong focus on mobile and integrated GPU acceleration (Vulkan).
This role requires hands-on experience with quantization techniques, LoRA architectures, Vulkan backend, and mobile GPU debugging.
You will play a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLM/LLMs.
Responsibilities:
Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.
Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
Design, customize, and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows.
Investigate and resolve GPU acceleration issues on Vulkan and integrated/mobile GPUs.
Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.
Debug and optimize GPU operators (e.g., int8, fp16, fp4, ternary).
Integrate and validate quantization workflows for training and inference.
Conduct evaluation and benchmarking (e.g., perplexity testing, fine-tuned adapter performance).
Conduct GPU testing across desktop and mobile devices.
Collaborate with research and engineering teams to prototype, benchmark, and scale new model optimization methods.
Deliver production-grade, efficient language model deployment for mobile and edge use cases.
Work closely with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines designed for edge and on-device applications.
Define clear success metrics such as improved real-world performance, low error rates, robust scalability, optimal memory usage and ensure continuous monitoring and iterative refinements for sustained improvements.
Proficiency in C++ and GPU kernel programming.
Proven Expertise in GPU acceleration with Vulkan framework.
Strong background in quantization and mixed-precision model optimization.
Experience and Expertise in Vulkan compute shader development and customization.
Familiarity with LoRA fine-tuning and parameter-efficient training methods.
Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.
Hands-on experience with mobile GPU acceleration and model inference.
Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon etc.).
Experience implementing custom backward operators for fine-tuning.
Experience creating and curating custom datasets for style transfer and domain-specific fine-tuning.
Demonstrated ability to apply empirical research to overcome challenges in model
Important information for candidates
Recruitment scams have become increasingly common.
To protect yourself, please keep the following in mind when applying for roles:
Apply only through our official channels. We do not use third-party platforms or agencies for recruitment unless clearly stated.
All open roles are listed on our official careers page: https://tether.recruitee.com/
Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles.
If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website.
Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS.
All communication is done through official company emails and platforms.
Double-check email addresses. All communication from us will come from emails ending in @ or @
We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam.
Please report it immediately.
When in doubt, feel free to reach out through our official website.
✨ Smart • Intelligent • Private • Secure
Practice for Any Interview Q&A (AI Enabled)
Predict interview Q&A (AI Supported)
Mock interview trainer (AI Supported)
Ace behavioral interviews (AI Powered)
Record interview questions (Confidential)
Master your interviews
Track your answers (Confidential)
Schedule your applications (Confidential)
Create perfect cover letters (AI Supported)
Analyze your resume (NLP Supported)
ATS compatibility check (AI Supported)
Optimize your applications (AI Supported)
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
European Union Recommended
Institution Recommended
Institution Recommended
Researcher Recommended
IT Savvy Recommended
Trades Recommended
O*NET Supported
Artist Recommended
Researchers Recommended
Create your account
Access your account
Create your professional profile
Preview your profile
Your saved opportunities
Reviews you've given
Companies you follow
Discover employers
O*NET Supported
Common questions answered
Help for job seekers
How matching works
Customized job suggestions
Fast application process
Manage alert settings
Understanding alerts
How we match resumes
Professional branding guide
Increase your visibility
Get verified status
Learn about our AI
How ATS ranks you
AI-powered matching
Join thousands of professionals who've advanced their careers with our platform
Unlock Your Senior AI Potential: Insight & Career Growth Guide
Real-time Senior AI Jobs Trends in New Delhi, India (Graphical Representation)
Explore profound insights with Expertini's real-time, in-depth analysis, showcased through the graph below. This graph displays the job market trends for Senior AI in New Delhi, India using a bar chart to represent the number of jobs available and a trend line to illustrate the trend over time. Specifically, the graph shows 491071 jobs in India and 17222 jobs in New Delhi. This comprehensive analysis highlights market share and opportunities for professionals in Senior AI roles. These dynamic trends provide a better understanding of the job market landscape in these regions.
Great news! Tether Operations Limited is currently hiring and seeking a Senior AI Research Engineer, Model Inference (100% Remote) to join their team. Feel free to download the job details.
Wait no longer! Are you also interested in exploring similar jobs? Search now: Senior AI Research Engineer, Model Inference (100% Remote) Jobs New Delhi.
An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at Tether Operations Limited adheres to the cultural norms as outlined by Expertini.
The fundamental ethical values are:The average salary range for a Senior AI Research Engineer, Model Inference (100% Remote) Jobs India varies, but the pay scale is rated "Standard" in New Delhi. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.
Key qualifications for Senior AI Research Engineer, Model Inference (100% Remote) typically include Computer Occupations and a list of qualifications and expertise as mentioned in the job specification. Be sure to check the specific job listing for detailed requirements and qualifications.
To improve your chances of getting hired for Senior AI Research Engineer, Model Inference (100% Remote), consider enhancing your skills. Check your CV/Résumé Score with our free Resume Scoring Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.
 
            Here are some tips to help you prepare for and ace your job interview:
Before the Interview:To prepare for your Senior AI Research Engineer, Model Inference (100% Remote) interview at Tether Operations Limited, research the company, understand the job requirements, and practice common interview questions.
Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review the Tether Operations Limited's products or services and be prepared to discuss how you can contribute to their success.
By following these tips, you can increase your chances of making a positive impression and landing the job!
Setting up job alerts for Senior AI Research Engineer, Model Inference (100% Remote) is easy with India Jobs Expertini. Simply visit our job alerts page here, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!