- Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: AI Inference Kernel Engineer (CUDA).
 
  
  
    
    
  
      Urgent! AI Inference Kernel Engineer (CUDA) Job Opening In kottayam – Now Hiring Phinity
We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient.
We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms.
Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack.
Of course, to automate algorithm and hardware discovery, we need to break the data barrier.
CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on.
Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery.
We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs.
Our customers include one of the largest frontier model labs.
We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model.
This is a hybrid Systems Engineer/AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry.
Please do not apply unless you have optimized kernels before.
Skill requirements:
Languages: CUDA, C++, Python,
Frameworks: JAX/XLA, PyTorch, TensorFlow (at the C++ level), Pallas
Libraries: cuBLAS, cuDNN, CUTLASS, CUB, Thrust
Compiler Tools: NVCC, PTX assembly, MLIR/XLA understanding
Hardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)
Apply if you have:
✨ Smart • Intelligent • Private • Secure
Practice for Any Interview Q&A (AI Enabled)
Predict interview Q&A (AI Supported)
Mock interview trainer (AI Supported)
Ace behavioral interviews (AI Powered)
Record interview questions (Confidential)
Master your interviews
Track your answers (Confidential)
Schedule your applications (Confidential)
Create perfect cover letters (AI Supported)
Analyze your resume (NLP Supported)
ATS compatibility check (AI Supported)
Optimize your applications (AI Supported)
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
European Union Recommended
Institution Recommended
Institution Recommended
Researcher Recommended
IT Savvy Recommended
Trades Recommended
O*NET Supported
Artist Recommended
Researchers Recommended
Create your account
Access your account
Create your professional profile
Preview your profile
Your saved opportunities
Reviews you've given
Companies you follow
Discover employers
O*NET Supported
Common questions answered
Help for job seekers
How matching works
Customized job suggestions
Fast application process
Manage alert settings
Understanding alerts
How we match resumes
Professional branding guide
Increase your visibility
Get verified status
Learn about our AI
How ATS ranks you
AI-powered matching
Join thousands of professionals who've advanced their careers with our platform
Unlock Your AI Inference Potential: Insight & Career Growth Guide
Real-time AI Inference Jobs Trends in kottayam, India (Graphical Representation)
Explore profound insights with Expertini's real-time, in-depth analysis, showcased through the graph below. This graph displays the job market trends for AI Inference in kottayam, India using a bar chart to represent the number of jobs available and a trend line to illustrate the trend over time. Specifically, the graph shows 110899 jobs in India and 829 jobs in kottayam. This comprehensive analysis highlights market share and opportunities for professionals in AI Inference roles. These dynamic trends provide a better understanding of the job market landscape in these regions.
Great news! Phinity is currently hiring and seeking a AI Inference Kernel Engineer (CUDA) to join their team. Feel free to download the job details.
Wait no longer! Are you also interested in exploring similar jobs? Search now: AI Inference Kernel Engineer (CUDA) Jobs kottayam.
An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at Phinity adheres to the cultural norms as outlined by Expertini.
The fundamental ethical values are:The average salary range for a AI Inference Kernel Engineer (CUDA) Jobs India varies, but the pay scale is rated "Standard" in kottayam. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.
Key qualifications for AI Inference Kernel Engineer (CUDA) typically include Other General and a list of qualifications and expertise as mentioned in the job specification. Be sure to check the specific job listing for detailed requirements and qualifications.
To improve your chances of getting hired for AI Inference Kernel Engineer (CUDA), consider enhancing your skills. Check your CV/Résumé Score with our free Resume Scoring Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.
 
            Here are some tips to help you prepare for and ace your job interview:
Before the Interview:To prepare for your AI Inference Kernel Engineer (CUDA) interview at Phinity, research the company, understand the job requirements, and practice common interview questions.
Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review the Phinity's products or services and be prepared to discuss how you can contribute to their success.
By following these tips, you can increase your chances of making a positive impression and landing the job!
Setting up job alerts for AI Inference Kernel Engineer (CUDA) is easy with India Jobs Expertini. Simply visit our job alerts page here, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!