• Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role.
India Jobs Expertini

AI Inference Kernel Engineer (CUDA) Job Opening In pushkar – Now Hiring Phinity


Job description

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient.

We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms.

Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack.

Of course, to automate algorithm and hardware discovery, we need to break the data barrier.

CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on.


Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery.

We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs.

Our customers include one of the largest frontier model labs.


We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model.

This is a hybrid Systems Engineer/AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry.

Please do not apply unless you have optimized kernels before.


Skill requirements:

Languages: CUDA, C++, Python,

Frameworks: JAX/XLA, PyTorch, TensorFlow (at the C++ level), Pallas

Libraries: cuBLAS, cuDNN, CUTLASS, CUB, Thrust

Compiler Tools: NVCC, PTX assembly, MLIR/XLA understanding

Hardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)


Apply if you have:

  • Achieved >10x speedups on production ML workloads
  • Written kernels that outperform vendor libraries
  • Optimized attention, GEMM, or convolution at the assembly level
  • Built custom fusions that beat XLA/Triton compiler output
  • Published papers or open-source kernels used in production

Required Skill Profession

Other General


  • Job Details

Related Jobs

Phinity hiring Ai inference kernel engineer (cuda) Job in Anand, Gujarat, India
Phinity
Anand, Gujarat, India
Phinity hiring Ai inference kernel engineer (cuda) Job in Eluru, Andhra Pradesh, India
Phinity
Eluru, Andhra Pradesh, India
Phinity hiring Ai inference kernel engineer (cuda) Job in Pune, Maharashtra, India
Phinity
Pune, Maharashtra, India
Phinity hiring Ai inference kernel engineer (cuda) Job in Kurnool, Andhra Pradesh, India
Phinity
Kurnool, Andhra Pradesh, India
Phinity hiring Ai inference kernel engineer (cuda) Job in Salem, Tamil Nadu, India
Phinity
Salem, Tamil Nadu, India
Phinity hiring Ai inference kernel engineer (cuda) Job in Belgaum, Karnataka, India
Phinity
Belgaum, Karnataka, India
Phinity hiring Ai inference kernel engineer (cuda) Job in Bhavnagar, Gujarat, India
Phinity
Bhavnagar, Gujarat, India
Phinity hiring Ai inference kernel engineer (cuda) Job in Delhi, Delhi, India
Phinity
Delhi, Delhi, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Hubli, Hubli, India
Phinity
Hubli, Hubli, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Jodhpur, Jodhpur, India
Phinity
Jodhpur, Jodhpur, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Thiruvananthapuram, Thiruvananthapuram, India
Phinity
Thiruvananthapuram, Thiruvananthapuram, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Mumbai, Mumbai, India
Phinity
Mumbai, Mumbai, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Thrissur, Thrissur, India
Phinity
Thrissur, Thrissur, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Udaipur, Udaipur, India
Phinity
Udaipur, Udaipur, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Salem, Salem, India
Phinity
Salem, Salem, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Thoothukudi, Thoothukudi, India
Phinity
Thoothukudi, Thoothukudi, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Prayagraj, Prayagraj, India
Phinity
Prayagraj, Prayagraj, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Madurai, Madurai, India
Phinity
Madurai, Madurai, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Varanasi, Varanasi, India
Phinity
Varanasi, Varanasi, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Belgaum, Belgaum, India
Phinity
Belgaum, Belgaum, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Nellore, Nellore, India
Phinity
Nellore, Nellore, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Thane, Thane, India
Phinity
Thane, Thane, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Jamnagar, Jamnagar, India
Phinity
Jamnagar, Jamnagar, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Kollam, Kollam, India
Phinity
Kollam, Kollam, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Kota, Kota, India
Phinity
Kota, Kota, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Tirupati, Tirupati, India
Phinity
Tirupati, Tirupati, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Hyderabad, Hyderabad, India
Phinity
Hyderabad, Hyderabad, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Ballari, Ballari, India
Phinity
Ballari, Ballari, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Mangalore, Mangalore, India
Phinity
Mangalore, Mangalore, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Kottayam, Kottayam, India
Phinity
Kottayam, Kottayam, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Ghaziabad, Ghaziabad, India
Phinity
Ghaziabad, Ghaziabad, India
Phinity hiring AI Inference Kernel Engineer (CUDA) Job in Amravati, Amravati, India
Phinity
Amravati, Amravati, India

Unlock Your AI Inference Potential: Insight & Career Growth Guide


Real-time AI Inference Jobs Trends (Graphical Representation)

Explore profound insights with Expertini's real-time, in-depth analysis, showcased through the graph here. Uncover the dynamic job market trends for AI Inference in pushkar, India, highlighting market share and opportunities for professionals in AI Inference roles.

110899 Jobs in India
110899
828 Jobs in Pushkar
828
Download Ai Inference Jobs Trends in Pushkar and India

Are You Looking for AI Inference Kernel Engineer (CUDA) Job?

Great news! is currently hiring and seeking a AI Inference Kernel Engineer (CUDA) to join their team. Feel free to download the job details.

Wait no longer! Are you also interested in exploring similar jobs? Search now: .

The Work Culture

An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at Phinity adheres to the cultural norms as outlined by Expertini.

The fundamental ethical values are:

1. Independence

2. Loyalty

3. Impartiapty

4. Integrity

5. Accountabipty

6. Respect for human rights

7. Obeying India laws and regulations

What Is the Average Salary Range for AI Inference Kernel Engineer (CUDA) Positions?

The average salary range for a varies, but the pay scale is rated "Standard" in pushkar. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.

What Are the Key Qualifications for AI Inference Kernel Engineer (CUDA)?

Key qualifications for AI Inference Kernel Engineer (CUDA) typically include Other General and a list of qualifications and expertise as mentioned in the job specification. The generic skills are mostly outlined by the . Be sure to check the specific job listing for detailed requirements and qualifications.

How Can I Improve My Chances of Getting Hired for AI Inference Kernel Engineer (CUDA)?

To improve your chances of getting hired for AI Inference Kernel Engineer (CUDA), consider enhancing your skills. Check your CV/Résumé Score with our free Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.

Interview Tips for AI Inference Kernel Engineer (CUDA) Job Success

Phinity interview tips for AI Inference Kernel Engineer (CUDA)

Here are some tips to help you prepare for and ace your AI Inference Kernel Engineer (CUDA) job interview:

Before the Interview:

Research: Learn about the Phinity's mission, values, products, and the specific job requirements and get further information about

Other Openings

Practice: Prepare answers to common interview questions and rehearse using the STAR method (Situation, Task, Action, Result) to showcase your skills and experiences.

Dress Professionally: Choose attire appropriate for the company culture.

Prepare Questions: Show your interest by having thoughtful questions for the interviewer.

Plan Your Commute: Allow ample time to arrive on time and avoid feeling rushed.

During the Interview:

Be Punctual: Arrive on time to demonstrate professionalism and respect.

Make a Great First Impression: Greet the interviewer with a handshake, smile, and eye contact.

Confidence and Enthusiasm: Project a positive attitude and show your genuine interest in the opportunity.

Answer Thoughtfully: Listen carefully, take a moment to formulate clear and concise responses. Highlight relevant skills and experiences using the STAR method.

Ask Prepared Questions: Demonstrate curiosity and engagement with the role and company.

Follow Up: Send a thank-you email to the interviewer within 24 hours.

Additional Tips:

Be Yourself: Let your personality shine through while maintaining professionalism.

Be Honest: Don't exaggerate your skills or experience.

Be Positive: Focus on your strengths and accomplishments.

Body Language: Maintain good posture, avoid fidgeting, and make eye contact.

Turn Off Phone: Avoid distractions during the interview.

Final Thought:

To prepare for your AI Inference Kernel Engineer (CUDA) interview at Phinity, research the company, understand the job requirements, and practice common interview questions.

Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review the Phinity's products or services and be prepared to discuss how you can contribute to their success.

By following these tips, you can increase your chances of making a positive impression and landing the job!

How to Set Up Job Alerts for AI Inference Kernel Engineer (CUDA) Positions

Setting up job alerts for AI Inference Kernel Engineer (CUDA) is easy with India Jobs Expertini. Simply visit our job alerts page here, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!