Senior Applied AI Engineer - Model Optimization & Strategy

syncarp India
Relocation
Apply
AI Summary

We are looking for an expert-level Applied AI Engineer who operates at the cutting edge of model optimization and large-scale AI systems. This role offers a unique global career path with opportunities for growth and development. The ideal candidate will have a deep understanding of AI and machine learning concepts, as well as strong technical skills and leadership abilities.

Key Highlights
Model Optimization & Fine-Tuning
Performance, Quantization & Inference
State-of-the-Art Innovation
Key Responsibilities
Implement PEFT, LoRA, QLoRA to fine-tune open-source LLMs
Customize models for domain-specific, production-grade use cases
Act as a trusted AI advisor to C-level leaders
Technical Skills Required
PyTorch TensorFlow Transformers Attention mechanisms vLLM TGI custom model serving quantization GPU optimization Model Ops
Benefits & Perks
Relocation package provided
Japanese language learning support
Global career path
Nice to Have
Experience with training data curation, synthetic data, and RLHF concepts
Executive-level communication and influence

Job Description


Hiring for a Global IT service provider, in AI Forward Deployed Engineers.

Experience: 8+ Years, based out of Chennai/Bangalore


🚀 Hiring: Forward Deployed Engineers/AI Architect – Model Optimization & Strategy

Experience: 8+ Years | Location: Chennai / Bangalore → Japan (Relocation)


Job Type: Full-time | Global AI Leadership Role

🌍 About the Role

We are looking for an expert-level Applied AI Engineer who operates at the cutting edge of model optimization and large-scale AI systems. This is not an API-only role—you will optimize, fine-tune, and deploy models at the core level, define technical standards, and act as a strategic advisor to senior leadership.


This role offers a unique global career path:

  • First 18 months: Based in Chennai or Bangalore
  • Post 18 months: Relocation to Japan
  • Language: Willingness to learn Japanese (company-supported)


🧠 What You’ll Do


Model Optimization & Fine-Tuning

  • Implement PEFT, LoRA, QLoRA to fine-tune open-source LLMs (LLaMA-class, Mistral-class)
  • Customize models for domain-specific, production-grade use cases
  • Handle complex edge cases in large-scale deployments


Performance, Quantization & Inference

  • Optimize inference cost and latency using quantization techniques (GGUF, AWQ)
  • Manage GPU memory efficiently and squeeze maximum performance from hardware
  • Optimize dense vectors and embedding pipelines


State-of-the-Art Innovation

  • Continuously evaluate and integrate emerging research (e.g., State Space Models, long-context optimization)
  • Translate cutting-edge research into real-world client deliverables


Strategic & Executive Engagement

  • Act as a trusted AI advisor to C-level leaders
  • Define the “Art of the Possible” for enterprise AI
  • Shape long-term AI roadmaps balancing cost, risk, and performance


Thought Leadership

  • Represent the organization at industry forums, conferences, and internal playbooks
  • Define technical culture and standards across AI teams

🔧 Technical Requirements

  • Expert-level PyTorch (TensorFlow exposure is a plus)
  • Deep understanding of Transformers & Attention mechanisms
  • Experience with vLLM, TGI, and custom model serving
  • Strong grasp of quantization, GPU optimization, and Model Ops
  • Experience with training data curation, synthetic data, and RLHF concepts


🌟 Leadership & Soft Skills

  • Executive-level communication and influence
  • Ability to lead cross-org initiatives and resolve conflicts
  • Strategic decision-making mindset
  • Passion for building long-term AI vision and culture
  • Openness and commitment to learning Japanese for Japan relocation


✈️ Global Mobility

  • Initial 18 months: Chennai or Bangalore
  • Thereafter: Long-term relocation to Japan
  • Japanese language learning support provided


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

pelatro

India

TinyML / Embedded AI Principal Engineer

Programming
2d ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

L&T Technology Services

India
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Internship

ct automotive

India

Subscribe our newsletter

New Things Will Always Update Regularly