1. The role requires architectural judgment, hands-on AI/LLM expertise, and the ability to define best practices and performance benchmarks for scalable, low-latency AI products.
2. You will lead the design and development of distributed, provider-agnostic AI architectures (multi-provider LLMs, STT/TTS chains, microservices) with performance guarantees including low speech-to-speech latency, resilient failover, distributed scaling, load balancing, and cost-efficient inference paths. You will define architectural "dos and don'ts" for GenAI systems: prompt design and safety patterns, context shaping, fallback logic, caching, and real-time agent orchestration.
3. At least 2.5 years of experience with LLMs, RAG, and predictive/prescriptive analytics, including machine learning algorithms (supervised and unsupervised) such as regression, classification, and ensemble models, and deep learning architectures such as RNN, LSTM, and GRU.
4. You will lead AI model governance, including evaluation and selection frameworks for multiple LLM providers, routing logic, benchmarking, prompt-safety controls, context-window optimization, token efficiency, and overall cost management.
5. Proficiency in LangChain and open LLM frameworks for summarization, classification, named entity recognition, and question answering.
6. Proficiency in generative AI techniques, including prompt engineering and vector databases, and with LLM platforms and tooling such as OpenAI, Azure OpenAI, LlamaIndex, and open-source LLMs.
7. Hands-on experience in GenAI technology areas, including RAG architecture, fine-tuning techniques, and inference frameworks.
8. Familiarity with AI technologies and frameworks.
9. Strong proficiency in programming languages such as Python and SQL, with more than 10 years of total experience.
Job Types: Full-time, Permanent
Pay: From ₹4,200,000.00 per year
Benefits:
Cell phone reimbursement
Commuter assistance
Flexible schedule
Food provided
Health insurance
Internet reimbursement
Leave encashment
Life insurance
Paid sick time
Paid time off
Provident Fund
Work from home
Experience:
Python: 5 years (Required)
Total work: 10 years (Required)
Work Location: Hybrid remote in Pune, Maharashtra