We're a cutting-edge technology company building enterprise-grade AI solutions that transform how businesses operate. Our platform leverages the latest in Generative AI to create intelligent applications for document processing, automated decision-making, and knowledge management across industries.
Role Overview
We're seeking an exceptional Gen-AI Tech Lead to architect, build, and scale our next-generation AI-powered enterprise applications. You'll lead the technical strategy for implementing Large Language Models, fine-tuning custom models, and deploying production-ready AI systems that serve millions of users.
Design and implement enterprise-scale Generative AI applications using custom LLMs or (GPT, Claude, Llama, Gemini)
Lead fine-tuning initiatives for domain-specific models and custom use cases
Build and optimize model training pipelines for large-scale data processing
Develop RAG (Retrieval-Augmented Generation) systems with vector databases and semantic search
Implement prompt engineering strategies and automated prompt optimization
Create AI evaluation frameworks and model performance monitoring systems
Enterprise Application Development
Build scalable Python applications integrating multiple AI models and APIs
Develop microservices architecture for AI model serving and orchestration
Implement real-time AI inference systems with sub-second response times
Design fault-tolerant systems with fallback mechanisms and error handling
Create APIs and SDKs for enterprise AI integration
Build AI model version control and A/B testing frameworks
MLOps & Infrastructure
Containerize AI applications using Docker and orchestrate with Kubernetes
Design and implement CI/CD pipelines for ML model deployment
Set up model monitoring, drift detection, and automated retraining systems
Optimize inference performance and cost efficiency in cloud environments
Implement security and compliance measures for enterprise AI applications
Technical Leadership
Lead a team of 3-5 AI engineers and data scientists
Establish best practices for AI development, testing, and deployment
Mentor team members on cutting-edge AI technologies and techniques
Collaborate with product and business teams to translate requirements into AI solutions
Drive technical decision-making for AI architecture and technology stack
Required Skills & Experience Core AI/ML Expertise
Python
: 5+ years of production Python development with AI/ML libraries
LLMs
: Hands-on experience with GPT-4, Claude, Llama 2/3, Gemini, or similar models
Fine-tuning
: Proven experience fine-tuning models using LoRA, QLoRA, or full parameter tuning
Model Training
: Experience training models from scratch or continued pre-training
Frameworks
: Expert-level knowledge of PyTorch, TensorFlow, Hugging Face Transformers
Vector Databases
: Experience with Pinecone, Weaviate, ChromaDB, or Qdrant
Technical StackAI/ML Stack
Models
: OpenAI GPT, Anthropic Claude, Meta Llama, Google Gemini
Frameworks
: PyTorch, Hugging Face Transformers, LangChain, LlamaIndex
Training
: Distributed training with DeepSpeed, Accelerate, or Fairscale
Serving
: vLLM, TensorRT-LLM, or Triton Inference Server
Vector Search
: Pinecone, Weaviate, FAISS, Elasticsearch
Infrastructure & DevOps
Containerization
: Docker, Kubernetes, Helm charts
Cloud
: AWS (ECS, EKS, Lambda, SageMaker), GCP Vertex AI
Databases
: PostgreSQL, MongoDB, Redis, Neo4j
Monitoring
: Prometheus, Grafana, DataDog, MLflow
CI/CD
: GitHub Actions, Jenkins, ArgoCD
Professional Growth
Work directly with founders and C-level executives
Opportunity to publish research and speak at AI conferences
Access to latest AI models and cutting-edge research
Mentorship from industry experts and AI researchers
Budget for attending top AI conferences (NeurIPS, ICML, ICLR)
Ideal Candidate Profile
Passionate about pushing the boundaries of AI technology
Strong engineering mindset with focus on production systems
Experience shipping AI products used by thousands of users
Stays current with latest AI research and implements cutting-edge techniques
Excellent problem-solving skills and ability to work under ambiguity
Leadership experience in fast-paced, high-growth environments
Apply now and help us democratize AI for enterprise customers worldwide.
Job Type: Full-time
Pay: ?900,000.00 - ?1,600,000.00 per year
Schedule:
Monday to Friday
Supplemental Pay:
* Performance bonus
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.