Senior Ai Engineer Voice Ai (stt/tts)

Year    Coimbatore, Tamil Nadu, India

Job Description

Senior AI Engineer - Voice AI (STT/TTS)
Position Title: Senior AI Engineer - Voice AI (Speech-to-Text / Text-to-Speech)
Reports To: Engineering Lead, Voice AI
Location: Coimbatore (Preferred)
Experience Level: 5+ years
About the Role
We are seeking exceptional Senior AI Engineers to lead the development and optimization of enterprise-grade speech processing capabilities within Convogent AI, our conversational agentic AI voice platform. You will own the end-to-end architecture, performance optimization, and multi-lingual intelligence of our voice AI systems that power human-like autonomous voice agents for regulated industries.
Core Responsibilities

  • Design and architect production-grade STT/TTS pipelines handling 250+ concurrent voice sessions with sub-100ms latency requirements
  • Develop and fine-tune multi-lingual LLM models to enable authentic conversations across diverse linguistic landscapes with contextual awareness and natural flow
  • Integrate and optimize industry-leading platforms (ElevenLabs, Deepgram, and custom solutions) while maintaining flexibility for customer-specific STT/TTS requirements
  • Implement advanced features including accent variation, human-like prosody, emotion detection, objection handling, and ambient noise management
  • Drive performance optimization initiatives for inference latency, throughput, and cost efficiency across voice processing stacks
Establish best practices for model evaluation, A/B testing, and continuous improvement of voice quality metrics
Must-Have Qualifications
Deep STT/TTS Expertise: Proven track record building production systems with ElevenLabs, Deepgram, or equivalent platforms; understanding of underlying models and optimization techniques
Multi-lingual LLM Expertise: Expert-level knowledge of LLM architectures (transformers, attention mechanisms, fine-tuning) with demonstrated experience deploying multi-lingual models in production
AI/ML Systems: Strong foundation in ML ops, model optimization (quantization, pruning, distillation), and deployment frameworks (ONNX, TensorRT, vLLM)
Python Proficiency: Advanced Python expertise for model development, optimization, and integration testing
Cloud Infrastructure: Hands-on experience with AWS services (SageMaker, Bedrock, Lambda, EC2) or equivalent platforms
Production Mindset: Experience shipping AI products at scale with focus on latency, reliability, and monitoring
Nice-to-Have Qualifications
  • Experience with speech synthesis evaluation metrics and voice quality assessment frameworks
  • Knowledge of conversational AI architecture and dialogue management systems
  • Contributions to open-source voice AI projects or published research in speech processing Experience with containerization (Docker, Kubernetes) and CI/CD pipelines for ML models
  • Familiarity with regulated industry requirements (financial services, healthcare) for voice systems
  • Background in NLP, phonetics, or linguistics
  • Experience optimizing models for edge deployment or resource-constrained environment
What You'll Work With
  • Advanced voice processing frameworks and LLM infrastructure
  • Multi-tenancy cloud platforms deployed on AWS with enterprise-grade monitoring
  • Cutting-edge models including GPT-4, Claude, and specialized voice models
  • Collaborative teams building enterprise accelerators for regulated industries
  • Mentor junior engineers and collaborate cross-functionally with platform, infrastructure, and product teams

Skills Required

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4791293
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Coimbatore, Tamil Nadu, India
  • Education
    Not mentioned
  • Experience
    Year