Senior Ai Engineer Voice Ai (stt/tts)

Year Coimbatore, Tamil Nadu, India

Apply Now

Job Description

Senior AI Engineer - Voice AI (STT/TTS)
Position Title: Senior AI Engineer - Voice AI (Speech-to-Text / Text-to-Speech)
Reports To: Engineering Lead, Voice AI
Location: Coimbatore (Preferred)
Experience Level: 5+ years
About the Role
We are seeking exceptional Senior AI Engineers to lead the development and optimization of enterprise-grade speech processing capabilities within Convogent AI, our conversational agentic AI voice platform. You will own the end-to-end architecture, performance optimization, and multi-lingual intelligence of our voice AI systems that power human-like autonomous voice agents for regulated industries.
Core Responsibilities

Design and architect production-grade STT/TTS pipelines handling 250+ concurrent voice sessions with sub-100ms latency requirements

Develop and fine-tune multi-lingual LLM models to enable authentic conversations across diverse linguistic landscapes with contextual awareness and natural flow

Integrate and optimize industry-leading platforms (ElevenLabs, Deepgram, and custom solutions) while maintaining flexibility for customer-specific STT/TTS requirements

Implement advanced features including accent variation, human-like prosody, emotion detection, objection handling, and ambient noise management

Drive performance optimization initiatives for inference latency, throughput, and cost efficiency across voice processing stacks

Establish best practices for model evaluation, A/B testing, and continuous improvement of voice quality metrics
Must-Have Qualifications
Deep STT/TTS Expertise: Proven track record building production systems with ElevenLabs, Deepgram, or equivalent platforms; understanding of underlying models and optimization techniques
Multi-lingual LLM Expertise: Expert-level knowledge of LLM architectures (transformers, attention mechanisms, fine-tuning) with demonstrated experience deploying multi-lingual models in production
AI/ML Systems: Strong foundation in ML ops, model optimization (quantization, pruning, distillation), and deployment frameworks (ONNX, TensorRT, vLLM)
Python Proficiency: Advanced Python expertise for model development, optimization, and integration testing
Cloud Infrastructure: Hands-on experience with AWS services (SageMaker, Bedrock, Lambda, EC2) or equivalent platforms
Production Mindset: Experience shipping AI products at scale with focus on latency, reliability, and monitoring
Nice-to-Have Qualifications

Experience with speech synthesis evaluation metrics and voice quality assessment frameworks

Knowledge of conversational AI architecture and dialogue management systems

Contributions to open-source voice AI projects or published research in speech processing Experience with containerization (Docker, Kubernetes) and CI/CD pipelines for ML models

Familiarity with regulated industry requirements (financial services, healthcare) for voice systems

Background in NLP, phonetics, or linguistics

Experience optimizing models for edge deployment or resource-constrained environment

What You'll Work With

Advanced voice processing frameworks and LLM infrastructure

Multi-tenancy cloud platforms deployed on AWS with enterprise-grade monitoring

Cutting-edge models including GPT-4, Claude, and specialized voice models

Collaborative teams building enterprise accelerators for regulated industries

Mentor junior engineers and collaborate cross-functionally with platform, infrastructure, and product teams

Skills Required

Architecture

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.

Job Detail

Job Id

JD4791293
Industry

Not mentioned
Total Positions

1
Job Type:

Full Time
Salary:

Not mentioned
Employment Status

Permanent
Job Location

Coimbatore, Tamil Nadu, India
Education

Not mentioned
Experience

Year

MNC Jobs India

Jobs by Function

Popular Job Skills

Popular Industries

Popular Cities

Jobseekers

Employers