Lead research on ASR models focused on accuracy, latency, and robustness
Design and train speech-to-text models for noisy, accented, and low-resource settings
Improve streaming and real-time decoding pipelines
Experiment with architectures, loss functions, and data strategies (augmentation, semi-supervised learning, distillation)
Translate research ideas into production-ready systems
Collaborate closely with infra, product, and voice engineering teams
What we're looking for
Strong background in ASR / speech research
Hands-on experience with deep learning frameworks (PyTorch preferred)
Experience with streaming or low-latency ASR systems
Familiarity with modern ASR architectures (CTC, Transducers, attention-based, hybrid)
Ability to think end-to-end: data model deployment
Prior work in multilingual or accented speech is a strong plus
Great to have
Publications in top speech / ML conferences
Experience deploying models in real-time production systems
Exposure to conversational AI
Years of Experience
3-6 years of specialized experience in speech through academia or industry
Education
Masters or PhD in Speech
-
Note - we often make exceptions and hire brilliant candidates regardless of years of experience or education - proof of work is paramount
Compensation Range: $60K - $100K
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.