What you’ll do Lead research on Text to Speech models focused on naturalness, expressiveness, latency, and robustness Design and train TTS systems for real world voices across accents, languages, and speaking styles Improve streaming and low latency speech synthesis pipelines…
What you’ll do Lead research on ASR models focused on accuracy, latency, and robustness Design and train speech to text models for noisy, accented, and low resource settings Improve streaming and real time decoding pipelines Experiment with architectures, loss functions,…