1) Job Role and Responsibilities
Design, train, and fine-tune deep learning models for speech synthesis, facial animation,
image/video generation, and multimodal AI.
Develop end-to-end AI pipelines covering data preprocessing, training, evaluation, and
deployment.
Work with large-scale multimedia datasets (video, audio, and text) and optimize models for
real-time GPU inference.
Integrate AI models with backend systems using APIs or microservices.
Research and implement state-of-the-art algorithms in Generative AI, Computer Vision, and
Speech AI.
Collaborate with backend, frontend, and product teams for seamless model integration.
Maintain documentation, experiment logs, and reproducible workflows.
2) Technical Skills Required
Programming Languages: Python (primary), C++ (optional)
Frameworks: PyTorch, TensorFlow, Keras
Model Architectures: CNNs, RNNs, Transformers, GANs, VAEs, Diffusion Models
Libraries: OpenCV, NumPy, Pandas, Scikit-learn, Librosa, Dlib, MediaPipe
Domains (hands-on in any one): o Computer Vision (object detection, face modeling, video
synthesis) o Speech AI (TTS, ASR, voice cloning) o Generative AI (GANs, diffusion models)
DevOps & Deployment: Docker, Git, Linux, TorchServe, FastAPI / Flask, CUDA optimization
Video/Audio Tools: FFmpeg, MoviePy, Torchaudio
Bonus Skills: Multimodal AI, emotion/gesture modeling, ONNX Runtime, TensorRT, model
quantization, AWS/GCP/Azure cloud training
Soft Skills: Analytical thinking, collaboration, communication, curiosity for emerging AI
technologies
3) Eligibility
Experience: 2-3 years as an AI / ML Developer (professional experience required).
Education: Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Data
Science, or a related field.
Work Setup: Offline (Work from Office).
Job Type: Full-time
Pay: ?15,000.00 - ?20,000.00 per month
Work Location: In person
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.