OURS GLOBAL is a leading offshore IT & ITeS outsourcing company delivering innovative solutions across software development, cloud, SaaS, BI, and IT support services. With global delivery capabilities and a strong focus on scalability, transparency, and performance, we empower businesses across industries to accelerate growth and efficiency through technology.
for cloud and edge environments. You will optimize real-time inference services for GenAI models (LLMs and beyond) running on NVIDIA Jetson devices, while collaborating with AI/ML teams to enable scalable hybrid deployments. This role combines cloud orchestration expertise with edge performance optimization, making it critical to our AI-driven solutions strategy.
Key Responsibilities:
Design and implement a Docker-based deployment pipeline for seamless cloud and edge integration.
Optimize and adapt Python/FastAPI inference services for real-time GenAI performance on edge devices.
Build and maintain Kubernetes deployments for hybrid workloads (cloud + NVIDIA Jetson).
Collaborate with AI/ML teams to integrate and deploy inference models to edge environments.
Troubleshoot, optimize, and stabilize inference workloads under constrained edge hardware conditions.
Skills & Qualifications:
10+ years of professional software engineering experience.
Strong proficiency in Python (FastAPI experience preferred).
Proven expertise with Kubernetes and container orchestration at scale.
Hands-on experience with real-time AI inference on embedded GPUs (NVIDIA Jetson or similar).
Solid knowledge of performance optimization and resource constraints in edge environments.
Strong problem-solving ability and collaborative mindset across AI/ML and infrastructure teams.
Preferred Qualifications:
Experience with GPU acceleration frameworks (CUDA, TensorRT).
Familiarity with CI/CD pipelines for containerized deployments.
Knowledge of network optimization for cloud-to-edge communication.
Background in distributed systems and scalable architectures.
What We Offer:
Opportunity to work on cutting-edge AI inference solutions across cloud and edge.
Exposure to hybrid cloud architectures with global clients.
A collaborative and innovation-driven work culture.
Competitive salary packages aligned with industry standards.
Continuous learning opportunities in AI, edge computing, and advanced cloud technologies.
Job Application Details:
Candidates who meet the above requirements may email their resume to careers@oursglobal.com, walk in, or submit their resume online by clicking the "Apply Now" button below.