Staff Backend Engineer Ai Workspace

Year    KA, IN, India

Job Description

Coupang is reimagining the shopping experience with the goal of wowing each customer from the instant they open the Coupang app to the moment an order is delivered to their door. Powered by an outstanding end-to-end e-commerce and logistics network and a fanatical culture of customer centricity, Coupang has broken tradeoffs around speed, selection and price. Today, we provide exceedingly fast shipping speeds on millions of items including fresh groceries, delivered within hours nationwide, 365 days a year. We are doing this for millions of consumers in Korea. Korea is home to one of the largest and fastest growing e-commerce opportunities anywhere in the world.



Coupang has been added into the 2023 fortune 500 list, a ranking U.S-based companies by revenue. We have been named as one of the '50 Smartest Companies in the World' by MIT Technology Review, and as one of Forbes magazine's '30 Global Game Changers.' In 2020, we placed second on CNBC's 'Disruptor 50' list.


Job Overview:





The AI Workspace - Software Engineering team builds and maintains the foundational infrastructure and developer tools that power the AI lifecycle. Our responsibilities include a robust Resource Manager and a suite of AI development tools such as Workflow Orchestration, Model Insights, Model Registry, and more--enabling customers to seamlessly create, manage, and monitor their AI workloads.



We are looking for an experienced Staff Software Engineer to help architect and scale mission-critical systems that support intelligent workload management, resource optimization, and operational transparency. In this role, you will work across key components of our platform, driving innovation in workload orchestration, model tracking, and resource governance. Your contributions will ensure our customers have a reliable, intuitive, and efficient environment for building and deploying AI solutions at scale.


What You Will Do





As a

Staff Software Engineer

, you will partner with leaders of multiple platform teams. You will work closely with the product team to define and implement simple solutions of complex infrastructure problems while ensuring to build a highly scalable, reliable and efficient platform for our customers. You will technically guide teams working on full stack development teams using JAVA, AWS, Kafka, Kubernetes, Kubeflow, Argo CD and gRPC. You will be accountable for hiring and developing technical leaders and top talent, raising the bar on engineering and operational excellence.



In this role, you will:


Lead the design of scalable, event-driven microservices for optimized resource management. Design and implement the development of a next-generation Resource Manager and AI developer tools, enabling efficient, secure, and policy-compliant data access across hybrid environments. Ensure reliability, observability, and performance of backend systems through rigorous testing and monitoring. Mentor junior engineers and contribute to architectural decisions that shape the future of AI infrastructure. Hands-on develop critical infrastructure components. Decompose complex problems into simple, straightforward solutions, providing mechanisms for the teams to prioritize ruthlessly and "move with urgency". Envision roadmaps for the scalable and robust growth of Workspace infrastructure. Collaborate with Product owner, TPMs, and Compliance teams to define end-to-end ML model development lifecycle experiences. Proactively drive Workspace initiatives and align with stakeholders. Demonstrate excellence resulting in scalable systems and services with the highest quality architecture and design. Dive deep into critical system issues, proactively addressing similar root causes, and raise the bar on Operational Excellence. Collaborate with other Coupang tech leaders to make the service extensible to unlock opportunities for innovations.

Qualifications




Bachelor's degree in computer science or related technical fields. 8+ years professional software development experience One who is fluent in one or more among Java, Go and Python Deep expertise in Java Spring Boot, Hibernate, and RESTful API design. Proven track record of delivering mission critical systems. Experience developing and growing senior individual contributors globally. Experience with cloud computing using AWS or Azure or GCP. Experienced in Machine learning job deployment infrastructure like Argo CD w




Preferred Qualifications




Experience in Kubernetes, gRPC, Spring. Experience in AI model lifecycle. Understanding data governance, compliance, and access control in distributed systems. Experience with Kubernetes, Docker, and container orchestration in production environments. Experience in concurrency, multi-threading, synchronization, and non-blocking IO. Deep understanding of operating system kernel and distributed system such as Kafka, Cassandra, ClickHouse and Mongo DB. Experience integrating with AI/ML frameworks or pipelines (e.g., TensorFlow, PyTorch, ONNX). Experience with observability tools (e.g., Prometheus, Grafana, ELK stack). Familiarity with cloud platforms (AWS, GCP, Azure) and cloud-native development practices. Ability to handle multiple competing priorities in a fast-paced environment and leading the delivery of large-scale services for complex business offerings. * Ability to influence cross functional stakeholders, prioritize ruthlessly, Aim High and Find a Way to deliver results with grit.

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD3876034
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    KA, IN, India
  • Education
    Not mentioned
  • Experience
    Year