Software Engineer Aidp Reliability Engineering

Year    TS, IN, India

Job Description

The Applied Machine Learning team in AI and Data Platform org has been at the forefront of accelerating digital transformation through machine learning across Apple's enterprise ecosystem. We build and operate ML, GenAI, Inference and Data Platforms and Services to provide a comprehensive suite of capabilities-serving business-critical needs across Apple's enterprise. We work on interesting and hard challenges related to scale and performance across diverse set of open source and cutting edge technologies. As part of the Site Reliability Engineering (SRE) team, you will design, develop, and maintain core platform components that support the fraud decisioning and solutioning infrastructure. The role involves developing automation and tooling to improve operational efficiency, and you will collaborate closely with engineering teams to ensure high availability, resiliency, and security of critical systems while monitoring and optimizing production platforms for performance and reliability. We are looking for a talented engineer to join our team and bring passion for building and operating large scale platform and distributed systems leveraging cutting edge open source technologies across hybrid cloud environments!



Description



We are looking for engineers who have strong coding skills and computer science foundation with passion for building resilient and highly performant distributed systems. As a software engineer in AiDP reliability engineering you will work on one or many projects related to GenAI, ML Inference in highly scalable and distributed system. You will:




Build, enhance, and maintain multi-tenant systems employing diverse technologies.


Collaborate with multi-functional teams to deliver impactful customer features.


Lead projects through full lifecycle, from design discussions to release delivery.


Operate, scale, and optimize high-throughput and highly concurrent services.


Diagnose, resolve, and prevent production and operational challenges.


We are looking for enthusiastic engineers passionate about building and maintaining solutioning platform components on cloud and Kubernetes infrastructure. The ideal candidate will go beyond traditional SRE responsibilities by collaborating with stakeholders, understanding the applications hosted on the platform, and designing automation solutions that enhance platform efficiency, reliability, and value.","responsibilities":"The successful candidate will be amenable to working in an exciting, fast paced, dynamic, collaborative environment. The person filling this position must be a hands-on, hardworking, self-motivated developer with strong initiative. You will have a real passion for extraordinary user experiences and an eye for details.



Preferred Qualifications



Excellent analytical & problem solving skills.



Exposure to Machine Learning and GenAI technologies.



Exposure to datasets management and cost optimisation in cloud.



Exposure to Ray and Ray Serve, for building scalable, distributed, and model-serving platform components.



Minimum Qualifications



Bachelor's Degree in Computer Science, Computer Engineering or equivalent technical degree



Proficient programming knowledge in Python or Java and ability to read and explain open source codebase.



Good foundation of Operating Systems, Networking and Security Principles



Exposure to DevOps tools such as Ansible and Terraform, with experience integrating platform components into Kubernetes and AWS Cloud environments.

","internalDetails":null,"eeoContent":null

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4663806
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    TS, IN, India
  • Education
    Not mentioned
  • Experience
    Year