Aml Software Site Reliability Engineer

Year    KA, IN, India

Job Description

Collection of our people and their ideas encourage innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. At Apple, new ideas have a way of becoming phenomenal products, services, and customer experiences very quickly. Every single day, people do amazing things at Apple. Do you want to be part of a team that builds cutting edge software service, a team that is continually innovating and is proud of making a difference? If so, bring your passion and talent and come join us to be part of something big and amazing. Apple's AML (Applied Machine Learning) team is looking for highly motivated and dedicated BigData Software Site Reliability Engineers to build and operate the next generation of platform, frameworks and software services that powers several critically important applications.




Description





We work on Apple scale opportunities and challenges. We are engineers at heart. We like solving technical problems. We believe that a good SRE must be a good software engineer and can code anything which has a logic and pattern to it. We believe a good engineer has the curiosity to dig into inner workings of technology and is always experimenting, reading and in constant learning mode. If you are a software engineer with passion to code and dig deeper into any technology, love knowing the internals, fascinated by distributed systems architecture, we want to hear from you. The person should be capable of exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner. The person will have to participate in 12x7 on-call rotation and provide incident resolution for the production issues in timely manner. The person should be able to understand complex architectures and be comfortable working with different teams




Minimum Qualifications



Experience: 3+ years in software site reliability engineering or software development roles. Programming: Proficient in at least one of Python, Golang, or Java. Data Structures & Algorithms: Strong foundation and application experience. Distributed Systems: Solid understanding and hands-on experience managing at least one distributed system (e.g., Kafka, Cassandra, Hadoop, Redis, or similar). Kubernetes: Expertise in Kubernetes ecosystem (deployment, configuration, monitoring, and operation). Cloud Platforms: Hands-on experience with at least one major cloud platform (AWS, Azure, or Google Cloud Platform). KEY RESPONSIBILITIES Design, develop, and automate: Build tools, frameworks and solutions to improve reliability, scalability, and efficiency across systems. Monitor and maintain: Implement advanced monitoring and alerting for cloud and containerized workloads. Troubleshoot and solve: Respond to and resolve complex production incidents, and perform root cause analysis. Collaborate: Work closely with development and operations teams to integrate reliability best practices throughout the software lifecycle. Optimize: Proactively recommend improvements in architecture, deployment, and operations for distributed systems.


Preferred Qualifications



Problem Solving: Demonstrated ability to independently troubleshoot and resolve complex technical issues. Creative Thinking: A track record of proposing and implementing innovative solutions to technical challenges. Strong communication and collaboration abilities. Willingness to learn and adapt to new technologies rapidly. Ownership mindset and accountability for deliverables.





Submit CV

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4018877
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    KA, IN, India
  • Education
    Not mentioned
  • Experience
    Year