Principal, Site Reliability Engineering

Bengaluru, Karnataka, India

Job Description


Software AG has been a data pioneer from the beginning. Our product offerings empower our customers to turn their data into value. We have been delivering customer-centric innovation to thousands of market-leading organizations worldwide for over 50 years. Today, Software AG employs more than 5,000 people in 70+ countries and had total revenues of €890 million in 2019.

The Role:
Solve problems relating to mission-critical services and build automation to prevent problem recurrence, with the goal of automating the response to all non-exceptional service conditions. You have deep expertise in analyzing complex systems, anticipating problems, and finding ways to mitigate risk. You will apply your knowledge of SRE processes to maximize the availability, reliability, security, and performance of Software AG cloud services.
You will be an integral part of the global Cloud Engineering Operations team for our webMethods.io iPaaS platform, which is responsible for providing Continuous Cloud Operations and Continuous Cloud Innovation for the webMethods.io product portfolio.
Responsibilities:


  • Design, write and deliver software to improve the availability, scalability, latency, and efficiency of webMethods.io iPaaS cloud services

  • Bring expertise in observability platforms (application telemetry, tracing, and log aggregation)

  • Influence and create new designs, architectures, standards and methods for large-scale distributed systems

  • Collaborate with a world-class engineering team to propose features that solve recurring patterns of customer complaints

  • Engage in service capacity planning and demand forecasting, software performance analysis and system tuning

  • Participate in the on-call rotation; participate, collaborate, and provide guidance in retrospectives

  • Find scalability bottlenecks and areas for performance improvements

  • Apply deep technical knowledge of cloud infrastructure, operations, support, networking, systems, IaC, automated deployments, cloud platforms, and DevOps

  • Identify and implement automation opportunities to cut repetitive work, reduce technical debt, and improve system reliability

Qualifications
Requirements:

  • Bachelor’s degree in software engineering, computer science, computer engineering, or related technical field

  • Experience in Cloud Software Engineering, Cloud Site Reliability Engineering, and Cloud Operations

  • Experience with Amazon Web Services and/or any other public cloud

  • Experience with Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) technology stacks.

  • Experience with containers and HA clusters; experience with Docker and Amazon ECS/Kubernetes is mandatory

  • Good knowledge of virtualization technologies and container technologies

  • Firm grasp of at least one modern programming language (Java/Go/Python/Ruby), beyond basic scripting (Shell, Perl, Bash)

  • Solid experience using configuration management frameworks (e.g. Ansible/Chef/Puppet)

  • Manage capacity, build security into every layer, and reduce cost

  • Implement secure networking, key management, user management, access management, process management, and image management.

  • Maintain services once they are live by measuring and monitoring availability, latency and overall system reliability.

  • Proven experience in handling large infrastructure and distributed systems such as Kafka and Elasticsearch

  • Release software through tooling (git, Jenkins, custom scripts, Helm, Docker)

Additional Information

  • Familiar with Cloud Availability Patterns (SLI, SLO, SLA, etc.)

  • Expertise in designing, analyzing and troubleshooting large-scale distributed systems

  • Experience with Unix/Linux OS internals and administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN)

  • Basic understanding of tools such as Jira and Splunk

  • Experience with algorithms, data structures, complexity analysis and software design.

  • Familiar with AIOps tools

What You Can Expect
An opportunity to join a world-class team working in an exciting and rapidly growing domain.

  • Drive the strategy of cloud operations for a large enterprise organization on a transformational journey

  • A competitive compensation package including a performance-driven bonus.

  • A generous benefits package including pension and comprehensive medical insurance.

  • A great working environment with R&D.

  • Opportunities to travel to Software AG’s offices around the world.

Job Detail

  • Job Id
    JD2897457
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Bengaluru, Karnataka, India
  • Education
    Not mentioned
  • Experience
    Year