Site Reliability Developer

Year    IN, India

Job Description

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.


In this role you will need to:

  • Take ownership of the implementation and production operations of a wide array of core system platform solutions
  • React to production deficiencies by continuously implementing automation, self-learning, and real-time monitoring to production systems
  • Be a strong contributor to development of platform services including architecture, provisioning, configuration, deployment, and support
  • Partner with the distributed team in prototyping new database platform services
  • Stay informed of cloud infrastructure stacks
  • Innovate



  • Work with the Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.
  • Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
  • Take responsibility for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance.
  • Hold authority for end-to-end performance and operability.
  • Partner with development teams in defining and implementing improvements in service architecture.
  • Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
  • Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
  • Demonstrate a clear understanding of automation and orchestration principles.
  • Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
  • Use a deep understanding of service topology and dependencies to troubleshoot issues and define mitigations.
  • Understand and explain the effect of product architecture decisions on distributed systems.
  • Bring professional curiosity and a desire to develop a deep understanding of services and technologies.


Preferred Qualifications:

  • Degree level: BE/BS/MS
  • Programming languages such as Python and Bash; technical skills in cloud platforms, Chef, Grafana, and Terraform
  • Fair knowledge and experience of Oracle Engineered Systems and subsystems
  • Ability to troubleshoot and resolve hardware/software issues, restore environments to an operational state, perform root cause analysis, and provide forward-thinking mitigation strategies
  • Fair understanding of, and implementation and troubleshooting experience with, Oracle Database technology, including RAC, Data Guard, ASM, RMAN, etc.
  • Demonstrated operations experience with the Linux platform (e.g., RHEL, OEL), including administration, management, and troubleshooting
  • Strong communication and analytical skills
  • Familiarity with security practices in web application delivery and general knowledge of network topology
  • Experience with configuration management tools



Job Detail

  • Job Id
    JD4615106
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    IN, India
  • Education
    Not mentioned
  • Experience
    Year