Sre Architect

Year    Mumbai, Maharashtra, India

Job Description

job details

10+ Years of IT Experience with minimum 5 years in Site Reliability EngineeringAbility to drive and work with various stakeholders to define SLO, SLIs and error budgetExperience in Architecture principles for performance and resiliency of IT systemHas worked as SRE engineer for atleast one greenfield implementation from requirement gathering to run phase for a microservice based application (on cloud or on-prim)Exposure to performance engineering and Chaos engineeringExperience working in IT operations - Application MaintenanceAble to architect Observability & Alerting solution for production systems (apm tools, infra tools, network tools, storage tools & Alerting tools like Pager Duty, Opsgenie, Splunk on-call etc...)Experience in implementing Open Telemetry and Open Tracing framework and tools like Prometheus, Jaeger, Grafanaknowledge on DevOps/ ITIL / AIOps process (Continuous Delivery, release management, incident, problem, SCM, CMDB). Must be able to define and implement automation use cases Experience working on ITSM & ITOM tools (e.g. Service Now / Remedy). Able to do integration with observability tools, configure CMDB, Auto ticketing and workflow automationProficient in atleast two scripting language e.g. Shell scripting, python, R, java, PowerShellCan develop process automation solution for IT operations using various automation tools like Ansible, puppet, run deck, Saltstack, terraformfamiliar with IT Infrastructure (modern architecture - e.g containerization, API, virtualization)Experience in IT system architecture (e.g. microservices, 3 Tier Application, Java, dot net, php)Has good understanding of RDBMS and No-SQL Database architecture (oracle, mysql, SQL server, Elasticsearch, mongo dB, Splunk). Able to write SQL Queries. Able to architect log mining and analytics solution (ELK, Splunk). Must be able to build data pipeline with exposure to Kafka.Knowledge on AI / ML for Anomaly detection, Predictive analytics and intelligent automationCollaborate with operations & engineering teams, application developers, management and infrastructure teams to assess system reliability and provide solutions.Knowledge of cloud native and open sources tools for monitoring and automation (Cloud Watch, Terraform, App Insight etc...) ...

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD2954223
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Mumbai, Maharashtra, India
  • Education
    Not mentioned
  • Experience
    Year