Job Description
Responsibilities:
Work with the customer's Development, DevSecOps, and IT teams to ensure operational excellence and maximize the reliability and availability of client systems.
Collaborate with cross-functional teams (DevSecOps, Development, IT) to implement SRE principles throughout the software development life cycle.
Establish and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for critical services, monitoring and maintaining performance against defined targets.
Implement and enhance observability, alerting, and incident response processes to proactively address issues and minimize downtime.
Architect and design highly scalable and available infrastructure solutions, integrating best practices in reliability engineering and automation.
Drive continuous improvement initiatives by identifying bottlenecks and optimizing the infrastructure and application stack.
Develop and maintain documentation related to system architecture, configuration, and procedures.
Stay current with industry trends, recommending and adopting new tools and practices to enhance system reliability.
Qualifications:
Must-Have Skills
Solid understanding of SRE principles and practices.
Strong understanding of full-stack observability, with hands-on experience using Datadog.
Strong background in managing highly available and scalable infrastructure.
Experience with container orchestration platforms, serverless architectures, CI/CD pipelines (Azure DevOps, GitHub Actions), and Infrastructure as Code (IaC) implementations (Ansible and Terraform/Pulumi).
Hands-on experience working with Amazon EKS.
Good-to-Have Skills
Hands-on experience working with Amazon Web Services (ECS, Lambda, EC2, API Gateway, CloudFront, SQS, SNS, etc.).
Excellent problem-solving skills with the ability to troubleshoot complex issues in production environments.
Proficiency in scripting and automation using Python, TypeScript, or Shell.
Relevant certifications in SRE, DevOps, Cloud, etc., are a plus.
Strong communication and leadership skills, fostering effective collaboration with cross-functional teams.