At Avenga, we believe that human creativity empowers technology that matters. Operating globally, our 6000+ specialists provide a full spectrum of services, including business and tech advisory, enterprise solutions, CX, UX and Ul design, managed services, product development, and software development.
This is the job
In
Bangalore
we are seeking a Senior Site Reliability Engineer to join an international team at a leading cloud security company. You'll help ensure the stability, availability, and performance of production systems while contributing to monitoring, incident response, and operational best practices. This role bridges Operations, Engineering, and Product Management, driving product improvements and uptime. It's a great opportunity to work on cutting-edge cloud security solutions, protecting data and users from evolving digital threats.
The role follows a rotating schedule --1 week aligned with Pacific Time business hours, followed by 3 weeks aligned with Bangalore Time business hours.
The position includes an on-call rotation one week per month, requiring daily availability of 12 hours for 7 days, outside of business hours.
This is you
Bachelor's degree in computer science, electrical engineering or a related area, with 7+ years of SRE experience in a large enterprise organization
System admin experience on Linux environments.
Experience with Prometheus, Grafana, ELK, Opensearch, Cloudwatch, PagerDuty
Solid experience with Cloud Technologies such as AWS and OCI.
Good experience with containerized workloads tools like Kubernetes.
Experience understanding and managing web servers (Apache, Tomcat, Nginx)
Experience with any configuration management tools like Salt or Puppet or Ansible
Experience with source control tools such as Github and SVN.
Experience with deployment tools Jenkins, Harness etc.
Experience with SQL and NoSQL databases like Redis, CouchBase, Cassandra, Crate, Elasticsearch.
Experience in performing and writing Root Cause Analysis documents
This is your role
Perform Incident Management and Change Management to maintain the continuous availability of all Cloud Infrastructure services.
Maintain a 24x7 production environment with a high level of service availability and perform quality reviews, manage operational issues.
Perform problem management by analyzing metrics, alarms and dashboards to troubleshoot problem areas, report issues to assist in performance tuning and fault finding.
Explore and innovate new technologies, features, and tools to improve the platform and automate operational tasks using Bash, Python or any other programming language.
Manage and maintain Runbooks and Standard Operating procedures
Manage, coordinate, and document all types of maintenance activities and outages.
Perform patching and upgrades for vulnerability management.
We take pride in the diverse skills and character of our teams, welcoming everyone to apply and contribute to our collective strength.
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.
Job Detail
Job Id
JD3711302
Industry
Not mentioned
Total Positions
1
Job Type:
Full Time
Salary:
Not mentioned
Employment Status
Permanent
Job Location
KA, IN, India
Education
Not mentioned
Experience
Year
Apply For This Job
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.