As a Site Reliability Engineer, you will be responsible for Site Reliability Engineering - Dev for one or more products: understanding, designing, testing, automating, and maintaining quality standards for EXFO's products. You will also be responsible for improving team efficiency and product quality in order to minimize customer-reported issues.
What you'll do
- Build the required setup, using software and hardware, to manage infrastructure and applications
- Hands-on automation is a must to reduce SRE toil
- Good knowledge of monitoring tools such as Prometheus, Grafana, and Kibana, and the ability to build better metrics
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
- Knowledge of service mesh is good to have
- Solve technical problems or issues faced by team members
- Participate actively in agile processes, identify process improvements, and drive them
- Work on the latest and a diverse set of technologies and domains
- Work towards the scrum team's goals, not only individual goals
- Be a motivated team member who thinks about achieving the end goal, not only finishing day-to-day tasks
- Go-getter attitude
What we're looking for
Technical skills
Must have 8+ years of experience.
Must have at least 2-3 years of experience as an SRE.
Must have experience in automation testing; experience with Robot Framework is good to have.
Must have a minimum of 2 years of experience with the Linux operating system and knowledge of shell scripting.
Ability to perform POCs as needed to integrate the tool with the software application
Ability to create / build APIs as needed
Understanding of pipelines and knowledge of building effective pipelines in GitLab / Jenkins
Should have knowledge of programming languages such as Python, Rust, Java, and C++
Good knowledge of Docker and Kubernetes / OpenShift
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
Good to have knowledge of networking and telecom protocols such as TCP, UDP, SNMP, SIP, and VoLTE
Must have knowledge of databases and SQL
Should have worked in an Agile environment.
Preferred:
Knowledge of and experience with a defect management system such as Jira.
Required aptitudes
Strong analytical and problem-solving skills
Must have
Language requirements: English
Education: Candidate should be from an engineering background