Lead, Sre

Year    Chennai, Tamil Nadu, India

Job Description

Looking for payments professionals across a range of roles for our Payments Platform Business as we plan our expansion into Middle East, Africa & ASEAN markets. Locations in India, UAE & ASEAN .
Send your profiles for outside India to hemaliv@fss.co.in / for opportunities within India send it careers@fss.co.in

FSS is on a transformational journey developing next generation products in the area of payments. The products are being massively re-engineered following a platform, cloud and microservice first approach to attain an Internet scale, resiliency, security and performance. As we prepare ourselves for this mission, we’re aggressively onboarding enthusiastic engineers to be part of this once in a lifetime opportunity. As Lead – SRE, you will be responsible for understanding the requirements from product/engineering teams applying the lens of Site Reliability Engineering and subsequently build tools and solutions that effectively meet production demands of applications/services. As the whole engineering going through the modernization while building next generation software platforms, you will have an opportunity to define standards, processes and tools that are required to build the SRE grounds up. As one of the early engineers/leads, you will play instrumental role in building and shaping the SRE roadmap for FSS.

What you’ll do?

  • Proactively assess the application and systems architecture of various production application services and suggest appropriate level of logging, monitoring and alert management to ensure service reliability, performance and up-time.
  • Evaluate, standardize and set up monitoring, alerting and advanced logging platforms required to monitor various business systems and applications.
  • Proficient in understanding and developing SLIs, SLOs from the SLAs basis the product level SLAs and expected org-level availability SLAs.
  • Ensure reliability and availability by suggesting appropriate HA architectures, reliability techniques such as self-healing, graceful degradation and seamless switchover.
  • In collaboration with developers, ensure the applications and services developed has appropriate metrics instrumented and emitted to establish deeper debuggability, traceability and observability.
  • Set up incident management policies & practises to ensure the failed services/systems are restored in the shortest possible time.
  • Schedule and manage 24x7 production support for production services, prepare runbooks and escalation mechanisms.
  • Setup governance to review the effectiveness of SRE practices and suggest corrective measures to raise the availability/up-time and resiliency of FSS products/services.
  • Mentor/groom junior engineers in effectively using various SRE tools and techniques

What you bring on board?

  • 5-8y work experience as a full-fledged SRE engineer in a production environment, preferably in banking/fintech/payment domains.
  • Proficient in infra and app monitoring by using the tools such as ELK, Grafana, PQL, AWS Cloudwatch and Nagios.
  • Prior experience in supporting production critical applications both in Datacentre and at least on one of the public clouds I.e., AWS, Amazon and GCP is super essential
  • Have been a developer at some point, coded in at least one programming or scripting languages such as Python, Java/Go.
  • Good understanding ofvirtualization and hypervisors including VMWare/ESX, Hyper-V, KVM
  • Proficient in administering and perf turning various flavours of Linux OS and Unix in a production environment.
  • Understand how to use the source code management tools for version control (Bitbucket, Git, GitHub, and GitLab).
  • Proficient with infrastructure as a code using systems/config management tools such as Ansible, Chef and Terraform.
  • In addition to hiring, led/mentored/coached a team of SREs/Systems Engineers for 2-3 years
  • Continuous focus on learning new technologies, architecture concepts, and industry best practices

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD2886891
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Chennai, Tamil Nadu, India
  • Education
    Not mentioned
  • Experience
    Year