Senior Site Reliability Engineer With Python

Year    Hyderabad, Telangana, India

Job Description

SRE DevOps EngineerExperience Level: Senior/ LeadAbout NomiSo India: Nomiso is a product and services engineering company. We are a team of Software Engineers, Architects, Managers, and Cloud Experts with expertise in Technology and Delivery Management.Our mission is to Empower and Enhance the lives of our customers, through efficient solutions for their complex business problems.At Nomiso we encourage entrepreneurial spirit - to learn, grow and improve. A great workplace, thrives on ideas and opportunities. That is a part of our DNA. We're in pursuit of colleagues who share similar passions, are nimble and thrive when challenged. We offer a positive, stimulating and fun environment - with opportunities to grow, a fast-paced approach to innovation, and a place where your views are valued and encouraged.We invite you to push your boundaries and join us in fulfilling your career aspirations!Position Overview:

  • Tools Coverage - Assess the tools coverage and ensure sufficient monitoring is in place to enable mature observability and data driven decision making
  • Defining and educating Engineering teams - Process, Procedures, Guide Rails and best practices
  • Culture - Inculcate the culture of high performing teams and adopt the ways of working with the influence of SRE
The role will need to work with a global team responsible for a mission critical business function, and will partner with Infrastructure, DevOps and Core practices (like Security, Identity, ProdOps, Cloud platform and Tools) teams to identify and implement automation opportunities to drive down toil, reduce technical debt and improve system reliability.Roles and Responsibilities: The Site Reliability Engineer (SRE) will be responsible for both uplifting and maintaining our evolving technology platforms, infrastructure and technology controls. As an
SRE, the role will include both oversight for production operations of our systems, as well as development/engineering of solutions to maximise system reliability and automation.
  • Own the Infrastructure, APM and work with DevOps teams to Build, Release, Monitor and run the services to improve service reliably.
  • Write software to automate API-driven tasks at scale and contribute to the product codebase in Python , Go , java, JS, React, Node.
  • Work with Ansible, Puppet, Chef, Terraform or another configuration management / orchestration suite, know where it's broken, work towards fixing them and explore new alternatives.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system reliability.
  • Performance and maturity base lining of DevOps process, tools maturity & coverage, metrics, technology and engineering practices.
  • Work closely with Engineering, QA, Operations teams in optimally delivering large scalesystems using CI/CD pipelines.
  • Evaluate technology options and define the build, delivery, and deployment pipeline for applications.
  • Understand, Define, Measure and improve Reliability Metrics (SLO/SLI), Observability
(Monitoring, Logging-Tracing solutions), Ops process (Incident, Problem Mgmt) and streamline - automate release management.
  • Strong believer of automation to bring in sustained continuous improvement by automating.
Toil, Runbooks, Improving ability of the applications to auto heal leading to improved reliability.
  • Should have supported Production Incidents (PIs) on critical applications of a company.
Troubleshoot,debug, and diagnose operational issues and drive them to closure.
  • Be a subject matter expert, able to upskill / cross skill engineering teams on SRE principles, tools and execution.
Must Have Skills:
  • 6+ years of experience in Software development
  • Application Performance Monitoring (APM) tool New Relic or with relevant tools for monitoring, logging, tracing.
  • 3+Years of Experience with designing and developing testing utilities using Python or similar for integrating test automation into the DevOps pipeline.
  • CI/CD Integration.
  • Containerization - Kubernetes, Docker, Rancher, etc
  • 3+ Years of Experience with version control systems, Git or similar.
  • Strong hands-on coding experience in one or more of programming languages such as Python.
  • Expert level hands on knowledge in public cloud platform AWS and/or Google CloudPlatform.
  • Professional level certificate on one of the public clouds is highly desirable.
  • Proven experience in handling large scale and growing infrastructure across Data Centres and heterogeneous Cloud platforms.
  • Understanding of software delivery life cycles, particularly Agile/Lean & DevOps
Good to Have Skills:
  • *Familiarity with handling: o Kafka, Yarn, ElasticSearch etc. o Source code management and Implementation of Security best practices. o Tech Stack - Python, Falcon, Elastic Search, MongoDB, AWS (SQS S3), Map
Reduce.
  • Networking knowledge
  • Qualification: Master's or Bachelor's degree in Computer Science Engineering, or a related technical degree
Location: Hyderabad / BangaloreJob Type: Full-timeSalary: ?2,000,000.00 - ?3,300,000.00 per yearBenefits:
  • Health insurance
Schedule:
  • Day shift
  • Morning shift
Supplemental pay types:
  • Yearly bonus
Ability to commute/relocate:
  • Hyderabad - 500033, Telangana: Reliably commute or planning to relocate before starting work (Preferred)
Experience:
  • SRE: 1 year (Required)
  • DevOps: 3 years (Required)
  • Python coding: 3 years (Required)
  • AWS: 1 year (Required)

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD2867145
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Hyderabad, Telangana, India
  • Education
    Not mentioned
  • Experience
    Year