Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.
The Team
Stability Metrics and Toolkit Engineering is an infrastructure operations and development team that helps to fill many of the grey areas between our stakeholders. Our team is responsible for developing tools to facilitate visibility into our availability and related metrics.
We strive to deliver impactful, accurate, and valuable tools, monitoring, internal analytics services, and products. We also serve as the front-line touchpoint for live Production incident triage, analysis, and remediation.
The ideal candidate will use their engineering expertise to help build solutions to novel problems in front-end and back-end software development, data engineering, and anomaly detection.
Job Responsibilities
Research, design, and implement operational monitoring instruments and enhancements, including early failure detection using machine learning metrics
Perform routine operations to migrate data in order to distribute usage of our resources more evenly across clients and infrastructure
Develop tools and web applications to support new and existing business functions
Analyze application logs and metrics to determine service availability and uptime while developing automation
Interface with stakeholders to keep them informed of availability trends associated with business-critical functions
Maintain existing synthetic monitoring to ensure parity as new features are developed