Senior Site Reliability Engineer, K8s

Year Navi Mumbai, Maharashtra, India

Apply Now

Job Description

Description

Position at WebMD

About WebMD:

WebMD Health Corp., an Internet Brands Company, is the leading provider of health information services, serving patients, physicians, health care professionals, employers, and health plans through our public and private online portals, mobile platforms, and health-focused publications. The WebMD Health Network includes WebMD Health, Medscape, Jobson Healthcare Information, prIME Oncology, MediQuality, Frontline, QxMD, Vitals Consumer Services, MedicineNet, eMedicineHealth, RxList, OnHealth, Medscape Education, and other owned WebMD sites. WebMD\xc2\xae, Medscape\xc2\xae, CME Circle\xc2\xae, Medpulse\xc2\xae, eMedicine\xc2\xae, MedicineNet\xc2\xae, theheart.org\xc2\xae, and RxList\xc2\xae are among the trademarks of WebMD Health Corp. or its subsidiaries.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.

For Company details, visit our website:

Education: B.E. Computer Science/IT degree (or any other engineering discipline)

Experience: 10+ years

Shift timings: 7:30pm-4:30am PM IST (9am-6pm EST)

About PulsePoint:

is a fast-growing healthcare technology company (with adtech roots) using real-time data to transform healthcare. We help brands and agencies interpret the hard-to-read signals across the health journey and unify these digital determinants of health with real-world data to produce the most dimensional view of the customer. Our award-winning advertising platforms use machine learning and programmatic automation to seamlessly activate this data, making marketing, predictive analytics, and decision support easy and instantaneous.

Sr. SRE, K8s:

As a part of the SRE team (working REMOTELY) you will be challenged, expected to grow your technical knowledge, challenge your fellow team members, and they will challenge you back. Our team is not competitive, but we are goal oriented and driven to succeed.

What you\'ll be doing:

Ensure reliability and scalability of our multi datacenter and hybrid Linux environments.

Managing the large-scale Linux infrastructure to ensure maximum uptime.

Performance and reliability testing. This may include reviewing configuration, software choices/versions, hardware specs, etc.

Advancing our technology stack with innovative ideas and new creative solutions.

Participating in capacity management of core systems and services, application analysis and performance and security tuning. Provide operational support of systems and build automation to remediate and address the root cause; with the goal of automating response to all non-exceptional service conditions.

Create strategies for long term permanent fixes to critical production incidents.

Maintain documentation, build tooling, and create alerts to both identify and address infrastructure reliability.

Proactively identify system anomalies.

Who are you:

You will be required to work East Coast U.S. hours 9am-6pm EST

10+ years of experience

Immaculate knowledge of best practices for architecting cross-datacenter Kubernetes clusters running on-premise with automated etcd management using kubeadm

Profound knowledge of docker (docker-shim), containerd and runc internals at the kernel level

Ability to manually troubleshoot and solve certificate issues within kubernetes with zero downtime

Vast experience in development of custom kubernetes operators and autoscalers, as well as tailored ingress/egress controllers

Numerous successful major version upgrades of elasticsearch and fluentd in the past are a must, as well as kubedb operator expertise

Fluency in gitops automation tools (flux v1/v2), comprehensive knowledge of helm customize controller

In-depth understanding of kubernetes security ACLs, exhaustive previous exposure to RBAC configuration and complete knowledge of DEX

Secret management is essential to succeed in this role, vault expertise is also required

Ability to manage BGP configuration, mastery in kube-router and gobgp, as well as MetalLB

Expert-level skills in KubeDNS and CoreDNS

Understanding of the most intricate details in rook/ceph implementation for kubernetes

Thorough understanding of RPM based Linux systems.

Experience administering SQL/NoSQL databases (MySQL, ES, Redis, Cassandra).

Experience with scalable infrastructure monitoring solutions such as Icinga, Prometheus, ELK.

Any scripting language (Python/Ruby/Shell etc).

Understanding of basic networking concepts ( TCP/IP stack, DNS, CDN, load balancing, BGP).

Ability to resolve complex merge conflicts in git is an obvious requirement

WebMD and its affiliates is an Equal Opportunity/Affirmative Action employer and does not discriminate on the basis of race, ancestry, color, religion, sex, gender, age, marital status, sexual orientation, gender identity, national origin, medical condition, disability, veterans status, or any other basis protected by law.

WebMD

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.

Related Jobs

Senior Site Reliability Engineer GBS IND

Bank of America

Hyderabad, Telangana

Apply Now
Senior Site Reliability Engineer

RELX

Mumbai, Maharashtra

Apply Now

Senior Site Reliability Engineer Remote

Akamai

India

Apply Now
Senior Site Reliability Engineer

Fairmatic Services, Inc.

Remote

Apply Now

Job Detail

Job Id

JD3204977
Industry

Not mentioned
Total Positions

1
Job Type:

Full Time
Salary:

Not mentioned
Employment Status

Permanent
Job Location

Navi Mumbai, Maharashtra, India
Education

Not mentioned
Experience

Year

Jobs by Function

Popular Job Skills

Popular Industries

Popular Cities

Jobseekers

Employers

Senior Site Reliability Engineer, K8s

Job Description

Related Jobs

Senior Site Reliability Engineer GBS IND

Senior Site Reliability Engineer

Senior Site Reliability Engineer Remote