Sr Engineer, Site Reliability

Year Hyderabad, Telangana, India

Apply Now

Job Description

About TMUS Global Solutions
T-Mobile is Americas supercharged Un-carrier, challenging conventions and setting new standards in wireless. With the nations largest and fastest 5G network, T-Mobile delivers advanced connectivity and unmatched value to millions across the U.S. Were unwaveringly obsessed with providing the best possible service experience, driven by a spirit of disruption that fuels competition and innovation in wireless and beyond.
TMUS India Private Limited is a subsidiary of T-Mobile US, Inc. and operates as TMUS Global Solutions.

About the Role
The Senior Systems Reliability Engineer (SRE) ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise to build and maintain highly available, scalable systems. As a leader in DevOps and cloud reliability practices, the engineer supports continuous improvement of automation, deployment pipelines, observability, and incident management, while mentoring junior engineers and optimizing production workflows.
The position plays a critical part in enabling software to be delivered faster, better, and more reliably to support business and customer needs.
What Youll Do

Design and maintain CI/CD pipelines and DevOps automation solutions
Guide incident response and improve system resiliency and performance
Build monitoring tools, dashboards, and proactive alerting for non-production environments
Create and maintain infrastructure as code (IaC) for scalable environments
Work with containerization and microservices in cloud-native platforms
Mentor junior engineers and collaborate across teams on cloud and DevOps initiatives
Improve software delivery processes through automation, cloud migration, and service orchestration
Perform other duties and technical projects as assigned

What Youll Bring

Bachelors degree in Computer Science, Software Engineering, or related field (Masters preferred)
47 years of experience in systems reliability, DevOps, or cloud infrastructure engineering
Experience with CI/CD tools like Jenkins, GitLab CI, or CloudBees
Familiarity with infrastructure and configuration management tools (Ansible, Chef, Puppet)
Hands-on knowledge of public and private cloud platforms
Experience with application performance monitoring (APM) and log aggregation tools
Proven experience working in Agile and DevOps environments

Must Have Skills

Programming language: (Either Python/Java/JavaScript)
Cloud platforms, (AWS/Azure/GCP)
Automate infrastructure with IaC (Terraform, CloudFormation, Pulumi)
CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, Argo CD).
Containers (Dockers, Kubernetes),
Observability: Prometheus, Splunk Grafana, CloudWatch, Datadog,
ITSM framework.

Nice To Have

Experience with Kubernetes, Docker, or other container technologies
Familiarity with AppDynamics, Splunk, ELK, Prometheus, or Grafana
Understanding of service-level objectives (SLOs), SLIs, and error budgets
Experience in cloud migration and cloud-native systems architecture
Background in performance, security, or availability testing practices

Skills Required

Computer

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.

Job Detail

Job Id

JD4266078
Industry

Not mentioned
Total Positions

1
Job Type:

Full Time
Salary:

Not mentioned
Employment Status

Permanent
Job Location

Hyderabad, Telangana, India
Education

Not mentioned
Experience

Year

MNC Jobs India

Jobs by Function

Popular Job Skills

Popular Industries

Popular Cities

Jobseekers

Employers