Sr Engineer, Site Reliability

Year    Hyderabad, Telangana, India

Job Description

About TMUS Global Solutions
T-Mobile is Americas supercharged Un-carrier, challenging conventions and setting new standards in wireless. With the nations largest and fastest 5G network, T-Mobile delivers advanced connectivity and unmatched value to millions across the U.S. Were unwaveringly obsessed with providing the best possible service experience, driven by a spirit of disruption that fuels competition and innovation in wireless and beyond.
TMUS India Private Limited is a subsidiary of T-Mobile US, Inc. and operates as TMUS Global Solutions.

About the Role
The Senior Systems Reliability Engineer (SRE) ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise to build and maintain highly available, scalable systems. As a leader in DevOps and cloud reliability practices, the engineer supports continuous improvement of automation, deployment pipelines, observability, and incident management, while mentoring junior engineers and optimizing production workflows.
The position plays a critical part in enabling software to be delivered faster, better, and more reliably to support business and customer needs.
What Youll Do

  • Design and maintain CI/CD pipelines and DevOps automation solutions
  • Guide incident response and improve system resiliency and performance
  • Build monitoring tools, dashboards, and proactive alerting for non-production environments
  • Create and maintain infrastructure as code (IaC) for scalable environments
  • Work with containerization and microservices in cloud-native platforms
  • Mentor junior engineers and collaborate across teams on cloud and DevOps initiatives
  • Improve software delivery processes through automation, cloud migration, and service orchestration
  • Perform other duties and technical projects as assigned
What Youll Bring
  • Bachelors degree in Computer Science, Software Engineering, or related field (Masters preferred)
  • 47 years of experience in systems reliability, DevOps, or cloud infrastructure engineering
  • Experience with CI/CD tools like Jenkins, GitLab CI, or CloudBees
  • Familiarity with infrastructure and configuration management tools (Ansible, Chef, Puppet)
  • Hands-on knowledge of public and private cloud platforms
  • Experience with application performance monitoring (APM) and log aggregation tools
  • Proven experience working in Agile and DevOps environments
Must Have Skills
  • Programming language: (Either Python/Java/JavaScript)
  • Cloud platforms, (AWS/Azure/GCP)
  • Automate infrastructure with IaC (Terraform, CloudFormation, Pulumi)
  • CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, Argo CD).
  • Containers (Dockers, Kubernetes),
  • Observability: Prometheus, Splunk Grafana, CloudWatch, Datadog,
  • ITSM framework.
Nice To Have
  • Experience with Kubernetes, Docker, or other container technologies
  • Familiarity with AppDynamics, Splunk, ELK, Prometheus, or Grafana
  • Understanding of service-level objectives (SLOs), SLIs, and error budgets
  • Experience in cloud migration and cloud-native systems architecture
  • Background in performance, security, or availability testing practices

Skills Required

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4266078
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Hyderabad, Telangana, India
  • Education
    Not mentioned
  • Experience
    Year