Sr Engineer, Software

Year    Hyderabad, Telangana, India

Job Description

About TMUS Global Solutions
T-Mobile is America's supercharged Un-carrier, challenging conventions and setting new standards in wireless. With the nation's largest and fastest 5G network, T-Mobile delivers advanced connectivity and unmatched value to millions across the U.S. We're unwaveringly obsessed with providing the best possible service experience, driven by a spirit of disruption that fuels competition and innovation in wireless and beyond.
Disclaimer: TMUS India Private Limited is a subsidiary of T-Mobile US, Inc. and operates as TMUS Global Solutions. TMUS India Private Ltd., and T-Mobile US, Inc., do not provide telecommunication services in India.

About the Role
As a Senior AIOps Engineer,you will be a key member of the CFL Platform Engineering and Operations team you will help design and implement next-generation intelligent operations that support AI/ML platforms, LLM-based applications, and large-scale distributed systems. Youll develop automation, observability, and remediation pipelines that enable predictive insights, reduce incident impact, and enhance the reliability of production environments.
This is a hands-on, technical role where youll work closely with SRE, DevOps, data, and platform teams to embed intelligent automation into core operational workflows.
What Youll Do

  • Develop automation pipelines for anomaly detection, root cause analysis, and self-healing
  • Build integrations between monitoring systems and AI/ML models for predictive alerting
  • Engineer real-time observability pipelines (logs, metrics, traces) across distributed platforms
  • Deploy and manage tools such as OpenTelemetry, Prometheus, Grafana, Splunk, and Datadog
  • Extend telemetry coverage for LLM-based systems, APIs, and hybrid cloud environments
  • Implement event-driven workflows for incident remediation and automated recovery
  • Contribute to intelligent alerting standards, dashboarding, and escalation logic
  • Collaborate with SRE and DevOps teams to define and implement reliability automation
  • Document playbooks, remediation flows, detection rules, and AIOps patterns
  • Partner with platform and data science teams on AIOps architecture and telemetry modeling
What Youll Bring
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 4-7 years of experience in SRE, DevOps, automation, or infrastructure roles
  • Hands-on experience with observability tools: Prometheus, Grafana, Splunk, OpenTelemetry
  • Proficient in scripting languages such as Python, Go, or Bash
  • Experience building CI/CD pipelines and integrating infrastructure telemetry
  • Working knowledge of Kubernetes, container operations, and cloud-native architectures
  • Familiarity with Azure (preferred), AWS or GCP
  • Understanding of incident response workflows, system health checks, and auto-remediation
Must Have Skills
  • Application & Microservice: Java, Spring boot, API & Service Design
  • Any CI/CD Tools: Gitlab Pipeline/Test Automation/GitHub Actions/ Jenkins /Circle CI
  • App Platform: Docker & Containers (Kubernetes)
  • Any Databases: SQL & NOSQL (Cassandra/Oracle/Snowflake/MongoDB)
  • Any Messaging: Kafka, Rabbit MQ
  • Any Observability/Monitoring: Splunk/ Grafana/ Open Telemetry /ELK Stack/ Datadog/ New Relic/ Prometheus)
  • AIOps Skills: GitOps/ArgoCD/Flux
Nice To Have
  • Fleet mgmt across EKS/AKS, Databricks integration
  • Measure adoption (time-to-first-deploy)
  • Mentor/coach product teams
  • Multi-cloud identity federation (OIDC, SPIFFE)
  • Standardized compositions, lifecycle governance

Skills Required

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4756803
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Hyderabad, Telangana, India
  • Education
    Not mentioned
  • Experience
    Year