Act under guidance of DevOps; leading more than 1 Agile team.
Outcomes:
Interprets the DevOps Tool/feature/component design to develop/support the same in accordance with specifications
Adapts existing DevOps solutions and creates relevant DevOps solutions for new contexts
Codes debugs tests and documents and communicates DevOps development stages/status of DevOps develop/support issues
Selects appropriate technical options for development such as reusing improving or reconfiguration of existing components
Optimises efficiency cost and quality of DevOps process tools and technology development
Validates results with user representatives; integrates and commissions the overall solution
Helps Engineers troubleshoot issues that are novel/complex and are not covered by SOPs
Design install and troubleshoot CI/CD pipelines and software
Able to automate infrastructure provisioning on cloud/in-premises with the guidance of architects
Provides guidance to DevOps Engineers so that they can support existing components
Good understanding of Agile methodologies and is able to work with diverse teams
Knowledge of more than 1 DevOps toolstack (AWS Azure GCP opensource)
Measures of Outcomes:
Quality of Deliverables
Error rate/completion rate at various stages of SDLC/PDLC
# of components/reused
# of domain/technology certification/ product certification obtained
SLA/KPI for onboarding projects or applications
Stakeholder Management
Percentage achievement of specification/completeness/on-time delivery
Outputs Expected:
Automated components :
Deliver components that automates parts to install components/configure of software/tools in on premises and on cloud
Deliver components that automates parts of the build/deploy for applications
Configured components:
Configure tools and automation framework into the overall DevOps design
Scripts:
Develop/Support scripts (like Powershell/Shell/Python scripts) that automate installation/configuration/build/deployment tasks
Training/SOPs :
Create Training plans/SOPs to help DevOps Engineers with DevOps activities and to in onboarding users
Measure Process Efficiency/Effectiveness:
Deployment frequency
innovation and technology changes.
Operations:
Change lead time/volume
Failed deployments
Defect volume and escape rate
Meantime to detection and recovery
Skill Examples:
Experience in design installation and configuration to to troubleshoot CI/CD pipelines and software using Jenkins/Bamboo/Ansible/Puppet /Chef/PowerShell /Docker/Kubernetes
Experience in Integrating with code quality/test analysis tools like Sonarqube/Cobertura/Clover
Experience in Integrating build/deploy pipelines with test automation tools like Selenium/Junit/NUnit
Experience in Scripting skills (Python Linux/Shell Perl Groovy PowerShell)
Experience in Infrastructure automation skill (ansible/puppet/Chef/Poweshell)
Experience in repository Management/Migration Automation - GIT BitBucket GitHub Clearcase
Experience in build automation scripts - Maven Ant
Experience in Artefact repository management - Nexus/Artifactory
Experience in Dashboard Management & Automation- ELK/Splunk
Experience in configuration of cloud infrastructure (AWS Azure Google)
Experience in Migration of applications from on-premises to cloud infrastructures
Experience in Working on Azure DevOps ARM (Azure Resource Manager) & DSC (Desired State Configuration) & Strong debugging skill in C# C Sharp and Dotnet
Setting and Managing Jira projects and Git/Bitbucket repositories
Skilled in containerization tools like Docker & Kubernetes
Knowledge Examples:
Knowledge of Installation/Config/Build/Deploy processes and tools
+ Knowledge of IAAS - Cloud providers (AWS Azure Google etc.) and their tool sets
+ Knowledge of the application development lifecycle
+ Knowledge of Quality Assurance processes
+ Knowledge of Quality Automation processes and tools
+ Knowledge of multiple tool stacks not just one
+ Knowledge of Build and release Branching/Merging
+ Knowledge about containerization
+ Knowledge of Agile methodologies
+ Knowledge of software security compliance (GDPR/OWASP) and tools (Blackduck/ veracode/ checkmarxs)
Additional Comments:
o 5+ years of experience as an SRE, DevOps Engineer, or similar role. o Proficiency in scripting and automation (Bash, Python, Go, etc.). o Strong experience with containerization and orchestration (Docker, Kubernetes, Helm). o Solid understanding of Linux systems administration and networking fundamentals. o Experience with cloud platforms (AWS, Azure, or GCP). o Experience with IaC tools like Terraform or CloudFormation. o Familiarity with GitOps and modern deployment practices. o Hands-on experience with observability tools (e.g., Prometheus, Grafana, Datadog). o Strong troubleshooting and incident response skills. Preferred: o Experience in a high-traffic, microservices-based architecture. o Exposure to service meshes (Istio, Linkerd). o Certifications (AWS Certified DevOps Engineer, CKA, etc.) o Experience with security automation and compliance (e.g., SOC2, ISO27001). Soft Skills: o Strong communication and collaboration abilities. o Ability to thrive in a fast-paced, agile environment. o Analytical mindset and proactive approach to problem-solving. o A passion for automation, performance, and system design. Design, build, and maintain reliable, scalable, and secure cloud-based infrastructure (AWS, Azure, or GCP). o Develop and improve observability using monitoring, ing, logging, and tracing tools (e.g., Prometheus, Grafana, ELK, Datadog, etc.). o Automate repetitive tasks and infrastructure using Infrastructure-as-Code (Terraform, CloudFormation, Pulumi). o Create and maintain CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, ArgoCD, etc.) to support fast and safe delivery. o Lead incident response, root cause analysis, and postmortems to ensure high uptime and rapid recovery. o Optimize system performance, reliability, and cost-effectiveness through proactive monitoring and tuning. o Collaborate with software engineering teams to define SLAs/SLOs and improve service reliability. o Implement and maintain security best practices across environments (e.g., secrets management, IAM, firewalls, etc.). o Maintain disaster recovery plans, backups, and high-availability strategies.
Skills
Kubernetes,Cloud Platform,Python Scripting,Sre
About UST
UST is a global digital transformation solutions provider. For more than 20 years, UST has worked side by side with the world's best companies to make a real impact through transformation. Powered by technology, inspired by people and led by purpose, UST partners with their clients from design to operation. With deep domain expertise and a future-proof philosophy, UST embeds innovation and agility into their clients' organizations. With over 30,000 employees in 30 countries, UST builds for boundless impact--touching billions of lives in the process.
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.