Lead I Devops Engineering

Year    Pune, Maharashtra, India

Job Description


JOB DESCRIPTION Role Proficiency:Act under guidance of Lead II/Architect understands customer requirements and translate them into design of new DevOps (CI/CD) components. Capable of managing at least 1 Agile Team Outcomes: Interprets the DevOps Tool/feature/component design to develop/support the same in accordance with specifications Adapts existing DevOps solutions and creates own DevOps solutions for new contexts Codes debugs tests documents and communicates DevOps development stages/status of DevOps develop/support issues Select appropriate technical options for development such as reusing improving or reconfiguration of existing components Optimises efficiency cost and quality of DevOps process tools and technology development Validates results with user representatives integrates and commissions the overall solution Helps Engineers troubleshoot issues that are novel/complex and are not covered by SOPs Design install configure troubleshoot CI/CD pipelines and software Able to automate infrastructure provisioning on cloud/in-premises with the guidance of architects Provides guidance to DevOps Engineers so that they can support existing components Work with diverse teams with Agile methodologies Facilitate saving measures through automation Mentors A1 and A2 resources Involved in the Code Review of the team Measures of Outcomes: Quality of deliverables Error rate/completion rate at various stages of SDLC/PDLC # of components/reused # of domain/technology certification/ product certification obtained SLA for onboarding and supporting users and tickets Outputs Expected:Automated components : Deliver components that automat parts to install components/configure of software/tools in on premises and on cloud Deliver components that automate parts of the build/deploy for applications Configured components: Configure a CI/CD pipeline that can be used by application development/support teams Scripts: Develop/Support scripts (like Powershell/Shell/Python scripts) that automate installation/configuration/build/deployment tasks Onboard users: Onboard and extend existing tools to new app dev/support teams Mentoring: Mentor and provide guidance to peers Stakeholder Management: Guide the team in preparing status updates keeping management updated about the status Training/SOPs : Create Training plans/SOPs to help DevOps Engineers with DevOps activities and in onboarding users Measure Process Efficiency/Effectiveness: Measure and pay attention to efficiency/effectiveness of current process and make changes to make them more efficiently and effectively Stakeholder Management: Share the status report with higher stakeholder Skill Examples: Experience in the design installation configuration and troubleshooting of CI/CD pipelines and software using Jenkins/Bamboo/Ansible/Puppet /Chef/PowerShell /Docker/Kubernetes Experience in Integrating with code quality/test analysis tools like Sonarqube/Cobertura/Clover Experience in Integrating build/deploy pipelines with test automation tools like Selenium/Junit/NUnit Experience in Scripting skills (Python/Linux/Shell/Perl/Groovy/PowerShell) Experience in Infrastructure automation skill (ansible/puppet/Chef/Powershell) Experience in repository Management/Migration Automation - GIT/BitBucket/GitHub/Clearcase Experience in build automation scripts - Maven/Ant Experience in Artefact repository management - Nexus/Artifactory Experience in Dashboard Management & Automation- ELK/Splunk Experience in configuration of cloud infrastructure (AWS/Azure/Google) Experience in Migration of applications from on-premises to cloud infrastructures Experience in Working on Azure DevOps/ARM (Azure Resource Manager)/DSC (Desired State Configuration)/Strong debugging skill in C#/C Sharp and Dotnet Setting and Managing Jira projects and Git/Bitbucket repositories Skilled in containerization tools like Docker/Kubernetes Knowledge Examples: Knowledge of Installation/Config/Build/Deploy processes and tools Knowledge of IAAS - Cloud providers (AWS/Azure/Google etc.) and their tool sets Knowledge of the application development lifecycle Knowledge of Quality Assurance processes Knowledge of Quality Automation processes and tools Knowledge of multiple tool stacks not just one Knowledge of Build Branching/Merging Knowledge about containerization Knowledge on security policies and tools Knowledge of Agile methodologies Additional Comments: PTEC is looking for Site Reliability Engineer to manage end to end application and system stack and to work with one of the leading financial services organization in the US. Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations. SRE is also an engineering approach to building and running production systems -engineer solutions to operational problems. As SREs are responsible for overall system operation, utilizing a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages. Responsibilities: As a Site Reliability Engineer, . You will be part of the team to migrate and transform the on-prem applications and data centers to public Cloud (GCP), and then. . You will engage in and improve the software development lifecycle - from inception and design, through development, deployment, operation and refinement . Develop and maintain the large-scale infrastructure . Own build tools and CI/CD automation pipeline . You will influence and design infrastructure, architecture, standards and methods for large-scale systems . You will support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews . You will maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health . You will automate system scalability and continually work to improve system resiliency, performance and efficiency . Investigate, diagnose, and resolve performance and reliability problems in a wide range of large-scale and high-throughput services . Collaborate with architects and application engineers to ensure applications are maintainable, scalable, and follow appropriate disaster recovery and high availability strategies . Contributions to handbook, runbooks, and general documentation . You will remediate tasks within corrective action plan via sustainable, preventative, and automated measures whenever possible Requirements: . BS degree in Computer Science or related technical field, or equivalent job experience required . 4plus years of SRE experience in Cloud environments . 2+ years of experience developing and/or administering software in public cloud . Strong working knowledge and working experience on GCP (Google Cloud Platform) . Experience in DevOps and CI/CD pipelines and build tools like Jenkins. . 2 -4 years of experience in languages such as Python, Ruby, Bash, Java, Go, Perl, JavaScript and/or node.js . Experience managing Infrastructure as code via tools such as Terraform or CloudFormation . Must have great communication skills . Experience operating a production environment at high scale with emphasis on availability, latency . Deep knowledge of container orchestration tools such as Docker, Kubernetes . Familiar with configuration management tools and Deployment tools such as Chef, Octopus . Experience in software development in one or more of the following: C, C++, Java, Go and/or Perl, Python. . Prior experience in developing and/or administering software in Windows with Dotnet applications . Strong team player with a \'can do\' attitude, and the flexibility to jump in wherever needed . Demonstrable cross-functional knowledge with systems, storage, networking, security and databases . System administration skills, including automation and orchestration of Linux/Windows using Chef, Puppet, Ansible, Salt Stack and/or containers (Docker, Kubernetes, etc.) . Proficiency with continuous integration and continuous delivery tooling and practices . Strong analytical and troubleshooting skills . Ability and willingness to learn and apply new tools and technologies . Extra Points for any of the following: . Prior experience in developing applications in .NET technologies (desired) or Java . You have expertise designing, analyzing and troubleshooting large-scale distributed systems. . You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive . You are passionate for automation with a desire to eliminate toil whenever possible . You\'ve built software or maintained systems in a highly secure, regulated or compliant industry . You thrive in and have experience and passion for working within a DevOps culture and as part of a team

foundit

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD3041764
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Pune, Maharashtra, India
  • Education
    Not mentioned
  • Experience
    Year