DevOps Infrastructure Lead
Role Responsibilities:
1. Infrastructure Provisioning: Provision, configure, and manage infrastructure components using automation tools and cloud services to support development, testing, and production environments.
2. CI/CD Pipeline Management: Design, implement, and maintain CI/CD pipelines to automate software build, test, and deployment processes, enabling faster and more reliable releases.
3. Release Management: Coordinate and manage software releases, ensuring smooth deployment and rollback procedures, and facilitating collaboration between development, operations, and QA teams.
4. Monitoring and Incident Response: Setup and maintain monitoring and alerting to detect and respond to system failures, performance issues, and security threats in a timely manner.
5. Infrastructure Optimization: Identify opportunities to optimize infrastructure costs, improve resource utilization, and enhance system performance through automation and scalability.
6. Security and Compliance: Maintain security controls, best practices, and compliance requirements to protect infrastructure, applications, and data assets from unauthorized access and security vulnerabilities.
7. Documentation and Knowledge Sharing: Document infrastructure configurations, deployment processes, and troubleshooting procedures to facilitate knowledge sharing and system reliability and maintainability.
8. Continuous Improvement: Evaluate and improve DevOps practices, tools, and processes to enhance efficiency, productivity, and quality across the software development lifecycle.
Competencies:
1. AWS Cloud Management: Proficient in architecting and managing AWS cloud solutions, adept at designing and implementing VPC setups, managing EC2 instances, configuring auto-scaling groups, administering RDS databases, and monitoring performance using CloudWatch.
2. Automation: Proficient in scripting and automation tools such as Bash, Python, PowerShell, or Ruby to streamline deployment, configuration management, and infrastructure provisioning tasks.
3. Continuous Integration/Continuous Deployment (CI/CD): Experienced with CI/CD pipelines and tools such as Jenkins, GitLab CI/CD, U-deploy for automation building, testing, security compliance and deploying software releases.
4. Configuration Management: Knowledgeable of configuration management tools like Ansible, Puppet, or Chef to manage infrastructure as code and ensure consistency across environments.
5. Containerization: Expert with containerization technologies such as Docker and container orchestration platforms like Kubernetes to deploy and manage containerized applications.
6. Cloud Computing: Understands cloud platforms such as AWS, Google Cloud Platform (GCP) and proficiency in cloud services like AWS Elastic Compute Cloud (EC2), Google Kubernetes Engine (GKE).
7. Monitoring and Logging: Experience with monitoring tools such as Grafana, logging solutions such as ELK (Elasticsearch, Logstash, Kibana), stack, Splunk, Dynatrace for real-time visibility into system performance and application behavior. Backup, restoring, LVM management, Apache configuration, and NFS and CIFS server configuration.
8. Infrastructure as Code (IaC): Knowledge of IaC principles and tools like Terraform, CloudFormation, templates to provision and manage infrastructure using code.
9. Robust Linux System Administration: In-depth experience in Linux system administration, covering server setup, performance monitoring, package management, user/group administration, filesystem recovery, and troubleshooting, with scripting for automation and task scheduling.
10. Collaboration and Communication: Strong communication and collaboration skills and works effectively with cross-functional teams, including architects, developers, QA engineers, and security teams.
11. Security: Aware of security best practices and principles, including secure coding practices, vulnerability management, and compliance requirements.
12. Agile Practices: Certified in Agile methodologies - SAFe Agile, Scrum experience working in Agile development environments to support rapid and iterative software delivery.
13. Disaster recovery and business continuity plans: Lead the design and implementation of high availability architecture.
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.