About Doyensys:
Doyensys is a Management & Technology Consulting company with expertise in Enterprise applications, Infrastructure Platform Support, and solutions. Doyensys helps clients to harness the power of innovation to thrive on change. The company leverages its technology expertise, global talent, and extensive industry experience to deliver powerful next-generation IT services and solutions. Doyensys Inc has operations in India, the US, Mexico, and Canada.
Job Requirement
Project Role: DevOps Engineer
Project Role Description: A Site Reliability / DevOps Engineer will be responsible for building and managing production-grade infrastructure. The role requires solid problem-solving skills, deep technical expertise, and the ability to handle complex systems with ownership.
Work Experience: 6+ years
Work Location: Chennai (work from office)
Shift: 6:30 pm to 3:30 am IST (9 am to 6 pm EST), Monday to Friday
Technical Expertise
Must Have Skills:
Hands-on expertise with GCP, Kubernetes, Terraform, GitLab CI/CD, and ArgoCD.
Strong Linux administration background.
Practical experience with monitoring, alerting, and observability tools.
Incident response and on-call experience with PagerDuty or equivalent.
Knowledge of CDN technologies (Fastly or similar).
Strong networking and security fundamentals (VPC, firewalls, IAM).
Scripting knowledge in Python/Bash.
Exposure to multi-cloud environments (GCP, Azure, AWS); GCP exposure is mandatory.
Key Responsibilities:
Design, implement, and manage scalable infrastructure on GCP (knowledge of Azure is an added advantage).
Build and maintain CI/CD pipelines using GitLab and manage deployments with ArgoCD.
Operate and troubleshoot Kubernetes clusters with strong command of kubectl.
Containerize and manage applications using Docker.
Configure and optimize CDN services (preferably Fastly CDN).
Ensure system observability using Datadog (preferred) or similar tools, covering logs, metrics, and traces.
Respond to and manage incidents using PagerDuty, including root cause analysis and post-incident reviews.
Automate infrastructure provisioning and management at scale using Terraform.
Maintain and troubleshoot Linux systems with strong fundamentals.
Implement and support networking, security, IAM, and compliance best practices.
Contribute to disaster recovery, backup, high availability, and scaling strategies.
Continuously optimize cloud infrastructure for cost and performance.
Collaborate with development teams to improve release cycles and overall system reliability.
Write automation scripts in Bash / Python to improve workflows.
Document infrastructure processes and share knowledge within the team.
Act as a key problem solver for production and infrastructure issues.
Professional Attributes:
Strong Technical Skills: A Site Reliability Engineer should have strong technical skills in cloud services, networking, security, and virtualization, along with experience in managing and troubleshooting services, managing storage and databases, and deploying applications.
Problem-solving Skills: A Site Reliability Engineer should be able to analyze problems, identify the root cause, and develop effective solutions. They should also have experience in troubleshooting services, resolving issues, and implementing corrective actions.
Communication Skills: A Site Reliability Engineer should have excellent communication skills to interact effectively with stakeholders such as team members, clients, and management. They should be able to communicate technical information clearly and concisely.
Time Management Skills: A Site Reliability Engineer should be able to manage their time effectively to meet deadlines, prioritize tasks, and work on multiple projects simultaneously. They should also be able to work under pressure and manage their workload efficiently.
Attention to Detail: A Site Reliability Engineer should have strong attention to detail and be able to perform tasks accurately. They should also be able to document their work clearly and comprehensively.
Team Player: A Site Reliability Engineer should be able to work well in a team environment and collaborate effectively with other team members. They should also be open to learning new skills and sharing their knowledge with others.
Customer Service Skills: A Site Reliability Engineer should have excellent customer service skills to interact with clients, understand their requirements, and provide effective solutions to their problems. They should also be able to provide timely and professional support to clients.
Educational Qualification:
Bachelor's degree in computer science or a related field.
Understanding of cloud security best practices.
Excellent problem-solving and troubleshooting skills.
Strong communication and collaboration skills.
Ability to work independently and as part of a team.
Willingness to learn and adapt to new technologies and practices.
Behavioral Attributes:
Attention to detail
Strong problem-solving skills
Ability to work in a team
Strong communication skills
Ability to learn and adapt
Time management skills
Customer focus
Flexibility and adaptability
Initiative
Professionalism
Job Types: Full-time, Permanent
Pay: ₹1,000,000.00 - ₹2,500,000.00 per year
Benefits:
Cell phone reimbursement
Health insurance
Provident Fund
Work Location: In person