Main Responsibilities
1. Ensure 24/7 Operations and Reliability: Ensure the reliability and
performance of data services in production GCP and on-premise Hadoop
environments.
2. Collaborate with Data Engineering Team: Collaborate with the data
engineering development team to design, build, and maintain scalable,
reliable, and secure data pipelines and systems.
3. Develop Monitoring and Incident Response Strategies: Develop and
implement monitoring, alerting, and incident response strategies to
proactively identify and resolve issues.
4. Implement Security and Reliability Best Practices: Drive the
implementation of security and reliability best practices across the
software development life cycle.
5. Contribute to Tool Development and Automation: Contribute to the
development of tools and automation to streamline the management and
operation of data services.
6. Participate in On-call Rotation: Participate in on-call rotation and
respond to incidents in a timely and effective manner.
7. Continuously Evaluate and Improve: Continuously evaluate and improve
the reliability, scalability, and performance of data services.
Profile Requirements (Must-Have Qualifications)
1. Bachelor//'s Degree: Bachelor//'s degree in Computer Science,
Engineering, or a related field.
2. Site Reliability Engineering Experience: 3+ years of experience in
site reliability engineering or a similar role.
3. GCP Services Experience: Strong experience with Google Cloud Platform
(GCP) services, including BigQuery, Dataflow, Pub/Sub, and Cloud
Storage.
4. On-premise Hadoop Experience: Experience with on-premise Hadoop
environments and related technologies (HDFS, Hive, Spark, etc.).
5. Programming Language Proficiency: Proficiency in at least one
programming language (Python, Scala, Java, Go, etc.).
6. Monitoring and Logging Experience: Experience with monitoring and
logging tools such as Stackdriver, Prometheus, or ELK stack.
7. Security Best Practices: Strong understanding of security best
practices and experience implementing security controls in data
services.
8. Problem-Solving and Troubleshooting Skills: Excellent problem-solving
and troubleshooting skills.
9. Communication and Collaboration Skills: Strong communication and
collaboration skills.
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.