Senior Site Reliability Engineer

Year    Chennai, Tamil Nadu, India

Job Description

About Job
At Growfin.ai, we are seeking a highly motivated and detail-oriented Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in transforming our infrastructure from manually managed EC2 instances to a modern, automated, containerized platform. This is an exciting opportunity to work with cutting-edge technologies and contribute to the growth and reliability of our financial platform.
As a Site Reliability Engineer at Growfin.ai, you will have the chance to collaborate with cross-functional teams to understand application requirements and support deployment strategies that enable rapid development cycles. You will also have the opportunity to learn from senior engineers and contribute to team knowledge sharing through documentation and runbooks.
Skills & Qualification
Hands-on experience with AWS cloud services, including EC2, VPC, IAM, S3, and basic networking concepts.
Basic Linux system administration skills, including scripting, networking, and troubleshooting in development or staging environments.
Familiarity with Infrastructure as Code concepts, with eagerness to learn Terraform for automating infrastructure provisioning and management.
Understanding of containerization technologies, particularly Docker, with strong interest in learning Kubernetes and container orchestration.
Basic proficiency in scripting languages such as Python, Bash, or similar for automation and tool development.
Experience with CI/CD pipeline concepts using tools like GitHub Actions, GitLab CI, Jenkins, or similar platforms.
Strong problem-solving abilities and a growth mindset, with demonstrated eagerness to learn new technologies and tackle infrastructure challenges.
Familiarity with monitoring and observability concepts, with willingness to develop expertise in modern observability platforms (Datadog experience is a plus).
Understanding of JVM-based applications and interest in learning performance monitoring.
Excellent communication skills with the ability to work effectively with cross-functional teams and document technical processes clearly.
Some exposure to configuration management tools (Ansible, Chef, Puppet) or willingness to learn for managing server infrastructure.
Basic understanding of networking concepts, including DNS, load balancing, and security fundamentals.
Familiarity with database concepts and backup strategies for systems like MySQL, PostgreSQL, or similar technologies.
Passion for automation, continuous improvement, and building reliable systems that support business growth.
Enthusiasm for learning and knowledge sharing, with ability to contribute to team collaboration and development initiatives.
Responsibilities
Contribute to the transformation of our infrastructure from manually managed EC2 instances to a modern, automated, containerized platform under senior engineer guidance.
Learn and implement Infrastructure as Code solutions using Terraform to replace manual server management and enable version-controlled infrastructure.
Containerize existing applications using Docker and gain hands-on experience with container orchestration using Kubernetes.
Build and maintain CI/CD pipelines and GitOps workflows to automate deployment processes.
Implement monitoring and alerting solutions across infrastructure components and learn comprehensive observability practices.
Collaborate with development teams to understand application requirements and support deployment strategies that enable rapid development cycles.
Assist in establishing backup, disaster recovery, and security protocols to ensure high availability and data protection for our financial platform.
Monitor AWS resource utilization and help implement cost optimization strategies.
Create documentation and runbooks for new infrastructure processes to support team knowledge sharing.
Learn DevOps best practices, cloud technologies, and modern infrastructure patterns through hands-on experience and team collaboration.
Support the adoption of container security best practices and vulnerability scanning processes.
Participate in incident response efforts and learn from post-mortem analysis to improve system reliability.
Stay current with emerging DevOps technologies and contribute ideas for infrastructure improvements.
Support architecture discussions for new applications and learn about infrastructure strategy and technology planning.

Skills Required

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4947980
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Chennai, Tamil Nadu, India
  • Education
    Not mentioned
  • Experience
    Year