We are looking for a highly skilled and motivated Site Reliability Engineer (SRE) to join our High Performance Computing (HPC) team. In this role, you will ensure the reliability, performance, and scalability of our critical HPC systems and infrastructure. You will work closely with engineering, infrastructure, and operations teams to design, implement, and manage systems that support compute-intensive workloads, enabling cutting-edge research, simulations, and data processing. Your role will combine software engineering expertise with system administration skills to continuously improve the reliability and performance of HPC environments, reduce operational toil, and respond effectively to incidents. If you enjoy working in high-performance, mission-critical environments, this position offers a unique opportunity to make a significant impact.
In your new role you will:
System Reliability and Performance
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.