Backup and Storage Management Architect / SME (Cohesity, Rubrik, Pure Storage)
Location
: Hybrid
Job Type
: Full-time / Contract
Experience
: 8+ Years
:
We are looking for a highly skilled
Backup & Storage Management Architect/SME
to lead and optimize our Pure Storage, Cohesity, and Rubrik environments. The role involves ensuring robust backup strategies, efficient storage management, and strong disaster recovery readiness. The ideal candidate will bring deep hands-on expertise, strong troubleshooting skills, and the ability to align data protection solutions with business needs.
Job Responsibilities
Cohesity & Rubrik Management
Monitor backup job status, alerts, and troubleshoot failed or incomplete backup jobs.
Validate replication and archival jobs to cloud targets to ensure data integrity and availability.
Perform test restores for non-production environments to ensure backup reliability.
Review and clear alerts related to Cohesity/Rubrik clusters, health issues, or protection groups.
Ensure adherence to defined SLAs and retention policies across all backup configurations.
Pure Storage Management
Monitor Pure Storage array health using Pure1 or the management console.
Validate replication and snapshot schedules to ensure successful execution.
Track storage utilization, capacity trends, and trigger proactive capacity planning.
Perform routine mapping/unmapping of storage volumes based on approved requests.
Weekly / Scheduled Tasks
Generate weekly reports on backup success/failure with trend analysis.
Review storage consumption growth and prepare capacity forecasts.
Validate snapshot retention and cleanup activities per policy.
Conduct scheduled test restore drills and document results.
Confirm backup coverage for newly added VMs, databases, and volumes.
Track and document protection policy or dataset changes.
Monthly / Periodic Tasks
Update documentation for Cohesity protection groups, storage provisioning, and associated policies.
Perform monthly license utilization and capacity reviews for Cohesity and Pure Storage.
Deliver monthly performance reports (IOPS, throughput, latency) for storage arrays.
Assess readiness for Cohesity/Rubrik patches or upgrades.
Validate DR readiness by checking replication and remote cluster sync status.
Automation & Reporting
Script alert extraction and build backup success-rate dashboards using Cohesity REST API.
Automate storage usage reporting via Pure1 and generate forecast alerts.
Integrate backup job failure alerts with ITSM tools for automated ticket creation.
Architecture & Strategic Responsibilities
Collaborate with onshore teams to design new protection policies and SLAs.
Oversee storage provisioning activities for business applications and critical workloads.
Manage Cohesity cluster upgrades, expansions, and firmware updates.
Handle complex production or compliance-driven restore requests.
Perform root cause analysis for recurring backup failures related to apps or networks.
Implement API-based automation enhancements for production clusters.
Skills Required
Technical Skills
Strong expertise with Pure Storage arrays (health monitoring, replication, snapshots).
Hands-on experience with Cohesity & Rubrik platforms for backup, restore, and DR.
Knowledge of cloud backup and archival strategies.
Experience integrating backup alerts and failures into ITSM systems.
Familiarity with automation and reporting using APIs (Cohesity REST API, Pure1).
Soft Skills
Strong analytical, troubleshooting, and problem-solving capabilities.
Excellent communication skills for coordination with onshore/global teams.
Ability to prioritize and manage multiple tasks in a dynamic environment.