Translate client requirements into differentiated, deliverable solutions using in-depth knowledge of a technology, function, or platform. Collaborate with the Sales Pursuit and Delivery Teams to develop a winnable and deliverable solution that underpins the client value proposition and business case.
Must have skills :
Grafana
Good to have skills :
NA
Minimum
3
year(s) of experience is required
Educational Qualification :
15 years full time education
Job Overview: We are looking for a skilled Prometheus Implementation and Support Engineer to join our IT/DevOps team. The successful candidate will be responsible for the deployment, configuration, and ongoing support of the Prometheus monitoring and alerting platform. This role requires strong technical expertise, problem-solving skills, and the ability to work effectively with various teams to ensure the performance and reliability of our infrastructure and applications. Key Responsibilities: o Prometheus Implementation: o Plan, design, and execute the deployment of Prometheus across multiple environments. o Configure Prometheus to monitor key performance metrics, including custom dashboards, alerts, and data sources. o Integrate Prometheus with Grafana for enhanced visualization and monitoring. o Support and Maintenance: o Provide ongoing support for the Prometheus platform, ensuring continuous monitoring and optimal performance. o Troubleshoot and resolve issues related to Prometheus configurations, data collection, and alerting. o Perform regular upgrades and maintenance of the Prometheus platform. o Monitoring and Optimization: o Monitor infrastructure and application performance in real-time, identifying and diagnosing performance issues. o Work with development, operations, and infrastructure teams to implement performance improvements and optimizations. o Conduct root cause analysis of performance issues and provide actionable recommendations. o Collaboration and Training: o Collaborate with cross-functional teams (development, QA, operations) to integrate Prometheus into the monitoring and alerting lifecycle. o Train and support team members in the use of Prometheus for monitoring and troubleshooting. o Reporting and Documentation: o Generate and distribute regular reports on infrastructure and application performance. o Maintain comprehensive documentation of Prometheus configurations, monitoring setups, and troubleshooting procedures. o Continuous Improvement: o Stay current with the latest Prometheus features, best practices, and industry trends. o Proactively suggest enhancements to improve monitoring capabilities and performance. Qualifications: o Education: o Experience: o Minimum [X] years of experience in performance monitoring and management, specifically with Prometheus. o Proven experience in the end-to-end implementation and support of Prometheus in a complex environment. o Technical Skills: o Strong knowledge of Prometheus, including installation, configuration, and customization. o Experience with related technologies such as Grafana, Alertmanager, and exporters. o Proficiency in programming or scripting languages such as Python, Shell, or PowerShell. o Familiarity with cloud environments and container orchestration tools (e.g., Kubernetes, Docker). o Understanding of network protocols and application architectures. o Application and Infrastructure Monitoring - Expertise in monitoring applications and underlying infrastructure o Cloud Platform Knowledge: Understanding of cloud platforms (e.g., AWS, Azure, GCP) and their specific metrics for effective cloud monitoring. o Ability to diagnose and resolve performance issues and conduct thorough RCA. o Customization and Configuration: Proficiency in customizing dashboards, alerts, and reports to meet specific business and technical requirements. o Scripting and Automation: Knowledge of scripting languages (e.g., Python, PowerShell) to automate tasks and enhance monitoring capabilities. o Integration Other Tools with Grafana o Security and Compliance Awareness o Soft Skills: o Excellent problem-solving and analytical skills. o Strong communication and collaboration skills. o Ability to work independently and manage multiple tasks effectively. o Attention to detail and a proactive approach to identifying and resolving issues. Preferred Qualifications: o Prometheus certification(s). o Experience with other monitoring tools like Grafana, Nagios, or Zabbix. o Knowledge of DevOps practices and tools. o Experience in performance testing and tuning.
15 years full time education
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.