Advanced Engineer Software

Year    Bangalore, Karnataka, India

Job Description

About Albertsons Companies India
Albertsons Companies is a leading food and drug retailer in the United States. As of February 22, 2025, the Company operated 2,270 retail stores with 1,728 in-store pharmacies, 405 associated fuel centers, 22 dedicated distribution centers and 19 manufacturing facilities. Albertsons Companies India is a vital extension of the Albertsons Companies Inc. workforce and important to the next phase in the company & technology journey to support millions of customers & lives every day.

JOB DESCRIPTION:
About the company:
Albertsons Companies is at the forefront of the revolution in retail. With a fixation on innovation and building culture, our team is rallying our company around a unique vision: forging a retail winner that is admired for national strength, deep roots in the communities we serve, and a team that has passion for food and delivering great service.
Albertsons is one of the largest retail employers, providing approximately 300,000 jobs across 2,200 stores, 22 distribution centers, 20 food and beverage plants and various support offices. We operate in 34 states and the District of Columbia under the Albertsons banner, as well as Safeway, Tom Thumb, Jewel Osco, Shaws and many more recognizable names.
What you will be doing:
This role will be an individual contributor responsible for building and finetuning the platform components for the Observability product. The candidate will work closely with the Lead engineer, performance team, data ingestion, platform DevOps and data visualization teams under Observability product. As a member of the platform team, the candidate needs to be able to support and maintain the applications onboarded to Grafana Observability, Ingestion and visualization written in PromQL, Log queries, etc., and monitoring technologies.
This position will preferably be based out of India GCC, Bangalore.
Key Responsibilities:

  • Lead technical support for applications and programs currently in production.
  • Analyze complex problems to determine solutions to problems to be implemented permanently into production.
  • Prepare for Production releases by ensuring appropriate alerts, dashboards, KB articles, Confluence pages and knowledge sharing are properly executed.
  • Ensures dashboards are being monitored daily to detect anomalies and corrections are shared with appropriate teams and team members.
  • Check that alerts are being responded to appropriately.
  • Ensures approvement agendas for services are being maintained and acted on with Development Engineering and DevOps Engineering partners. Experience in Observability and Monitoring initiatives as platform Engineer.
  • Troubleshoot platform issues and restore service by resolving customer-facing incidents
  • Development and implementation of build release pipelines with accountability for managing deployment schedules, issues, risks, and impediments.
  • Agile development experience with team member accountability for commitment and delivery each sprint.
  • Troubleshoot and implement corrections to problems associated with connectivity between the supported applications and the clients they serve
  • Provide technical guidance, in the diagnosis of issues as they arise in support of critical applications
  • Drive collaboration sessions among IT and business groups to facilitate optimal support and operation of the relevant applications
  • Provide Site Reliability Engineering techniques such as observability, alerting and performance tuning
  • Contribute to the design, implementation, and enhancement of critical applications
  • Perform proactive analysis and troubleshooting to predict and prevent production incidents
  • Define and contribute to monitoring capabilities for critical applications
  • Collaborate with key vendors on functional, performance and capacity improvements
  • Design and build tools to automate support and monitoring functions
  • Ensure that all implementations of observability meet the requirements prescribed by IT Services through the effective implementation or use of approved processes, methodologies, and deliverables.
  • Provide expertise and build solutions for observability applications as well as system integration with internal systems and external vendors.
  • Able to provide coding and technical direction to less experienced staff or develops highly complex original code.
  • Track infrastructure delivery and dependencies to implementation.
We are searching for someone with the following skills:
  • Experience with gathering and organizing large volume of data to use for instrumentation into an Enterprise Observability solution.
  • Experience with recommending baseline monitoring thresholds, and performance monitoring KPIs and SLAs.
  • Experience with installing agents, forwarders, APIs, performance monitoring alerts, dashboards, and data trend analysis.
  • Good Knowledge and understanding of Azure foundation components e.g. App GW, APIM, Virtual Network, NSG, Load Balancer, Azure VM etc. is required.
  • Team-oriented, positively contributing to team morale and willing to help.
  • Learning-Focused, finding ways to improve in their field and use positive constructive feedback to grow personally and professionally
  • Think strategically and proactively anticipate future problems, needs or changes in the work
  • Experience with Databases Azure SQL, PostgreSQL, MySQL, MongoDB, TSDB or similar databases.
  • Experience with designing and implementing user-friendly web interfaces
  • Experience with developing and maintaining server-side applications
  • Experience with managing databases and optimizing their performance
  • Hands on experience in developing MCP clients using React
  • Experience with integrating the web applications with Grafana
  • Ensuring application responsiveness
  • Ensuring cross-platform optimization of web applications for mobile devices
  • Keeping up with the latest developments in web application technology
  • Experience with developing back-end web applications
  • Experience with developing front-end web architectures
  • Azure/GCPhands-on with details around pulling observability data from managed services
  • Experience must include at least one of the following languages: React, Node JS, Javascript, HTML, CSS, Java (required), Desired--Python, Go
  • Golang/Python coding or from solutioning background with experience on SRE development and Open telemetry implementation
  • Design and develop standard Grafana dashboards for critical metrics for various Azure/GCP services using the observability data
  • Knowledge of monitoring tools such as Log Analytics, App Dynamics, Grafana, Prometheus, Splunk, and Sitescope
  • Experience in working with ServiceNow or similar Service Management tools
  • Familiarity with Cloud technologies in Azure, AWS, and Google Cloud
  • Experience on PCF, Docker, Kubernetes platform is required.
  • Experience with DevOps and CI/CD tools and processes is required.
  • Experience in high-performance and high-frequency data streaming and health confirmation techniques (using Kafka etc.) and handling large volume of batch data is strongly preferred
  • In-depth advanced knowledge of current monitoring tools
  • In-depth advanced knowledge of at least one major cloud platform and Service Container/Instance concepts
  • In-depth advanced knowledge of querying and inspection techniques for service and other types of logs
  • In-depth advanced knowledge of the full software development lifecycle and software development methodologies (Agile).
  • Strong ability to understand client expectations and to resolve issues that may affect service.
  • Strong ability to mentor, coach and train other application support engineers
  • Self-starter, with a demonstrated ability to learn beyond formal training with a strong aptitude for delivering quality products.
We believe the successful candidate has these qualifications and experience:
  • 4-year degree (Computer Science, Information Systems, or relational functional field) and/or equivalent combination of education or work experience.
  • 3-6 years of experience on integration engineering related to Observability/Monitoring framework with open source technologies such as Grafana, Mimir, Loki, Tempo, Fluentbit, Vector etc.,
  • Hands-on experience with Tools and Technology is preferred.
  • 3+ years of developer experience in as a Full stack engineer is required.
  • Experience working with Open-source platforms and Open Telemetry libraries e.g. Grafana is preferred.

Skills Required

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD5141296
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Bangalore, Karnataka, India
  • Education
    Not mentioned
  • Experience
    Year