Senior Lead Data Engineer

Year    Bengaluru, Karnataka, India

Job Description


Site Name: India - Karnataka - Bengaluru
Posted Date: Nov 3 2023
GSK is one of the world\xe2\x80\x99s foremost pharmaceutical and healthcare companies, and we are proud to be part of an industry that improves the lives of others. We embark on a significant transformation journey to support GSK in becoming a top-quartile data-enabled organization. This is an exciting time to join GSK. We are embracing new data technologies to improve the development, manufacture, and distribution of GSK\xe2\x80\x99s vital products to patients and consumers worldwide. You will be part of a team building a robust data and analytics ecosystem, allowing GSK to drive higher value by placing data at the core of its strategic and operational decisions. The distinct behaviours demonstrated by Senior Lead Engineer- Data, are architecting and developing complex data pipelines, understanding data sources & databases, and creating automated scripts to solve complex data problems. This role is accountable for contributing within a team in technical software solution design, implementation, and continuous improvement of solutions for complex software products, embedding agile and DevOps principles and forming influential partnerships with other principal engineers and enterprise architects. This is a \xe2\x80\x9cT-shaped\xe2\x80\x9d role demonstrating both depth and breadth across key engineering competencies and the ability to successfully collaborate with engineers up and down the entire software stack; in a regulated, life sciences environment.
This is a collaborative role, functioning as part of a multi-disciplined team. The role will be part of a global team building advanced Scientific, vaccine and medicine design products within R&D Tech. GSK is going through a major transformation journey. The ambition is to become a data intelligent company. We are looking for an experienced cloud Data engineer lead that will help enable our strategy through the design of robust, reliable, performant, and cost-effective cloud data solutions. The ideal candidate will need to have: Technical skills:

  • Deep Azure & GCP data pipeline knowledge and understanding of product features.
  • Proven track record in designing, developing, and implementing data modelling & data pipelines, at a conceptual, logical and physical level.
  • Experience with accountability for Data Architecture and Lifecycle (including non-functional requirements and operations) management.
  • Experienced in industry recognized data pipeline standards.
  • Understands industry recognized data modelling patterns and standards.
  • Knowledge of the latest technologies to process large amount of data (Apache Spark; Databricks)
  • Experience in MDM, Metadata Management, Data Quality and Data Lineage tools.
  • A deep understanding of ETL and ELT tools and techniques
  • Very strong hands-on experience in using Microsoft Azure services like Azure Data Factory, Data brick services (like processing streaming data using Spark clusters), usage of Blob containers, ESB, Event Grid, Azure SQL server, Cosmos DB, Azure functions, Analytics (like Power BI) is a mandatory requirement.
  • Deep Azure & GCP data pipeline knowledge and understanding of product features.
  • Good Experience in Python esp with libraries like Numpy, Pandas, Matplotlib, PySpark, Flask, Scikit-learn
  • Good Knowledge of DataFactory , DataLake, DataBricks
  • Good knowledge of Spark, Scala, Airflow, Hadoop etc
  • Good knowledge of MySQL/PostgreSQL/MongoDB etc
  • Experience in working on Hadoop Cluster including cluster planning, designing, implementing, benchmarking, performance tuning and monitoring.
  • Experience in working with Hadoop ecosystem components: Hive, PIG, Sqoop, Flume, Zookeeper and Oozie.
  • Strong knowledge on Hadoop HDFS architecture and Map Reduce Framework.
  • Strong understanding and working experience on injecting and accessing data to and from a cluster.
  • Experience in working with large volumes of streaming data using Flume.
  • Knowledge in implementing business logic and optimize the queries using HiveQL by implementing partitioning and bucketing techniques.
  • Excellent documentation skills including designing of UML diagrams.
  • Experience in Scripting language like Python/R
  • Excellent communication skills, both verbal and written
  • Experience or knowledge in Data virtualization tool like Denodo
  • Good experience in working with data using SQLs, Python and Scala in Microsoft based resources like Azure Synapse and Data Lake.
  • Understanding/Exposure/Experience to Denodo will be a big plus.
  • Experience or Understanding in Neo4j and/or GraphQL would be a huge plus.
  • Understands overall IT system design, in particular networking, authorization and authentication protocols, data security, disaster recovery.
  • Exposure to Agile/Scrum is a must.
  • Data science (e.g. AI/ML), data analytics & data quality/integrity exposure is desired
Personal skills:
  • A great team player, with the ability to perfectly integrate and play a pivotal role in a team that also includes data scientists, data engineers and data analysts.
  • Able to appreciate short term vs. long term goals and take both tactical and strategic decisions.
  • Great communication skills, ability to communicate complex technical concepts to a non-technical audience.
  • Strong organizational skills, the ideal candidate has the ability to work in a fast-paced environment and has the ability to quickly adapt to changing priorities.
  • Works well as a technical leader and individual contributor.
  • Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
  • Guide the development of an Agile data development experience, when required using DW automation tools.
  • Assist the project team in planning and execution to achieve daily and sprint goals.
  • Articulate documentation of project artefacts and deliverables.
  • Identifies and communicates risks clearly to the Product owner and the team.
Eligibility Criteria:
  • Years of Experience: Minimum 6- 9 years
  • Educational: Engineering, MCA, M. Tech / Ph.D. from leading institutes.
  • Primary skill : Cloud Data Engineering with exposure to Data Virtualization, non relational
database, strong in ETL Why GSK? Our values and expectations are at the heart of everything we do and form an important part of our culture. These include patient focus, transparency, respect, integrity along with courage, accountability, development, and teamwork. As GSK focuses on our values and expectations and a culture of innovation, performance, and trust, the successful candidate will demonstrate the following capabilities:
  • Agile and distributed decision-making\xe2\x80\x94using evidence and applying judgement to balance pace, rigour, and risk.
  • Managing individual performance.
  • Committed to delivering high quality results, overcoming challenges, focusing on what matters, execution.
  • Implementing change initiatives and leading change.
  • Sustaining energy and well-being, building resilience within in a team.
  • Continuously looking for opportunities to learn, build skills and share learning both internally and externally.
  • Translating strategy into action\xe2\x80\x94a compelling narrative, setting and achieving objectives.
  • Building strong relationships and collaboration, managing trusted stakeholder relationships internally and externally.
Our goal is to be one of the world\xe2\x80\x99s most innovative, best performing, and trusted healthcare companies. We believe that we all bring something unique to GSK and when we combine our knowledge, experiences, and styles together, the impact is incredible. Come join our adventure at GSK where you will be inspired to do your best work for our patients and consumers. A place where you can be you, feel good and keep growing. Why Us? GSK is a global biopharma company with a special purpose \xe2\x80\x93 to unite science, technology and talent to get ahead of disease together \xe2\x80\x93 so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns \xe2\x80\x93 as an organization where people can thrive. Getting ahead means preventing disease as well as treating it, and we aim to positively impact the health of 2.5 billion people by the end of 2030. Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it\xe2\x80\x99s also about making GSK a place where people can thrive. We want GSK to be a workplace where everyone can feel a sense of belonging and thrive as set out in our Equal and Inclusive Treatment of Employees policy. We\xe2\x80\x99re committed to being more proactive at all levels so that our workforce reflects the communities we work and hire in, and our GSK leadership reflects our GSK workforce. Important notice to Employment businesses/ Agencies GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK\'s commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site. It has come to our attention that the names of GlaxoSmithKline or GSK or our group companies are being used in connection with bogus job advertisements or through unsolicited emails asking candidates to make some payments for recruitment opportunities and interview. Please be advised that such advertisements and emails are not connected with the GlaxoSmithKline group in any way. GlaxoSmithKline does not charge any fee whatsoever for recruitment process. Please do not make payments to any individuals / entities in connection with recruitment with any GlaxoSmithKline (or GSK) group company at any worldwide location. Even if they claim that the money is refundable. If you come across unsolicited email from email addresses not ending in gsk.com or job advertisements which state that you should contact an email address that does not end in \xe2\x80\x9cgsk.com\xe2\x80\x9d, you should disregard the same and inform us by emailing askus@gsk.com, so that we can confirm to you if the job is genuine.

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD3190691
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Bengaluru, Karnataka, India
  • Education
    Not mentioned
  • Experience
    Year