Data Steward

Year    Hyderabad, Telangana, India

Job Description

Summary We are seeking a highly skilled Data Engineer with hands-on experience in SQL, PySpark, Databricks, Snowflake and CI/CD processes. The ideal candidate will be responsible for designing, developing, and maintaining scalable data pipelines and infrastructure to support our data analytics and business intelligence needs. You will work closely with data scientists, analysts, and Novartis internal customers (CPOs & Regional marketing and sales teams) to ensure the efficient processing and delivery of high-quality data.

Key Responsibilities:

  • Design, develop, and optimize data pipelines using Python / PySpark to process and analyze large datasets.
  • Write complex SQL queries for data extraction, transformation, and loading (ETL).
  • Work with Databricks to build and maintain collaborative and scalable data solutions.
  • Implement and manage CI/CD processes for data pipeline deployments to ensure seamless and efficient integration and deployment.
  • Collaborate with data scientists and business analysts to understand data requirements and deliver appropriate solutions.
  • Ensure data quality, integrity, and security across all data processes.
  • Monitor and troubleshoot data pipelines and workflows to resolve issues promptly.
  • Continuously improve data and code quality through automation and best practices.
  • Ensure projects are delivered on schedule and within established deadlines.
  • Aid in the creation and maintenance of Standard Operating Procedures (SOPs).
  • Support the development and upkeep of knowledge repositories that capture both qualitative and quantitative reports.
Qualifications:
  • Bachelor's degree in computer science, Engineering, Information Technology, or a related field with 4+ years of relevant work experience.
  • Proven experience with PySpark, including developing and tuning data processing applications.
  • Advanced proficiency in SQL and experience in writing complex queries and optimizing them for performance.
  • Hands-on experience with Databricks, including notebooks, clusters, and integration with other data tools.
  • Strong understanding of CI/CD pipelines and experience with tools such as Jenkins, GitLab CI/CD, or Azure DevOps.
  • Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud) and related data services.
  • Good understanding on data quality management concepts
  • Ability to lead and own engagements independently with
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills, with the ability to work effectively in an Agile team environment.
Preferred Skills:
  • Understanding of healthcare / life sciences domain data and know-how of pharma ecosystem
  • Knowledge of data warehousing concepts and tools (e.g., Snowflake, Redshift).
  • Good to have knowledge on kedro framework.
Understanding and applying effective data governance methods
Skills Desired Advertising Campaigns, Alteryx, Analytical Thinking, Brand Awareness, Business Networking, Curiosity, Digital Marketing, Email Marketing, Marketing Communications, Marketing Plans, Marketing Strategy, Media Campaigns, Process Documentation, Strategic Marketing

Skills Required

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4399526
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Hyderabad, Telangana, India
  • Education
    Not mentioned
  • Experience
    Year