AVP – Databricks Architect

TS, IN, India

Job Description

Ready to shape the future of work?

At Genpact, we don't just adapt to change; we drive it. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies' most complex challenges.

If you thrive in a fast-moving, tech-driven environment, love solving real-world problems, and want to be part of a team that's shaping the future, this is your moment.

Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions, we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today.

Get to know us at genpact.com and on LinkedIn, X, YouTube, and Facebook.

Inviting applications for the role of Assistant Vice President - Lead Data Engineer

In this role, the Lead Data Engineer will lead the design and optimization of advanced data solutions. The role requires expertise in Databricks, Azure Data Factory (ADF), Python, PySpark, and Unity Catalog to efficiently process and manage large datasets, along with a deep understanding of cloud architecture to build scalable, secure, and reliable data solutions on the Microsoft Azure platform. The primary responsibility of the Lead Data Engineer with Unity Catalog expertise is to apply advanced data engineering skills to optimize data integration, enhance data accessibility, and drive strategic decision-making through effective data governance, simplification, standardization, and innovative solutions across all supported units. The role also involves implementing DevOps best practices and driving innovation using modern data platform capabilities such as Unity Catalog, MLflow, and Large Language Models (LLMs).

Responsibilities

Design and Development:

Collaborate with business stakeholders and analysts to understand data requirements. Design, develop, and test data pipelines and workflows using Unity Catalog to optimize end-to-end processes. Create reusable components, robust exception handling, and standardized frameworks for data solutions.

Solution Design:

Develop and maintain robust data architectures using Lakehouse principles to ensure efficient data processing and storage. Deliver comprehensive data architecture solutions using Databricks and Lakehouse principles to support advanced analytics and machine learning initiatives.

Explore and integrate Large Language Models (LLMs) and Copilot tools to drive automation and agility.

Leverage Databricks MLflow for model lifecycle management and operationalization.

Leverage data best practices and tools, and assist ML engineers in pulling, filtering, tagging, joining, parsing, and normalizing data sets for use.

Data Quality and Governance:

Ensure data quality frameworks, lineage, and monitoring are in place.

Implement data quality checks, validation rules, and governance policies to ensure the accuracy, reliability, and security of data assets.

Implement data security and privacy measures to protect sensitive information.

Data Integration and Analytics:

Design, implement, and deploy data loaders to load data into the engineering sandbox.

Collaborate with data scientists and analysts to support their data requirements and prepare machine learning feature stores.

Ingest data from different sources, then transform, stitch, and wrangle it for advanced analytics activities.

Leadership and Mentorship:

Own complex, cross-functional data projects from ideation to production, including defining requirements, designing solutions, leading development, and ensuring successful deployment and long-term maintenance.

Provide guidance and technical leadership to a team of data engineers through in-depth code reviews, mentoring junior and mid-level engineers, and fostering a culture of technical excellence.

Mentor mid-level engineers and perform peer reviews.

Provide input to ML engineers and cloud engineers on the design and implementation of data management and architecture solutions.

Process Improvement and Efficiency:

Drive continuous improvement initiatives in data processes and systems. Promote standardization and automation to enhance efficiency and accuracy. Support regional and global data projects.

Qualifications We Seek in You!

Minimum Qualifications / Skills

Bachelor's degree in Computer Science, Information Systems, or a related field.

Experience in Databricks, Azure Data Factory (ADF), Python, PySpark, Unity Catalog, Dataflow, and Lakehouse architecture.

Deep hands-on expertise in Azure Data Services (e.g., Azure Data Lake, Azure Data Factory, Synapse, etc.) and Databricks.

Strong experience in data pipeline design, ETL/ELT development, and data orchestration frameworks.

Proficiency in DevOps tools and practices (CI/CD pipelines, IaC, monitoring).

Knowledge of data lineage, cataloging, and enterprise data marketplace concepts.

Familiarity with integrating third-party data sources and managing data quality frameworks.

Ability to leverage LLMs and Copilot solutions to enhance data platform productivity.

Experience in building self-healing architecture for data pipelines.

Proven experience in managing data projects in complex environments, including global or multinational contexts

Hands-on experience with data pipeline development and optimization

Deep knowledge of data governance frameworks and tools, including Databricks Unity Catalog, to ensure data security, quality, and compliance at an enterprise level.

A strong understanding of MLOps for building data foundations that support machine learning.

Experience with DevOps practices to enhance data project delivery efficiency

Preferred Qualifications / Skills

Prior track record of leading enterprise HR/People platforms is a plus.

Experience leading multiple pods and mentoring senior and mid-level engineers.

Experience in large-scale Lakehouse design, data mesh principles, and performance optimization

Certifications in Azure data engineering, Databricks or related fields

Why join Genpact?

Be a transformation leader - Work at the cutting edge of AI, automation, and digital innovation

Make an impact - Drive change for global enterprises and solve business challenges that matter

Accelerate your career - Get hands-on experience, mentorship, and continuous learning opportunities

Work with the best - Join 140,000+ bold thinkers and problem-solvers who push boundaries every day

Thrive in a values-driven culture - Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress

Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up.

Let's build tomorrow together.

Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation.

Furthermore, please note that Genpact does not charge fees to process job applications, and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training.

Job: Assistant Vice President

Primary Location: India-Hyderabad

Schedule: Full-time

Education Level: Bachelor's / Graduation / Equivalent

Job Posting: Dec 26, 2025, 1:41:41 AM

Unposting Date: Jun 24, 2026, 7:41:41 AM

Master Skills List: Digital

Job Category: Full Time


Job Detail

  • Job Id
    JD5023045
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    TS, IN, India
  • Education
    Not mentioned
  • Experience
    Year