3+ years of experience developing near-real-time (streaming) and batch data pipelines in a large-scale organization
7.5+ years of relevant software development experience in total
Experience in writing reusable/efficient code to automate analysis and data processes
2+ years of business/marketing analytics experience, preferably in a consumer-based organisation
Experience successfully delivering an independent project with minimal supervision
Experience in processing structured and unstructured data into a form suitable for analysis and reporting, including integration with a variety of data metric providers across web analytics, consumer analytics, and advertising
Strong experience with data modelling and with batch data pipeline design and implementation
Strong experience in software development and engineering principles
Experience implementing scalable, distributed, and highly available systems using AWS services such as Kinesis, DynamoDB, and S3
Exceptional communication skills, particularly in communicating and visualizing quantitative findings in a compelling and actionable manner for business stakeholders
Experience in mentoring and supporting junior members of the team
High proficiency in Python/PySpark, Scala, or Java
High proficiency in SQL
Experience with Databricks/Spark
Experience with orchestration tools such as Airflow (we use Astronomer)
Comfortable with CI/CD pipelines (we use GitHub Actions)
Experience with Git version control and other software-adjacent tools
Experience with Terraform, which we use as our infrastructure-as-code tool
Please share your updated resume at sowmya.r@cielhr.com