Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills :
Apache Spark
Good to have skills :
NA
Minimum
5
year(s) of experience is required
Educational Qualification :
15 years full time education
Summary: As a Data Engineer, you will design, develop, and maintain data solutions that facilitate data generation, collection, and processing. Your typical day will involve creating data pipelines, ensuring data quality, and implementing ETL processes to migrate and deploy data across various systems, contributing to the overall efficiency and effectiveness of data management within the organization. Roles & Responsibilities: -Develop and maintain data processing pipelines using Apache Spark (PySpark). -Write efficient and optimized Python code to handle large datasets. -Design and implement ETL workflows for ingestion, transformation, and storage of data. -Optimize Spark jobs for performance and scalability. -Work with big data platforms (HDFS, Hive, Delta Lake, etc.) and cloud storage (AWS S3, Azure Data Lake, GCP BigQuery). -Collaborate with data engineers, analysts, and data scientists to deliver reliable data solutions. -Implement unit testing, debugging, and monitoring of Spark applications. -Ensure data quality, security, and compliance in all developed solutions. -Document workflows, architecture, and code best practices Professional & Technical Skills: - Must To Have Skills: Proficiency in Apache Spark. -Work with big data platforms (HDFS, Hive, Delta Lake, etc.) and cloud storage (AWS S3, Azure Data Lake, GCP BigQuery). - Strong understanding of data pipeline architecture and design. - Experience with ETL processes and data integration techniques. - Familiarity with data quality frameworks and best practices. - Knowledge of cloud platforms and services related to data storage and processing. Additional Information: - The candidate should have minimum 7 years of experience in Apache Spark. - This position is based at our Bengaluru office. - A 15 years full time education is required.
15 years full time education
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.