Senior Software Engineer (big Data)

Year    India, India

Job Description


Overview Please note: We are a remote-first organization so you can work from anywhere in India. You may be required to travel to our Mumbai office based on business requirements or for company/team events. You will be a part of Cactus Labs which is the R&D Cell of Cactus Communications. Cactus Labs is a high impact cell that works to solve complex technical and business problems that help keep us strategically competitive in the industry. We are a multi-cultural team spread across multiple countries. We work in the domain of AI/ML especially with Text (NLP - Natural Language Processing), Language Understanding, Explainable AI, Big Data, AR/VR etc. Within Cactus Labs you will work with the Big Data team. This team manages Terabytes of data coming from different sources. We are re-orchestrating data pipelines to handle this data at scale and improve visibility and robustness. We operate across all the three Cloud Platforms and leverage the best of them. In this role, you will get to own development of features end to end. You will also get to work on cloud platform and learn to design distributed data processing systems to operate at scale. Job Responsibilities Build and maintain robust data processing pipelines at scale Collaborate with a team of Big Data Engineers, Big Data and Cloud Architects and Domain SMEs to drive the product ahead Follow best practices in building and optimize existing processes Stay up to date with the progress of in the domain since we work on cutting-edge technologies and are constantly trying new things out Build solutions for massive scale. This requires extensive benchmarking to pick the right approach Understand the data in and out, and make sense of it. You will at times need to draw conclusions and present it to the business users Be independent, self-driven and highly motivated. While you will have the best people to learn from and access to various courses or training materials, we expect you to take charge of your growth and learning. Qualifications and Prerequisites 3-6 years of relevant experience in Big Data preferable with PySpark Highly proficient in distributed computing and Big Data Ecosystem especially with Apache Spark Good understanding of data lake and their importance in a Big Data Ecosystem Being able to guide junior team members and review their code Experience of working in a Cloud Environment (AWS, GCP or Azure) You like to work without a lot of supervision or micromanagement. Above all, you get excited by data. You like to dive deep, mine patterns and draw conclusions. You believe in making data-driven decisions and helping the team and project benefit from them. Preferred skills: Familiarity with search engines like Elasticsearch and Bigdata warehouses systems like AWS Athena, Google Big Query etc Building data pipelines using Airflow Experience of working in AWS Cloud Environment Knowledge of NLP and ML Additional Information CACTUS is a culture-driven company powered by its people, their passion, and their inspiration. All Cactizens live by the culture and values that define us. We value people for their talent, personality, competency, and the ability to learn and grow. We create a work environment that allows people to thrive and show their best performance. We believe in meritocracy. We take pride in our diversity. We strive to embrace diverse voices and create an inclusive workplace. We encourage all Cactizens to talk openly about their ideas and opinions and provide feedback to anyone who is a part of CACTUS, regardless of designation, experience, or seniority. We also encourage them to place their trust and be open to differences in opinions and feedback. About CACTUS Cactus Communications is a science communication and technology company. We specialize in AI products and solutions that improve how research gets funded, published, communicated, and discovered. We offer editorial, translation, education, and training solutions for researchers strategic and tactical scientific content solutions to global life science organizations AI-powered scholarly publishing products for journals and researchers and solutions for science dissemination and engagement with peers, public, and policymakers for wider research outreach. We have offices in London, Princeton, Singapore, Beijing, Shanghai, Tokyo, Seoul, Bengaluru, Hyderabad, and Mumbai a global workforce of over 3,000 experts and customers from over 190 countries. Awards and Recognition Cactus Communications has consistently ranked among the top 20 on the global list of the Top 100 Companies for Telecommute Jobs since 2016. Recognised as \'Employers of the Future\' two years in a row in 2023 and 2022, in a study by LeadUp Universe, Fortune India and Work Universe Recognised as One of India\'s Top 100 best Workplaces for Women by Great Place To Work in 2022 Winner of \'Best Innovation Leveraging AI Services\' at AWS AI Conclave 2022 Recognized as one of the Best Companies for Millennials 2019 by Times Ascent and Learning & Organisation Development Roundtable Emerged as one of India\'s Top 10 Safe Workplaces for Women in a survey conducted by Rainmaker in 2019 Ranked #1 among India\'s Great Mid-Size Workplaces by Great Place to Work Institute in 2017

foundit

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD3175437
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    India, India
  • Education
    Not mentioned
  • Experience
    Year