Data Collector

Year    MH, IN, India

Job Description

Data Scraping, MongoDB, Solr / ElasticSearch

We are seeking a skilled Python Developer with strong experience in web/data scraping and working knowledge of MongoDB, Solr, and/or ElasticSearch. You will be responsible for developing, maintaining, and optimizing scalable scraping scripts to collect structured and unstructured data, efficiently manage it in MongoDB, and index it for search and retrieval using Solr or ElasticSearch.

  • Design and develop robust web scraping solutions using Python (e.g., Scrapy, BeautifulSoup, Selenium).
  • Extract and process large volumes of data from websites, APIs, and other digital sources.
  • Ensure scraping mechanisms are efficient, resilient to site changes, and compliant with best practices.
  • Store, retrieve, and manage scraped data efficiently in MongoDB databases.
  • Index, manage, and optimize data search capabilities using Solr or ElasticSearch.
  • Build data validation, cleaning, and transformation pipelines.
  • Handle challenges like CAPTCHA solving, IP blocking, and dynamic content rendering.
  • Monitor scraping jobs and troubleshoot errors and bottlenecks.
  • Optimize scraping speed, search indexing, storage efficiency, and system scalability.
  • Collaborate with product managers to define data requirements.

Job Type: Full-time

Pay: ₹20,000.00 - ₹25,000.00 per month

Application Question(s):

How soon can you join?

Location:

Mumbai, Maharashtra (Required)

Work Location: In person


Job Detail

  • Job Id
    JD4076354
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    MH, IN, India
  • Education
    Not mentioned
  • Experience
    Year