Part Time Python Developer

Year    UP, IN, India

Job Description

Job Title:

Part-Time Python OCR Developer (S3 Integration + Database Automation)

Project Overview:


Photon Software is implementing an

automated OCR engine

within the Vendor Management System (VMS) to extract key data from

Purchase Order and Invoice PDFs

stored in AWS S3. The process will run

24x7 as a scheduled job

, automatically downloading new files, extracting text and data, persisting the results into a database, and archiving processed PDFs.

Use Case Summary:



Monitor a specific S3 folder every 30 minutes for new PDF uploads. Automatically

download new PDFs

and perform

OCR extraction

(e.g., PO number, vendor details, invoice amount, date). Persist extracted data into a

MySQL database

using Python scripts. After successful extraction,

move PDFs

to an "Archive" folder in S3. Continue this cycle continuously (24x7) using a

scheduler

(e.g., cron job, APScheduler, AWS Lambda, or EC2 background service).

Responsibilities:



Develop and deploy Python-based OCR pipeline (Tesseract / EasyOCR). Integrate with AWS S3 for file read/write operations. Implement data persistence logic (MySQL insert/update). Design a scheduling mechanism for 30-minute job execution. Maintain error logs, exception handling, and retry mechanisms. Optimize performance for continuous 24x7 operations.

Required Skills:



Strong Python (3.x) programming experience. Hands-on with

AWS S3 SDK (boto3)

and

MySQL (PyMySQL / SQLAlchemy)

. OCR tools such as

Tesseract, EasyOCR, or PaddleOCR

. Experience in

PDF processing (pdfplumber, PyMuPDF/fitz)

. Knowledge of

job schedulers

(cron, APScheduler, Celery, or AWS Lambda scheduling). Basic understanding of

logging, error handling, and multithreading

.

Nice to Have:



Experience handling financial documents (POs, invoices). Familiarity with OCR accuracy improvement and confidence scoring. AWS Lambda or Docker deployment knowledge.

Engagement Details:



Part-time role (approx. 1 hour/day)

for ongoing OCR automation maintenance and enhancement.

Remote / flexible timing

with weekly progress updates.

Duration:

Long-term (initial 3-month contract, extendable).

How to Apply:


Send your profile or GitHub samples demonstrating similar OCR or AWS automation work to

shivbratmishra@photonsoftwares.com

. Include "Python OCR - S3 Automation" in the subject line.

Job Type: Part-time

Pay: ₹5,000.00 - ₹6,000.00 per month

Expected hours: 10 per week

Work Location: In person

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD4541959
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Part Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    UP, IN, India
  • Education
    Not mentioned
  • Experience
    Year