OCR / Machine Learning Engineer (Document AI) - Fileme

Location

Remote anywhere

Salary

US$50K - 120K

Min. experience

5+ years

Required skills

Full-time role

Posted 3 hours ago

Apply now

Actively recruiting / 11 applicants

About the role
FIleme is building an enterprise-grade document extraction engine for receipts, invoices, remittances, bank statements, and financial documents. We’re developing a multi-stage OCR/ML pipeline inspired by platforms like Dext Prepare — with a focus on accuracy, automation, and seamless integration with accounting ecosystems.

We’re looking for an OCR / Machine Learning Engineer to join on a contract or part-time basis to help build and scale our next-generation extraction engine. You will design and implement models, pipelines, and cloud services that turn messy, real-world financial documents into structured financial data.

This role is ideal for someone who thrives on technical autonomy, enjoys solving complex document-AI problems, and has prior experience with financial OCR, document intelligence, or applied deep learning.

Responsibilities

Build extraction pipelines for receipts, invoices, remittance advices, ATM slips, bank statements, and supporting financial documents.
Implement preprocessing (deskewing, denoising, segmentation, contour detection, multi-language text normalization).
Integrate and evaluate OCR engines (open-source + cloud APIs such as Tesseract, PaddleOCR, Datalabs/Marker, AWS Textract).
Train or fine-tune ML/DL models (e.g., classification, key-value extraction, layout analysis, entity detection).
Evaluate accuracy across document types and propose improvements.
Build cloud-based inference endpoints (AWS preferred).
Work with the engineering team to integrate extraction output into the ExpenseHub staging pipeline.
Create validation and error-handling logic for ambiguous outputs.
Implement analytics and reporting on extraction accuracy, throughput, and edge-case performance.

Qualifications

5+ years of experience in OCR, Computer Vision, Machine Learning, or Document AI.
Strong Python engineering experience including async services, model lifecycle, and API integration.
Hands-on experience with deep learning frameworks (PyTorch preferred).
Experience with structured extraction of financial documents is a strong advantage.
Understanding of document layouts, table detection, and field extraction.
Experience deploying ML systems to AWS (Lambda, S3, EC2, ECS, API Gateway).
Strong understanding of accuracy metrics, error profiles, and evaluation frameworks for OCR.
Ability to work independently and deliver high-quality, production-ready code.

Nice to have

Experience with Xero, QuickBooks, or accounting/fintech platforms.
Experience with multi-tenant SaaS architectures.
Experience with JSON schema versioning, validation frameworks, and ETL pipelines.
Familiarity with Datalabs, Dext Prepare, Rossum, Veryfi, or Glean.