Arc Exclusive

OCR / Machine Learning Engineer (Document AI) - Fileme

Location

Remote anywhere

Salary

US$50K - 120K

Min. experience

5+ years

Required skills

Python, NLP, Machine Learning, Deep Learning, Computer Vision, AI, TensorFlow, Data Science, OpenCV, Data Engineering, FastAPI, JSON

Full-time role
Posted 3 hours ago
Actively recruiting / 11 applicants

About the role
Fileme is building an enterprise-grade document extraction engine for receipts, invoices, remittances, bank statements, and other financial documents. We’re developing a multi-stage OCR/ML pipeline inspired by platforms like Dext Prepare, with a focus on accuracy, automation, and seamless integration with accounting ecosystems.

We’re looking for an OCR / Machine Learning Engineer to join on a contract or part-time basis to help build and scale our next-generation extraction engine. You will design and implement models, pipelines, and cloud services that turn messy, real-world financial documents into structured financial data.

This role is ideal for someone who thrives on technical autonomy, enjoys solving complex document-AI problems, and has prior experience with financial OCR, document intelligence, or applied deep learning.

Responsibilities

  • Build extraction pipelines for receipts, invoices, remittance advices, ATM slips, bank statements, and supporting financial documents.
  • Implement preprocessing (deskewing, denoising, segmentation, contour detection, multi-language text normalization).
  • Integrate and evaluate OCR engines, both open-source tools and cloud APIs (e.g., Tesseract, PaddleOCR, Datalabs/Marker, AWS Textract).
  • Train or fine-tune ML/DL models (e.g., classification, key-value extraction, layout analysis, entity detection).
  • Evaluate accuracy across document types and propose improvements.
  • Build cloud-based inference endpoints (AWS preferred).
  • Work with the engineering team to integrate extraction output into the ExpenseHub staging pipeline.
  • Create validation and error-handling logic for ambiguous outputs.
  • Implement analytics and reporting on extraction accuracy, throughput, and edge-case performance.

Qualifications

  • 5+ years of experience in OCR, Computer Vision, Machine Learning, or Document AI.
  • Strong Python engineering experience including async services, model lifecycle, and API integration.
  • Hands-on experience with deep learning frameworks (PyTorch preferred).
  • Experience with structured extraction of financial documents is a strong advantage.
  • Understanding of document layouts, table detection, and field extraction.
  • Experience deploying ML systems to AWS (Lambda, S3, EC2, ECS, API Gateway).
  • Strong understanding of accuracy metrics, error profiles, and evaluation frameworks for OCR.
  • Ability to work independently and deliver high-quality, production-ready code.

Nice to have

  • Experience with Xero, QuickBooks, or accounting/fintech platforms.
  • Experience with multi-tenant SaaS architectures.
  • Experience with JSON schema versioning, validation frameworks, and ETL pipelines.
  • Familiarity with Datalabs, Dext Prepare, Rossum, Veryfi, or Glean.
