For companies
  • Hire developers
  • Hire designers
  • Hire marketers
  • Hire product managers
  • Hire project managers
  • Hire assistants
  • How Arc works
  • How much can you save?
  • Case studies
  • Pricing
    • Remote dev salary explorer
    • Freelance developer rate explorer
    • Job description templates
    • Interview questions
    • Remote work FAQs
    • Team bonding playbooks
    • Employer blog
For talent
  • Overview
  • Remote jobs
  • Remote companies
    • Resume builder and guide
    • Talent career blog
Blueprint
Blueprint

AI/ML Engineer

Location

Remote restrictions apply
See all remote locations

Salary Estimate

N/AIconOpenNewWindows

Seniority

N/A

Tech stacks

Blueprint
AI
QA
+9

Visa

U.S. visa required

Permanent role
3 days ago
Apply now

About Blueprint

At Blueprint, we're on a mission to empower therapists with world-class tools so they can focus on what matters most—delivering exceptional mental health care.

Our AI assistant is purpose-built for therapists, automating the administrative tasks that slow them down and enabling them to operate at the top of their license. With Blueprint, therapists aren't just managing their work; they're supported by tools that understand the context of each client interaction. Compared to legacy software tools, Blueprint feels more like having the world's best executive assistant at your side.

Today, over 50,000 therapists are on Blueprint, leveraging our platform to enhance care for hundreds of thousands of clients. We've found strong product-market fit and are scaling rapidly to meet demand.

Our organization is very flat and our team is intentionally small and talent-dense. We like people who are truthseekers, creative, and passionate about improving mental health care.

We're a remote-first company (US and Canada only, for now) and come together in person a few times a year to connect, have fun, and help shape the future of mental health care.

About The Role

We're looking for an experienced AI/ML Engineer to take ownership of evaluation and quality across our AI systems. At Blueprint, AI isn't a bolt-on — it's the foundation of our product. We use LLMs to automate clinical documentation, deliver clinical insights, and reimagine how therapists work.

This role is about making sure those systems work reliably, safely, and well. You'll design the evaluation infrastructure that helps us measure what "good" looks like across subjective, human-centered workflows and build the tools to track, test, and improve model outputs over time.

You'll work closely with engineering, product, and clinical leaders to define quality in practical, therapist-facing terms and make sure we have the systems in place to deliver it consistently.

This is a highly cross-functional, high-impact role. Your work will directly shape what tens of thousands of therapists experience when they use our product every day.

What You'll Do

  • Design and build our end-to-end evaluation infrastructure: LLM-as-a-judge, human QA pipelines, offline scoring, and more
  • Define and implement application-specific quality metrics — not just accuracy, but tone, structure, clinical alignment, and more
  • Collaborate with product and clinical leads to turn subjective requirements into structured evaluation criteria
  • Monitor and analyze model performance across different therapist cohorts and workflows
  • Build tools and processes to capture in-the-wild feedback from clinicians and route it back into model and product improvement loops
  • Work closely with engineers to integrate eval into our CI, deployment, and iteration cycles
  • Help shape data labeling, prompt evaluation, experiment design, and prompt tuning frameworks

Who We're Looking For

You're a hands-on ML/AI practitioner who's passionate about building high-quality systems that actually get used — not just optimizing for benchmark scores. You've worked with LLMs in production at scale and know the hard part is making outputs reliable, human-aligned, and easy to evaluate. You're motivated by impact, comfortable with ambiguity, and thrive in early-stage, fast-paced environments.

_You might be a fit if:

_

  • You've built or owned evaluation infrastructure for LLMs or generative AI products
  • You have experience designing QA workflows, human-in-the-loop systems, or LLM-as-a-judge pipelines
  • You think in terms of feedback loops — and can turn fuzzy product goals into testable quality metrics
  • You write code, ship experiments, and are comfortable working across the stack to get the right signals flowing
  • You're excited about working closely with product, design, and domain experts to define and refine what "good" means in a real-world AI application

_Bonus if you have:

_

  • Experience in healthcare, mental health, or other high-trust environments
  • Familiarity with labeling, data QA, or prompt engineering at scale
  • A strong POV on eval tools, metrics, or best practices — and a willingness to invent new ones where needed

Benefits

  • Competitive salary and equity
  • 100% remote - no office, no commuting
  • Health, dental, and vision insurance, with 75% of your premium covered by Blueprint
  • Semi-annual team gatherings (in Chicago!)
  • Unlimited PTO
  • Opportunities to grow with the company and shape our product
  • Hardworking, mission-driven, friendly coworkers

Blueprint is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

About Blueprint

🔗Website
Visit company profileIconOpenNewWindows

Unlock all Arc benefits!

  • Browse remote jobs in one place
  • Land interviews more quickly
  • Get hands-on recruiter support
PRODUCTS
Arc

The remote career platform for talent

Codementor

Find a mentor to help you in real time

LINKS
About usPricingArc Careers - Hiring Now!Remote Junior JobsRemote jobsCareer Success StoriesTalent Career BlogArc Newsletter
JOBS BY EXPERTISE
Remote Front End Developer JobsRemote Back End Developer JobsRemote Full Stack Developer JobsRemote Mobile Developer JobsRemote Data Scientist JobsRemote Game Developer JobsRemote Data Engineer JobsRemote Programming JobsRemote Design JobsRemote Marketing JobsRemote Product Manager JobsRemote Project Manager JobsRemote Administrative Support Jobs
JOBS BY TECH STACKS
Remote AWS Developer JobsRemote Java Developer JobsRemote Javascript Developer JobsRemote Python Developer JobsRemote React Developer JobsRemote Shopify Developer JobsRemote SQL Developer JobsRemote Unity Developer JobsRemote Wordpress Developer JobsRemote Web Development JobsRemote Motion Graphic JobsRemote SEO JobsRemote AI Jobs
© Copyright 2025 Arc
Cookie PolicyPrivacy PolicyTerms of Service