Alignerr

Software Engineer – AI Model Evaluator

Location

Remote restrictions apply

Salary Estimate

N/A

Seniority

N/A

Tech stacks

Software Development
AI
Project management

Contract role
a day ago

About The Role

What if your years of engineering experience could directly influence how the world's most advanced AI systems write and reason about code? We're looking for experienced software engineers to evaluate frontier AI models — hunting down bugs, exposing failure modes, and helping ensure that AI-generated code actually holds up under real-world scrutiny.

This is a fully remote, flexible contract role built for engineers who love digging into hard problems. You set your own schedule, work across cutting-edge projects, and make a tangible impact on the AI tools that millions of developers will rely on.

  • Organization: Alignerr
  • Type: Hourly Contract
  • Location: Remote
  • Commitment: 10–40 hours/week

What You'll Do

  • Evaluate the performance of frontier language models on complex, real-world software engineering tasks
  • Identify bugs, logical errors, hallucinations, and reliability issues in AI-generated code and reasoning
  • Design and review prompts, test cases, and evaluation scenarios that stress-test advanced coding workflows
  • Provide precise, well-reasoned written feedback explaining model strengths, weaknesses, and edge cases
  • Work across multiple programming languages and codebases to assess generalization, correctness, and robustness
  • Think critically about model behavior — not just whether code runs, but whether it's right

Who You Are

  • 3+ years of professional software engineering experience
  • Strong proficiency in at least one of: TypeScript, Ruby, Java, or C++
  • Sharp debugger — you spot non-obvious issues and can articulate exactly why something is broken
  • Excellent written and spoken English; you communicate technical findings clearly and precisely
  • Comfortable reasoning about complex systems, edge cases, and unexpected failure modes
  • Familiarity with modern development tooling — Git, CLI workflows, testing frameworks, and similar
  • You critically evaluate outputs rather than taking them at face value

Nice to Have

  • Experience across multiple programming languages or paradigms
  • Background in QA, code review, or software reliability engineering
  • Familiarity with AI or LLM tools and how they generate code
  • Interest in AI safety, alignment, or model evaluation research

Why Join Us

  • Work on cutting-edge AI projects alongside leading research labs
  • Fully remote and flexible — work when and where it suits you
  • Freelance autonomy with the structure of meaningful, high-impact technical work
  • Make a direct, tangible impact on how AI writes, reasons about, and understands code
  • Potential for ongoing work and contract extension as new projects launch
