For companies
  • Hire developers
  • Hire designers
  • Hire marketers
  • Hire product managers
  • Hire project managers
  • Hire assistants
  • How Arc works
  • How much can you save?
  • Case studies
  • Pricing
    • Remote dev salary explorer
    • Freelance developer rate explorer
    • Job description templates
    • Interview questions
    • Remote work FAQs
    • Team bonding playbooks
    • Employer blog
For talent
  • Overview
  • Remote jobs
  • Remote companies
    • Resume builder and guide
    • Talent career blog
Careerflow.ai
Careerflow.ai

AI/ML Software Engineer (RL Environments)

Location

Remote anywhere

Salary Estimate

N/AIconOpenNewWindows

Seniority

N/A

Tech stacks

Machine learning
Python
Project management
+8

Permanent role
2 days ago
Apply now

We're seeking experienced Machine Learning Engineers and Software Engineers with ML experience to design and build high-quality RL training environments for LLM agents. As an RL Environment Engineer, you'll create diverse machine learning tasks that challenge and improve language models, working with minimal supervision to deliver consistent, quality outputs.

What You'll Do

  • Design and build tasks for machine learning domains that target specific language models and difficulty distributions
  • Iterate rapidly on task designs based on customer feedback, with 24-hour turnaround times
  • Create diverse, challenging scenarios that test language model capabilities and expose their limitations
  • Hit the ground running with minimal onboarding time

What We're Looking For

  • Strong machine learning background through coursework, previous work experience, or personal projects
  • Python fluency: you write clean, efficient Python code regularly
  • Heavy LLM user who understands current model capabilities and failure modes through daily hands-on experience
  • Self-directed and creative. You can generate novel ML task ideas in your domain without constant guidance
  • High responsibility and integrity. You deliver quality work consistently and meet deadlines
  • Availability overlap with PST 9am-5pm (minimum 3 hours required)
  • Top Tier Coding Skills: Must be from Tier 1 college (Top IITs, NITs, BITS, etc.) or have demonstrable coding skills via competitive programming competition wins or other industry experiences.

Work Details

  • Location: Remote
  • Type: Contractor
  • Time Commitment: 40 hours a week. Must have at least 3 hours of overlap with PST business hours (9am-5pm)

Hiring Process:

  1. Screening
  2. Hacker rank assessment
  3. 1 Week paid task
  4. Full time

About Careerflow.ai

🔗Website
Visit company profileIconOpenNewWindows

Unlock all Arc benefits!

  • Browse remote jobs in one place
  • Land interviews more quickly
  • Get hands-on recruiter support
PRODUCTS
Arc

The remote career platform for talent

Codementor

Find a mentor to help you in real time

LINKS
About usPricingArc Careers - Hiring Now!Remote Junior JobsRemote jobsCareer Success StoriesTalent Career BlogArc Newsletter
JOBS BY EXPERTISE
Remote Front End Developer JobsRemote Back End Developer JobsRemote Full Stack Developer JobsRemote Mobile Developer JobsRemote Data Scientist JobsRemote Game Developer JobsRemote Data Engineer JobsRemote Programming JobsRemote Design JobsRemote Marketing JobsRemote Product Manager JobsRemote Project Manager JobsRemote Administrative Support Jobs
JOBS BY TECH STACKS
Remote AWS Developer JobsRemote Java Developer JobsRemote Javascript Developer JobsRemote Python Developer JobsRemote React Developer JobsRemote Shopify Developer JobsRemote SQL Developer JobsRemote Unity Developer JobsRemote Wordpress Developer JobsRemote Web Development JobsRemote Motion Graphic JobsRemote SEO JobsRemote AI Jobs
© Copyright 2025 Arc
Cookie PolicyPrivacy PolicyTerms of Service