About The Role
What if your engineering instincts could directly shape how AI writes code for millions of developers around the world? We're looking for experienced software engineers to critically evaluate AI-generated code — hunting down bugs, exposing failure modes, and helping make the next generation of AI systems genuinely reliable.
This is a fully remote, flexible contract role built for engineers who love digging into hard problems and aren't satisfied until they find what's broken.
- Organization: Alignerr
- Type: Hourly Contract
- Location: Remote
- Commitment: 10–40 hours/week
What You'll Do
- Evaluate AI-generated code across complex software engineering tasks — assessing correctness, logic, and reliability
- Hunt for bugs, hallucinations, edge cases, and subtle failure modes that others might miss
- Design and review prompts, test cases, and evaluation scenarios that push AI models to their limits
- Write clear, precise feedback that explains what a model got right, what it got wrong, and why it matters
- Work across multiple languages and codebases to assess how well AI generalizes to different contexts
Who You Are
- 3+ years of professional software engineering experience
- Strong proficiency in at least one of TypeScript, Ruby, Java, or C++
- A natural debugger — you reason carefully about complex systems and notice what doesn't add up
- Excellent written English — you can articulate technical observations clearly and precisely
- Comfortable with modern development tooling: Git, CLI workflows, testing frameworks
- You critically evaluate code rather than simply run it — you care about why something works or doesn't
Nice to Have
- Familiarity with AI or LLM tools and evaluation workflows
- Experience across multiple programming languages or paradigms
- Background in software quality assurance, code review, or technical writing
- Prior exposure to prompt engineering or AI model evaluation
Why Join Us
- Work on frontier AI projects alongside leading research labs
- Fully remote and flexible — work when and where it suits you
- Freelance autonomy with the structure of meaningful, task-based engineering work
- Make a direct, tangible impact on how AI understands and generates software
- Potential for ongoing work and contract extension as new projects launch