About The Role
What if your years of software engineering experience could directly shape how AI writes code for millions of developers around the world? We're looking for experienced engineers to put frontier AI models through their paces — hunting down bugs, exposing hallucinations, and delivering the kind of sharp, expert feedback that makes these systems genuinely better.
This is a fully remote, flexible contract role built for engineers who love to think critically, debug deeply, and go beyond the surface of how code behaves.
- Organization: Alignerr
- Type: Hourly Contract
- Location: Remote
- Commitment: 10–40 hours/week
What You'll Do
- Evaluate frontier AI language models on complex, real-world software engineering tasks
- Identify bugs, logical errors, hallucinations, and reliability issues in AI-generated code
- Design prompts, test cases, and evaluation scenarios that probe the limits of model behavior
- Provide precise, well-reasoned written feedback on model strengths, weaknesses, and failure modes
- Work across multiple programming languages and codebases to assess generalization and correctness
- Think like an adversary — go beyond obvious inputs to surface non-obvious edge cases
Who You Are
- 3+ years of professional software engineering experience
- Strong proficiency in at least one of: TypeScript, Ruby, Java, or C++
- Excellent written and spoken English — you communicate technical ideas clearly and precisely
- Demonstrated ability to reason about complex systems and debug non-obvious issues
- Familiar with modern development tooling — Git, CLI workflows, testing frameworks, and IDEs
- Able to critically evaluate model behavior, not just consume model outputs
Nice to Have
- Experience across multiple programming languages or paradigms
- Background in QA, code review, or technical writing
- Prior exposure to AI/LLM tooling or prompt engineering
- Familiarity with software reliability, correctness proofs, or formal testing methods
Why Join Us
- Work on cutting-edge AI projects alongside leading research labs
- Fully remote and flexible — work when and where it suits you
- Freelance autonomy with the structure of meaningful, technically challenging work
- Make a direct, tangible impact on how AI understands and generates code at scale
- Potential for ongoing work and contract extension as new projects launch