For companies
  • Hire developers
  • Hire designers
  • Hire marketers
  • Hire product managers
  • Hire project managers
  • Hire assistants
  • How Arc works
  • How much can you save?
  • Case studies
  • Pricing
    • Remote dev salary explorer
    • Freelance developer rate explorer
    • Job description templates
    • Interview questions
    • Remote work FAQs
    • Team bonding playbooks
    • Employer blog
For talent
  • Overview
  • Remote jobs
  • Remote companies
    • Resume builder and guide
    • Talent career blog
Sopra Steria
Sopra Steria

SBS - GenAI R&D Automation Testing Senior Software Quality Engineer - SBS - Paris

Location

Remote restrictions apply
See all remote locations

Salary Estimate

N/AIconOpenNewWindows

Seniority

Senior

Tech stacks

Testing
Database
Automation
+27

Permanent role
3 days ago
Apply now

Description du poste

As a GenAI QA Engineer, you will ensure the quality and reliability of our RAG-based AI agent platform. Your responsibilities include:

Design and implement automated testing frameworks for RAG pipelines, including:

  1. Vector database performance and accuracy testing

  2. Retrieval quality metrics and relevance scoring

  3. LLM response validation and hallucination detection

  4. End-to-end agent conversation flow testing

Develop specialized test suites for AI/ML components:

  1. Knowledge base ingestion and chunking strategies

  2. Embedding quality and semantic search accuracy

  3. Prompt injection and security vulnerability testing

  4. Multi-modal content handling (documents, tables, images)

Create automated evaluation frameworks for:

  1. Agent response accuracy and consistency

  2. Contextual understanding and reasoning capabilities

  3. Performance benchmarking across different LLMs

  4. A/B testing for prompt engineering optimization

Collaborate with AI engineers to:

  1. Define quality metrics for RAG architectures

  2. Establish ground truth datasets for evaluation

  3. Implement continuous monitoring for model drift

  4. Design test scenarios for edge cases and failure modes

Build testing infrastructure for:

  1. Multi-tenant agent deployments

  2. Knowledge base versioning and rollback testing

  3. API rate limiting and scalability testing

  4. Integration testing with customer systems

Ensure compliance and safety:

  1. Test for bias and fairness in AI responses

  2. Validate data privacy and security measures

  3. Implement guardrails testing for harmful content

  4. Document AI system limitations and failure modes

Develop comprehensive test strategies for RAG-based AI agents.

Create automated benchmarks for retrieval quality and response accuracy.

Design adversarial testing scenarios to identify system vulnerabilities.

Build dashboards for monitoring AI system performance in production.

Collaborate with customers to understand their AI agent requirements.

Contribute to AI safety and alignment best practices.

Qualifications

Required Skills:

Education: Bachelor's degree in Computer Science, Engineering, AI/ML, or related field.

Experience: 5+ years in software testing with at least 2 years focused on AI/ML systems.

AI/ML Testing Expertise:

  1. Experience testing LLM applications, chatbots, or conversational AI

  2. Understanding of RAG architectures and vector databases (Pinecone, Weaviate, Qdrant)

  3. Familiarity with embedding models and similarity search concepts

  4. Knowledge of prompt engineering and LLM evaluation metrics

Technical Skills:

  1. Proficiency in Python for test automation and AI/ML frameworks

  2. Experience with LLM frameworks (LangChain, LlamaIndex, Haystack)

  3. API testing for RESTful services and streaming endpoints

  4. Familiarity with ML testing tools (MLflow, Weights & Biases, Neptune)

Automation Frameworks:

  1. pytest, unittest for Python-based testing

  2. Experience with async testing for streaming responses

  3. Load testing tools for AI endpoints (Locust, K6)

  4. CI/CD integration with model deployment pipelines

Domain Knowledge:

  1. Understanding of NLP concepts and evaluation metrics (BLEU, ROUGE, BERTScore)

  2. Knowledge of information retrieval metrics (precision, recall, MRR)

  3. Familiarity with financial services use cases for AI agents

  4. Understanding of responsible AI principles

Preferred Qualifications:

  1. Experience with cloud AI services (AWS Bedrock, Azure OpenAI, Google Vertex AI)

  2. Knowledge of vector database optimization and indexing strategies

  3. Familiarity with fine-tuning and model evaluation workflows

  4. Experience with multilingual AI systems testing

  5. Understanding of regulatory requirements for AI in financial services (EU AI Act, GDPR)

  6. Contributions to open-source AI/ML testing frameworks

Informations complémentaires

Les avantages à nous rejoindre :

  • Un accord télétravail pour télétravailler jusqu'à 2 jours par semaine selon vos missions.
  • Un package avantages intéressants : une mutuelle, un CSE, des titres restaurants, un accord d'intéressement, des primes vacances.

About Sopra Steria

🔗Website
Visit company profileIconOpenNewWindows

Unlock all Arc benefits!

  • Browse remote jobs in one place
  • Land interviews more quickly
  • Get hands-on recruiter support
PRODUCTS
Arc

The remote career platform for talent

Codementor

Find a mentor to help you in real time

LINKS
About usPricingArc Careers - Hiring Now!Remote Junior JobsRemote jobsCareer Success StoriesTalent Career BlogArc Newsletter
JOBS BY EXPERTISE
Remote Front End Developer JobsRemote Back End Developer JobsRemote Full Stack Developer JobsRemote Mobile Developer JobsRemote Data Scientist JobsRemote Game Developer JobsRemote Data Engineer JobsRemote Programming JobsRemote Design JobsRemote Marketing JobsRemote Product Manager JobsRemote Project Manager JobsRemote Administrative Support Jobs
JOBS BY TECH STACKS
Remote AWS Developer JobsRemote Java Developer JobsRemote Javascript Developer JobsRemote Python Developer JobsRemote React Developer JobsRemote Shopify Developer JobsRemote SQL Developer JobsRemote Unity Developer JobsRemote Wordpress Developer JobsRemote Web Development JobsRemote Motion Graphic JobsRemote SEO JobsRemote AI Jobs
© Copyright 2025 Arc
Cookie PolicyPrivacy PolicyTerms of Service