For companies
  • Hire developers
  • Hire designers
  • Hire marketers
  • Hire product managers
  • Hire project managers
  • Hire assistants
  • How Arc works
  • How much can you save?
  • Case studies
  • Pricing
    • Remote dev salary explorer
    • Freelance developer rate explorer
    • Job description templates
    • Interview questions
    • Remote work FAQs
    • Team bonding playbooks
    • Employer blog
For talent
  • Overview
  • Remote jobs
  • Remote companies
    • Resume builder and guide
    • Talent career blog
NicheHR Global
NicheHR Global

Data Scientist

Location

Remote restrictions apply
See all remote locations

Salary Estimate

N/AIconOpenNewWindows

Seniority

N/A

Tech stacks

Data
Experimentation
AI
+18

Visa

U.S. visa required

Permanent role
4 days ago
Apply now

Job Description

Senior Data Scientist | Remote

Product Experimentation & Evaluation (LLMs & AI)

Hiring on behalf of our client in the AI/Technology sector

We are seeking a senior-level Data Scientist to support our client in advancing their AI-powered products through robust experimentation and evaluation practices. This role is central to developing and optimizing large language model (LLM) applications, working at the intersection of product, engineering, and trust & safety teams.

You will lead end-to-end experimentation initiatives, define LLM evaluation frameworks, and drive high-impact product improvements. The ideal candidate has experience in startup-style environments as well as large-scale experimentation systems and is comfortable influencing both technical and non-technical stakeholders.

Key Responsibilities

  • Own experimentation processes: from hypothesis generation and metric design to experiment execution (A/B, multivariate, sequential testing) and actionable insights.
  • Develop and maintain evaluation frameworks for LLM-powered features, focusing on correctness, consistency, safety, hallucination detection, and bias/fairness.
  • Build predictive models and heuristics to enhance AI and NLP-based product experiences.
  • Collaborate with model engineers and prompt designers to explore prompt strategies, fine-tuning, model selection, and failure mode analysis.
  • Automate experiment pipelines: dashboards, instrumentation, monitoring, and alerting to ensure data integrity and rapid feedback loops.
  • Apply causal inference techniques and observational study methods when randomized experiments are infeasible.
  • Translate insights into product recommendations and influence decision-making across the product lifecycle.
  • Lead data initiatives in fast-paced, startup-like environments, with a strong focus on iteration speed, accuracy, and scalability.
  • Contribute to defining experimentation strategies at scale, supporting a culture of evidence-based product development.
  • Mentor junior team members and help shape best practices in experimentation and AI evaluation.

Requirements

Must-Have

  • 8–12+ years of experience in Data Science or Machine Learning, with a strong focus on experiment design and product analytics.
  • Demonstrated ability to drive experimentation in both startup and scaled enterprise environments.
  • Experience leading cross-functional teams, setting strategies, and executing roadmaps.
  • Proficiency in statistical analysis, causal inference, and robust metric design.
  • Deep experience in LLMs / NLP / AI, including working with prompts, model behavior, and evaluation.
  • Strong programming skills in Python, solid SQL, and familiarity with building and deploying analytic or ML pipelines.
  • Excellent communication skills, with the ability to translate complex data findings into business or product outcomes.

Nice-to-Have

  • Experience with fine-tuning LLMs, using multiple model providers or APIs.
  • Hands-on experience with experiment platforms or internal tooling for model evaluation.
  • Familiarity with voice, ASR, or other multi-modal AI applications.

Working Terms

  • Must be available to work during US business hours, specifically until 6 p.m. ET (Eastern Time).
  • Candidates should have their own remote work setup, including necessary equipment and internet access.

About NicheHR Global

🔗Website
Visit company profileIconOpenNewWindows

Unlock all Arc benefits!

  • Browse remote jobs in one place
  • Land interviews more quickly
  • Get hands-on recruiter support
PRODUCTS
Arc

The remote career platform for talent

Codementor

Find a mentor to help you in real time

LINKS
About usPricingArc Careers - Hiring Now!Remote Junior JobsRemote jobsCareer Success StoriesTalent Career BlogArc Newsletter
JOBS BY EXPERTISE
Remote Front End Developer JobsRemote Back End Developer JobsRemote Full Stack Developer JobsRemote Mobile Developer JobsRemote Data Scientist JobsRemote Game Developer JobsRemote Data Engineer JobsRemote Programming JobsRemote Design JobsRemote Marketing JobsRemote Product Manager JobsRemote Project Manager JobsRemote Administrative Support Jobs
JOBS BY TECH STACKS
Remote AWS Developer JobsRemote Java Developer JobsRemote Javascript Developer JobsRemote Python Developer JobsRemote React Developer JobsRemote Shopify Developer JobsRemote SQL Developer JobsRemote Unity Developer JobsRemote Wordpress Developer JobsRemote Web Development JobsRemote Motion Graphic JobsRemote SEO JobsRemote AI Jobs
© Copyright 2025 Arc
Cookie PolicyPrivacy PolicyTerms of Service