Data Scientist

Location

Remote restrictions apply

See all remote locations

Salary Estimate

N/A

Seniority

N/A

Tech stacks

Data

Experimentation

+18

Visa

U.S. visa required

Permanent role

4 days ago

Apply now

Job Description

Senior Data Scientist | Remote

Product Experimentation & Evaluation (LLMs & AI)

Hiring on behalf of our client in the AI/Technology sector

We are seeking a senior-level Data Scientist to support our client in advancing their AI-powered products through robust experimentation and evaluation practices. This role is central to developing and optimizing large language model (LLM) applications, working at the intersection of product, engineering, and trust & safety teams.

You will lead end-to-end experimentation initiatives, define LLM evaluation frameworks, and drive high-impact product improvements. The ideal candidate has experience in startup-style environments as well as large-scale experimentation systems and is comfortable influencing both technical and non-technical stakeholders.

Key Responsibilities

Own experimentation processes: from hypothesis generation and metric design to experiment execution (A/B, multivariate, sequential testing) and actionable insights.
Develop and maintain evaluation frameworks for LLM-powered features, focusing on correctness, consistency, safety, hallucination detection, and bias/fairness.
Build predictive models and heuristics to enhance AI and NLP-based product experiences.
Collaborate with model engineers and prompt designers to explore prompt strategies, fine-tuning, model selection, and failure mode analysis.
Automate experiment pipelines: dashboards, instrumentation, monitoring, and alerting to ensure data integrity and rapid feedback loops.
Apply causal inference techniques and observational study methods when randomized experiments are infeasible.
Translate insights into product recommendations and influence decision-making across the product lifecycle.
Lead data initiatives in fast-paced, startup-like environments, with a strong focus on iteration speed, accuracy, and scalability.
Contribute to defining experimentation strategies at scale, supporting a culture of evidence-based product development.
Mentor junior team members and help shape best practices in experimentation and AI evaluation.

Requirements

Must-Have

8–12+ years of experience in Data Science or Machine Learning, with a strong focus on experiment design and product analytics.
Demonstrated ability to drive experimentation in both startup and scaled enterprise environments.
Experience leading cross-functional teams, setting strategies, and executing roadmaps.
Proficiency in statistical analysis, causal inference, and robust metric design.
Deep experience in LLMs / NLP / AI, including working with prompts, model behavior, and evaluation.
Strong programming skills in Python, solid SQL, and familiarity with building and deploying analytic or ML pipelines.
Excellent communication skills, with the ability to translate complex data findings into business or product outcomes.

Nice-to-Have

Experience with fine-tuning LLMs, using multiple model providers or APIs.
Hands-on experience with experiment platforms or internal tooling for model evaluation.
Familiarity with voice, ASR, or other multi-modal AI applications.

Working Terms

Must be available to work during US business hours, specifically until 6 p.m. ET (Eastern Time).
Candidates should have their own remote work setup, including necessary equipment and internet access.

About NicheHR Global

🔗Website

Visit company profile

Unlock all Arc benefits!

Browse remote jobs in one place
Land interviews more quickly
Get hands-on recruiter support

PRODUCTS

Arc

The remote career platform for talent

Codementor

Find a mentor to help you in real time

LINKS

About us Pricing Arc Careers - Hiring Now!Remote Junior Jobs Remote jobs Career Success Stories Talent Career Blog Arc Newsletter

JOBS BY EXPERTISE

Remote Front End Developer Jobs Remote Back End Developer Jobs Remote Full Stack Developer Jobs Remote Mobile Developer Jobs Remote Data Scientist Jobs Remote Game Developer Jobs Remote Data Engineer Jobs Remote Programming Jobs Remote Design Jobs Remote Marketing Jobs Remote Product Manager Jobs Remote Project Manager Jobs Remote Administrative Support Jobs

JOBS BY TECH STACKS

Remote AWS Developer Jobs Remote Java Developer Jobs Remote Javascript Developer Jobs Remote Python Developer Jobs Remote React Developer Jobs Remote Shopify Developer Jobs Remote SQL Developer Jobs Remote Unity Developer Jobs Remote Wordpress Developer Jobs Remote Web Development Jobs Remote Motion Graphic Jobs Remote SEO Jobs Remote AI Jobs

Cookie Policy Privacy Policy Terms of Service