Data Scientist – PU Framework and Feature Store

Location

Remote restrictions apply

See all remote locations

Salary Estimate

N/A

Seniority

N/A

Tech stacks

Data

Python

+24

Permanent role

16 hours ago

Apply now

CoAdvantage is building two interlocking analytic substrates: the productivity-unit (PU) framework that measures realized τ across operational functions, and the analytics feature store that serves production ML models across the company. Both Data Scientist roles work across both substrates. This role reports to the AI Experimentation Lead and works in close coordination with the data team, the Staff MLOps Engineer, and the Principal AI Architect.

This is a hands-on engineering-grade data science role. The Data Scientist writes production-grade Python, owns model code from notebook to production, and uses AI-assisted coding tools as a daily driver. The role is not deck-only.

The two Data Scientists collectively own the PU measurement layer and the model portfolio served from the feature store. Both roles are accountable for PU baselines and causal estimators on the measurement side, and for feature definitions and production models on the feature store side. Workload is allocated by the AI Experimentation Lead based on backlog priority and candidate strengths; neither role is scoped to a single domain. Active workstreams span MLR pricing redistribution, propensity to renew, churn, contact volume forecasting, staffing demand, payroll exception rates, and PU baselines for in-scope function.

Core Responsibilities

Model development: Build production ML models that serve from the analytics feature store. This includes problem framing with stakeholders, feature engineering, model selection, training, validation, calibration, and packaging for production. Models are owned end-to-end; there is no separate "ML engineer" handoff.
Feature definitions in the feature store: Author feature definitions, including source contracts, transformation logic, freshness requirements, and lineage metadata. The Data Scientist is accountable for the quality and correctness of every feature they introduce into the store.
Data team collaboration on feasibility: Work directly with the data team to validate feasibility of proposed features before they are committed to the backlog: source system availability, refresh cadence, data quality, governance and tenant-isolation constraints. The Data Scientist is expected to be in the data team's review channels and to push back on infeasible features early rather than late.
PU baselines and causal estimators: Both Data Scientists own portions of the PU measurement substrate: defining the productivity unit for in-scope functions, building the pipelines that compute c_r (c-sub-r) from operational data, and authoring the causal estimators that underwrite τ measurement — difference-in-differences, synthetic control, propensity matching, interrupted time-series. The Data Scientist is the analyst behind several of these readouts and is expected to be the methodological author of record on at least one PU baseline.
Model monitoring and reconciliation: Define and instrument the production monitoring for each model: drift, calibration, business-metric reconciliation. Carry the on-call rotation for model issues alongside MLOps.
Documentation and methodological transparency: Every model the Data Scientist ships carries a model card: assumptions, training data window, identification strategy, known failure modes, and the reconciliation plan. The bar is reproducibility from underlying data.

How We Work

AI-first coding - Claude Code, Copilot, or successor tools are the default development surface. Feature definitions, model code, evaluation scripts, and monitoring instrumentation are expected to be authored with agentic coding tools in the loop. Hand-coding without AI assistance is the exception, not the norm.
Hands-on with production code - The Data Scientist owns production code, not only research notebooks. Pull requests, code review, CI checks, and on-call all apply.
Pre-registered targets - No model goes to production without a written success criterion and a reconciliation plan. The AI Experimentation Lead signs both.
Methodological transparency - Identification strategies and validation choices are documented in writing and defended in review. "It performed well in cross-validation" is not sufficient.
You estimate - Every workstream returns with a timeline, a confidence interval, and the smallest version that could ship in two weeks.

Required Qualifications

Four or more years of experience as a Data Scientist or Applied Scientist building production models, not only research prototypes.
Strong Python and SQL. Comfortable authoring production-grade analysis code, model training pipelines, and feature transformations without an engineering intermediary.
Direct experience with at least one feature store — Feast, Databricks Feature Store, Tecton, Vertex Feature Store, or an internal equivalent — including authoring feature definitions and managing freshness.
Demonstrated experience taking at least two models to production and operating them through at least one retrain cycle.
Hands-on experience with AI-assisted coding tools (Claude Code, Copilot, Cursor, or equivalent) as a daily driver, with code commits or repositories to demonstrate the practice.
Working fluency with causal inference (difference-in-differences, synthetic control, propensity matching) AND at least one of: time-series forecasting, propensity modeling, constrained optimization. Both roles are expected to operate across PU measurement and feature-store modeling, so causal methods are not optional.
Written communication skills sufficient to produce model cards and stakeholder-facing readouts.

Preferred Qualifications

Prior experience in a PEO, HR outsourcing, insurance brokerage, BPO, or other labor-intensive services organization.
Direct exposure to pricing, underwriting, churn, or operational forecasting use cases.
Familiarity with cloud ML platforms (Azure ML, Databricks, Vertex AI).
Experience working under data governance constraints typical of regulated multi-tenant environments (HIPAA, PII, tenant isolation).

What Success Looks Like At 12 Months

At least two production models owned end-to-end, with documented model cards and active monitoring.
At least eight feature definitions contributed to the feature store, with the data team co-sign on lineage and freshness.
At least one PU baseline (c_r) published with audit trail and methodological documentation.
At least one causal readout co-authored with the AI Experimentation Lead that informed an executive-level tooling or pricing decision.
Established working pattern with the data team — feasibility reviews routine rather than ad hoc.

EEO

CoAdvantage is committed to providing equal employment opportunities to all employees and applicants without regard to race, color, religion, national origin, ancestry, citizenship status, age, sex (including pregnancy, childbirth, breast feeding and pregnancy-related medical conditions), gender, gender identity or expression, sexual orientation, marital status, uniform service member and veteran status, disability, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances.