Data Scientist
About Us:
At Codvo, we are committed to building scalable, future-ready data platforms that power business impact. We believe in a culture of innovation, collaboration, and growth, where engineers can experiment, learn, and thrive. Join us to be part of a team that solves complex data challenges with creativity and cutting-edge technology.
Role Summary
This role owns model development, the training pipeline, and the analytics backend, working in close coordination with the on-site Data Scientist: the on-site counterpart provides site context and validation feedback, while this offshore role implements model improvements, retraining logic, and drift detection.
Responsibilities
Model Development & Training
- Maintain and improve the physics-based simulation engine — 19 equipment families, 64+ fault signatures, first-principles governing equations
- Run model training pipelines — dataset generation, feature engineering, model fitting, hyperparameter tuning, MLflow experiment tracking
- Implement model retraining triggers — drift detection (PSI-based), accuracy degradation monitoring, scheduled recalibration
- Build and maintain the champion/challenger evaluation framework — shadow scoring, A/B testing, promotion guardrails
- Develop new fault signatures as customer feedback identifies gaps
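To illustrate the PSI-based drift detection mentioned above, here is a minimal sketch of a Population Stability Index check that could feed a retraining trigger. The function name, bin count, and thresholds are illustrative assumptions, not part of Codvo's actual stack:

```python
import numpy as np

def psi(expected, actual, bins=10):
    """Population Stability Index between a baseline sample and new data.

    Common rule of thumb (a convention, not a standard): PSI < 0.1 is
    stable, 0.1-0.25 moderate drift, > 0.25 significant drift.
    """
    # Derive bin edges from the baseline (expected) distribution
    edges = np.percentile(expected, np.linspace(0, 100, bins + 1))
    edges[0], edges[-1] = -np.inf, np.inf  # capture out-of-range values
    e_counts, _ = np.histogram(expected, bins=edges)
    a_counts, _ = np.histogram(actual, bins=edges)
    # Clip to avoid log(0) for empty bins
    e_pct = np.clip(e_counts / e_counts.sum(), 1e-6, None)
    a_pct = np.clip(a_counts / a_counts.sum(), 1e-6, None)
    return float(np.sum((a_pct - e_pct) * np.log(a_pct / e_pct)))
```

In a production pipeline, a score crossing the chosen threshold would typically enqueue a retraining job rather than retrain inline.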
Analytics & Calibration
- Implement probability calibration — Platt scaling, isotonic regression, ECE monitoring
- Build the adaptive threshold controller — feedback-driven alarm threshold adjustment based on false alarm rate and recall
- Develop the CMMS label linking pipeline — match work orders to predictions with confidence scoring
- Analyze prediction outcomes — precision, recall, and F1 broken down by equipment family, fault type, and site
- Produce the weekly and monthly accuracy reports
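As a sketch of the ECE monitoring named above, expected calibration error is the bin-weighted gap between predicted confidence and observed outcome rate. This minimal implementation assumes binary labels and equal-width bins; all names are illustrative:

```python
import numpy as np

def expected_calibration_error(probs, labels, n_bins=10):
    """ECE: weighted average |mean predicted prob - observed rate|
    over equal-width probability bins."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        # Last bin is closed on the right so probs == 1.0 are counted
        mask = (probs >= lo) & ((probs < hi) if hi < 1.0 else (probs <= hi))
        if mask.any():
            gap = abs(probs[mask].mean() - labels[mask].mean())
            ece += mask.mean() * gap  # weight by bin occupancy
    return ece
```

A rising ECE between recalibrations would motivate refitting the Platt scaling or isotonic regression layer.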
Feature Engineering & Data Quality
- Define and maintain feature sets for each equipment family — physics-informed features, rolling statistics, cross-tag correlations
- Monitor data quality metrics — null rates, stale timestamps, schema violations, sensor drift
- Build the healthy baseline update pipeline — daily computation of per-tag statistics from healthy operating data
- Implement the training data snapshot pipeline — versioned, reproducible dataset extraction with manifest tracking
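The rolling-statistics features mentioned above can be sketched with pandas time-based windows. Column names and the one-hour window are assumptions for illustration; the frame is assumed to carry a DatetimeIndex:

```python
import pandas as pd

def rolling_features(df, tag_cols, window="1h"):
    """Per-tag rolling statistics over a time-indexed sensor frame."""
    out = pd.DataFrame(index=df.index)
    for col in tag_cols:
        r = df[col].rolling(window)  # time-based window; needs DatetimeIndex
        out[f"{col}_mean"] = r.mean()
        out[f"{col}_std"] = r.std()
        out[f"{col}_min"] = r.min()
        out[f"{col}_max"] = r.max()
    return out
```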
Expected Background
- 4+ years in machine learning engineering or applied data science
- Strong Python skills — pandas, scikit-learn, XGBoost/LightGBM, MLflow
- Experience with time-series data, anomaly detection, or predictive maintenance modeling
- Understanding of model deployment patterns — model registry, versioning, A/B testing, canary deployments
- Experience with statistical process control, calibration, or reliability engineering is a plus