Dice is the leading career destination for tech experts at every stage of their careers. Our client, HYR Global Source Inc, is seeking the following. Apply via Dice today!
Job Title: Data Scientist (Healthcare)
Location: 100% Remote
Visa: H1B Transfer, EAD/W2
Interview Process: Recruiter screen (skills & domain fit)
Technical deep dive (SQL, modeling, case study)
Practical exercise (notebook or take-home)
Stakeholder panel (communication & healthcare use-case discussion
About the Role: We're looking for a hands-on Data Scientist to solve real-world business problems in healthcare/health insurance using machine learning and modern cloud tooling. You'll own the full lifecycle-from problem framing and data wrangling to modeling, deployment support, and stakeholder storytelling.
Top Skills: Python/R, SQL, Spark/Databricks, Azure ML or Vertex AI, MLflow, Git/GitHub, Jupyter/Databricks notebooks, Power BI/Tableau, Azure/Google Cloud Platform services.
What You'll Do:
- Partner with business/clinical stakeholders to define measurable use cases (e.g., risk & cost forecasting, readmission/LOS prediction, member churn, fraud/waste/abuse).
- Explore, cleanse, and engineer tabular data from claims, eligibility, EMR, utilization, and care-management sources.
- Build and evaluate models using regression, classification, time series, clustering, trees/GBMs; pilot deep-learning where additive value.
- Productionize with data & MLOps teams: feature pipelines (Spark/Databricks), model packaging, monitoring, drift/decay, and A/B testing.
- Write clear analyses, dashboards, and exec-ready presentations; translate findings into actions and ROI.
- Ensure privacy/compliance (HIPAA/PHI), data governance, and reproducible research (git, notebooks, experiment tracking).
- Contribute to LLM initiatives: prompt engineering, RAG, basic fine-tuning, and experiments with agentic AI frameworks for workflow automation.
Required Qualifications
- 6 8 years of hands-on data science delivering business impact (healthcare/health insurance experience is a big plus).
- Strong tabular data handling/manipulation skills.
- Solid SQL and proficiency in at least one statistical programming language (Python or R).
- Working knowledge of MS Office (Excel/PowerPoint/Access) for quick analyses and stakeholder comms.
- Experience with Big Data tools: HDFS, Hive, Spark, MapR-DB (or equivalent).
- Practical experience with statistical & ML techniques: linear/logistic regression, time series, clustering, decision trees, tree ensembles (XGBoost/LightGBM).
- Cloud & ML platforms: Databricks, Azure ML and/or Google Vertex AI (pipelines, model registry, deployment workflows).
- Exposure to prompt engineering, LLM fine-tuning (LoRA/PEFT or platform-native), and agentic AI concepts.
- Clear, concise communicator; able to convert ambiguity into structured analysis and decisions.
- BS/MS in Computer Science, Statistics, Applied Math, Engineering, or related field (or equivalent experience).