Type of contract: Independent Contractor
About us
Gold Media Tech is a trusted staff augmentation partner for CTOs, Lead Designers, and CEOs in the US who are looking to build remote engineering and design teams. Our team of vetted engineers, designers, product managers can help you build.
About our client
Our client is a healthcare technology company building the infrastructure that powers smarter care navigation. Their products enable health plans, TPAs, and benefits administrators to provide members with real-time access to accurate provider information, cost estimates, and personalized care guidance.
They are a post-Series A team focused on replacing legacy data vendors with modern, API-first solutions, operating in a fast-paced, execution-driven environment.
Role Summary
Our client is building a first-party data platform designed to ingest healthcare data from multiple heterogeneous sources, including carrier FHIR feeds, web-scraped portals, client-delivered files, claims, and eligibility datasets.
They are looking for a Senior Data Engineer who will act as a primary builder, working closely with the Lead Data Engineer to design ingestion systems, normalize complex datasets, and contribute to scalable data models that power downstream APIs and products.
Responsibilities
- Build and maintain end-to-end data pipelines, from ingestion to normalized, query-ready datasets within AWS data lake environments (S3, Athena, Glue, EMR)
- Develop and maintain FHIR-based integrations with carrier data feeds, handling variability across payers and resolving large-scale data quality issues
- Design and operate production-grade web scrapers, including anti-bot mitigation, change detection, and failure alerting mechanism
- Implement ingestion workflows for structured and unstructured client data (CSV, Excel, flat files, proprietary formats) with SLA-driven batch processing
- Contribute to canonical data models, including schema design, entity resolution, deduplication, and change data capture
- Build and enforce data quality monitoring systems, including coverage metrics, freshness SLAs, and anomaly detection
- Collaborate with product and engineering teams to expose data via APIs and integrate with downstream systems
- Develop infrastructure-as-code solutions to provision and manage data infrastructure
- Ensure compliance with data protection standards such as HIPAA and SOC 2
Requirements
- B2/C1 english level
- 5–8 years of experience as a Data Engineer or Software Engineer with a strong focus on data systems
- Experience with FHIR R4 and healthcare data standards (HL7, X12, CMS)
- Proven experience building data ingestion pipelines and working with large-scale datasets
- Strong expertise in AWS data services (S3, Athena, Glue, EMR)
- Advanced proficiency in Python and data-related libraries and frameworks
- Experience with workflow orchestration tools such as Airflow
- Hands-on experience with web scraping at scale, including handling CAPTCHAs, rate limiting, and dynamic content
- Strong experience in data modeling, normalization, and entity resolution
Preferred Qualifications
- Familiarity with healthcare provider datasets (NPI, credentialing, network data)
- Experience with infrastructure-as-code tools such as Pulumi, Terraform, or CloudFormation
Please note: you will be contracted by Gold Media Tech