Introduction
Do you want to join an organization that invests in you as a Lead Data Scientist? At HCA, you come first. HCA Healthcare has committed up to $300 million in programs to support our incredible team members over the course of three years.
What you will do:
• DATA INJECTION: Design and implement incoming data for feature engineering and machine learning
• FEATURE CREATION AND TRANSFORMATION: Create features with usable predictive power by using domain knowledge of healthcare data, statistics, and data science
• FEATURE EXTRACTION AND SELECTION: Extract features via cluster analysis, text analytics, principal components analysis and related methodologies to identify useful information without distorting original relationships or significant information in order to performance tune and promote scalability of the model
• EXPLORATORY DATA ANALYSIS: Conduct data exploration and analysis to identify relevant patterns, trends, and relationships; understand the data deeply to appropriately align the data features with the appropriate machine learning methodologies
• MODEL EXPERIMENTATION: Apply various statistical and machine learning techniques to develop, tune, and optimize predictive modeling to align with the use case requirements
• DELIVERY: Deploy code using best practices
• DOCUMENTATION: Document and present data science strategies and insights and solutions
• MONITORING: Build and maintain pipelines, code, and processes to monitor the proper functioning of enterprise grade data science products
• DATA AND OUTPUT QUALITY: Ensure high quality and integrity throughout the entire data science process
• PRODUCT DEVELOPMENT MINDSET: Collaborate with stakeholders, data scientists, data engineers, and product managers to build and deliver high-quality data products
What qualifications you will need:
• 6+ years of overall experience in various aspects of data science and machine learning (for Lead level)
• Expert experience in SQL and Python
• Experience with structured and unstructured data (ie. tabular, text, images, video, etc)
• Experience with SQL relational and non-relational databases
• Experience with data processing and ETL tools (e.g Apache Spark)
• Experience with data visualization and data monitoring tools
• Experience with modern software engineering practices (e.g. automated testing, continuous integration and continuous development)
• Experience with Docker
• Experience with cloud deployments in GCP or another cloud platform
• Experience with Agile development methods and tools (e.g. Azure Dev Ops)
• Excellent communication and collaboration skills, with the ability to work effectively across multiple teams and stakeholders
• Ability to tell a story using data and insights that drives action and change
• Domain knowledge of healthcare data preferred
• Expertise in healthcare protocols and formats such as HL7, FHIR, DICOM preferred
• Expert in the regulatory aspects of the healthcare domain preferred
• Experience in GCP platform preferred