Data Scientist (Remote)
Our client is looking for a Data Scientist to support the development and implementation of advanced data and AI strategies. The ideal candidate is passionate about data-driven innovation and eager to work with cutting-edge technologies in Machine Learning and Generative AI.
Key Responsibilities
- Develop and implement data strategies to support business objectives and innovation.
- Collaborate with business and technology teams to identify opportunities for data-driven improvements.
- Design, train, and validate Generative AI and Machine Learning models.
- Monitor and optimize NLP and Generative AI model performance to ensure accuracy and relevance.
- Evaluate and fine-tune pre-trained models to address specific business challenges.
- Ensure data privacy, security, and bias mitigation in AI models.
- Build prediction and recommendation models that provide actionable insights.
- Collaborate with data engineers to design efficient data pipelines.
- Maintain data inventories and dictionaries, ensuring data quality and consistency.
- Drive innovation by proposing improvements and new business solutions using Generative AI.
- Stay current with advances in AI, NLP, and emerging frameworks.
- Promote a data-driven culture within cross-functional teams.
Requirements
- Bachelor’s degree in a technology-related field.
- Solid experience with Machine Learning algorithms, from design to deployment and automation.
- Proven background developing Data Science solutions: optimization, classification, prediction, statistical analysis, and NLP.
- Hands-on experience with LLMs, RAG, and Generative AI (e.g., Amazon Bedrock).
- Strong proficiency in Python, statistics, and AI/ML frameworks.
- Advanced knowledge of relational (SQL Server, PostgreSQL) and non-relational databases.
- Experience with Databricks (Spark optimization, Delta Lake, MLflow).
- Familiarity with AWS services such as S3, Athena, Glue, SageMaker, and QuickSight.
- Experience implementing MLOps best practices.
- Advanced English (spoken and written).
Nice to Have
- Expertise in Big Data and advanced use of Databricks to accelerate AI and analytics initiatives.
- Published research or projects in NLP or Generative AI.