Job Summary
We are seeking a Data Scientist specializing in Large Language Models (LLMs) to develop and optimize ML pipelines, work with recommendation systems, and apply advanced AI/ML methodologies. The ideal candidate will have experience in feature engineering, data modeling, and cloud-based ML deployments while working with large datasets.
Key Responsibilities
- Experience with Large Language Models (LLMs)
- Develop ML pipelines for data cleaning and processing structured and unstructured data.
- Train LLMs to recognize products based on descriptions.
- Implement RAG (Retrieval-Augmented Generation) for applications like recommendation engines, network science, and search.
- Technical Competencies
ML & AI Expertise
- Strong knowledge of ML methods, including:
- Recommendation Systems
- Supervised Learning & Feature Engineering
- Time Series & Forecasting Analysis
- Clustering Algorithms
- Experience with network science projects and large-scale ML models.
- Expertise in feature engineering and model optimization.
- Strong programming skills in Python, R, Scala, or similar languages.
- Proficiency with machine learning libraries such as scikit-learn and TensorFlow.
- Experience working with cloud-based AI/ML tools (preferably GCP).
Data & Analytics
- Hands-on experience in data pipeline development, deployment, and operation.
- Strong knowledge of large, distributed datasets for actionable insights.
- Experience with Google BigQuery or Google Looker for dashboard development.
Collaboration & Agile Workflows
- Excellent communication and interpersonal skills with the ability to translate data into insights.
- Familiarity with collaborative AI/ML workflows.
- Experience working in Scrum teams and Agile methodologies.
- Willingness to travel and work flexible hours when required.
Essential Skillset
- Expertise in NLP (Natural Language Processing) and image processing is a plus.
- Experience in network science and time-series forecasting.
- Proficiency in at least one programming language: Python, Go, C, or C++.
- Strong knowledge of AI/ML models, deep learning, and algorithms.
- Familiarity with at least one Deep Learning framework.
Desirable Skillset
- Self-sufficiency in researching and solving technical problems.
- Experience with Google Cloud Platform (GCP), Power BI, or Google Looker.
- Experience handling large-scale datasets and optimizing ML models.
Preferred Qualifications
- Master’s or PhD in Computer Science, AI, Data Science, or a related field.
- Experience in predictive modeling, data visualization, and structured/unstructured data analysis.
- Proficiency in data mining and complex data modeling techniques.
Why Join Us?
- Work on cutting-edge AI/ML models and LLM technologies.
- Join a highly collaborative and innovative team.
- Competitive compensation and benefits package.
Skills: data visualization,processing,ml,r,large language models (llms),models,clustering algorithms,cloud,artificial intelligence (ai),deep learning,recommendation systems,scikit-learn,supervised learning,machine learning (ml),natural language processing (nlp),network science,modeling,time series analysis,google,feature engineering,tensorflow,learning,data,scala,bigquery,python,google cloud platform (gcp),datasets,data modeling