Position : Data Scientist
Experience : 3+ Years
Location : Remote
We are seeking a Data Scientist with expertise in Machine Learning, Predictive Analytics, and LLM-based solutions. The ideal candidate will have experience in statistical analysis, in-depth EDA, and deploying ML models on Databricks. Strong proficiency in Mosaic AI, LLM fine-tuning, embeddings, chunking, and agentic workflows is essential.
Key Responsibilities:
Develop and optimize supervised & unsupervised ML models.
Perform advanced statistical analysis and exploratory data analysis (EDA).
Build predictive and descriptive analytics dashboards on Databricks.
Utilize Unity Catalog for data governance and Databricks dashboards for visualization.
Implement SQL-based data processing for ML workflows.
Work with Mosaic AI, Llama3, and open-source LLMs for recommendation systems.
Apply LLM techniques such as embeddings, chunking, tokenization, and fine-tuning.
Develop LLM-based agentic solutions for automation.
Ensure scalable ML deployments in Databricks environments.
Conduct hyperparameter tuning for model optimization.
Evaluate model performance using metrics like Precision, Recall, RMSE, and MSE.
Work on time series forecasting techniques and trend analysis.
Design and implement AI-based recommendation systems for enhanced decision-making.
Apply predictive and descriptive analytics to extract meaningful insights from data.
Gain insights from raw data quickly through effective data understanding, requirement gathering, cleansing, and preprocessing.
Required Skills:
Proficiency in Python, SQL, and ML algorithms.
Hands-on experience with Databricks, Unity Catalog, and Mosaic AI.
Strong knowledge of LLM workflows and fine-tuning techniques.
Ability to build and interpret predictive models and statistical insights.
Experience in dashboard creation for descriptive and predictive analytics.
Familiarity with end-to-end ML workflows in cloud environments.
Expertise in data understanding, cleansing, preprocessing, and quick insight generation.
Understanding of time series modeling fundamentals.
Drop your CV at ruchi@msoltechnology.com