About the Lead Data Scientist Job Role
You will help our clients solve real-world problems by tracing the data-to-insights lifecycle:
- Understand business problems, making sense of the data landscape & footprint, performing a combination of exploratory, Machine Learning & Advanced Analytics on textual data/li>
- Create, experiment with, and deliver innovative solutions in a consultative mindset to client stakeholders using textual data
Qualification for Lead Data Scientist
- Total 5+ years of experience using analytical tools/languages like Python on large-scale data
- Must have Semantic model & NER experience
- Experience working with pre-trained models, awareness of state-of-art in embeddings and applicability for use cases
- Must have strong experience in NLP/NLG/NLU applications using popular Deep learning frameworks like PyTorch, Tensor Flow, BERT, Langchain, and GPT (or similar models) Open CV.
- Must have exposure to Gen AI models (LLMs) like Mistral, Falcon, Llama 2, GPT 3.5 & 4, Prompt Engineering
- Experience using Azure services for ML & GenAI projects is a plus
- Demonstrated ability to engage with client stakeholders
- Deep knowledge of techniques such as Linear Regression, gradient descent, Logistic Regression, Forecasting, Cluster analysis, Decision trees, Linear Optimization, Text Mining
- Strong understanding of integrating NLP models into business workflows. Prospect should have exposure to project initiation to business impact creation in at least one project.
- Broad knowledge of fundamentals and state-of-the-art in NLP and machine learning
- Coding skills in one or more programming languages such as Python, SQL
- Hands-on experience with popular ML frameworks such as TensorFlow
- Expert / high level of understanding of language semantic concepts & data standardization