Business Title: Sr. Data Scientist – LLM/RAG/Azure
Location: Atlanta, GA (100% Remote Job)
Job Type: Contract (6+ Months) with high possibility of conversion.
Projects:
- Chatbot – ask question, question can be NL but had to reach backend. Connect like SQL and then provide a response. Develop chatbot which provides interface so solve multiple business/project/customer needs.
Why the need for this person?
- SME – LLM/Azure space
- Deploying/implementing in Azure
Background:
- Implementing GenAI use cases – leveraging Azure
- Azure AI foundry (Some experience)
- Azure Open AI – deployed LLM
- Azure Databricks
- Prototype projects – can’t just wait on SRE – someone to get their hands dirty..
D2D:
- Identified few projects/use cases – this person will contribute to these on D2D basement.
- Actively construct functions that function needs of calling data – that calls endpoints that have frontend capabilities.
- Streamlit or Gradio (some frontend capabilities)
- Start working on this in small doses to get it ready.
- GitHub Actions / CI/CD – Azure DevOps
Top 5 tools/skills?
- Azure Open AI
- Azure Databricks
- Azure ML
- Azure Foundry (basic knowlegde/exp
- GitHub/CI/CD or Azure DevOps
- LLM/GenAI Knowledge
- Finetuning - Yes
- Evaluation – Yes
- Should be able to suggest the best route to take.
- RAG – Production-level exp – Setup pipelines.
- Model Development
Must haves?
- Python – very strong
- Azure services – data lake storage (Azure Data Lake storage)
- How to do an API call from Python
- Front end
- Gradio or Streamlit – light knowledge
- Want to create product – simple chatbot, need frontend to show what they’ve developed.
- Quick things to setup interface in Databricks/etc. (Chatbot)
- BERT Models – understanding of these – so they know what they’re implementing.
- Vector storage – why do they need it
- Chroma, pinecone, etc.
Plusses:
- Dynamic, adapts to change well, requirements will be changing.
- Mainly working on POCs