About the Role:
We are seeking a Data Scientist / Generative AI Developer with a strong foundation in mathematics and advanced data science methodologies, coupled with expertise in Large Language Models (LLMs), Generative AI, and orchestration frameworks like LangChain or CrewAI. This is a key role in shaping cutting-edge machine learning solutions and integrating state-of-the-art AI capabilities, while collaborating across diverse teams.
Key Responsibilities:
- Design, develop, and implement machine learning algorithms and models, including creating specifications, design documents, and prototypes.
- Build, fine-tune, and deploy solutions leveraging Large Language Models (LLMs) for various use cases.
- Develop workflows and applications using LangChain, CrewAI, or similar GenAI orchestration frameworks to enable seamless AI integrations.
- Optimize model performance with advanced tuning techniques, feature engineering, and experimentation with LLMs.
- Lead brainstorming sessions for system architecture and feature design to ensure scalability and efficiency.
- Integrate and manage Vector Databases (VectorDBs) such as Pinecone, Weaviate, or similar, to handle embedding-based search and retrieval tasks.
- Gather and document design requirements by interfacing with key stakeholders and external customers.
- Collaborate with Engineering, Data Science, Product, UX, Business Development, and Infrastructure teams.
- Uphold and promote best coding practices, including thorough documentation and peer reviews.
Required Qualifications:
- Master’s degree in Data Science, Mathematics, or a related field.
- 5+ years of experience in Data Science, Machine Learning, or a similar domain.
- Expertise in linear algebra, statistics, and probability, with hands-on experience in statistical testing, regression, and deep learning techniques.
- Proficiency in machine learning frameworks such as TensorFlow, PyTorch, or MxNet.
- Experience with Large Language Models (LLMs) such as GPT-4, BERT, or T5.
- Proven experience with orchestration frameworks like LangChain, CrewAI, or similar.
- Strong understanding of Vector Databases (VectorDBs) and embedding-based search.
- Advanced development experience in Python and libraries like NumPy, Sci-Kit Learn, and Matplotlib.
- Proven track record of model performance tuning and deploying optimized machine learning models.
Preferred Skills:
- Proficiency in creating scalable machine learning pipelines.
- Familiarity with industry standards for code optimization and debugging.
Education Requirements:
Master’s Degree in:
- Data Science
- Computer Science
- Artificial Intelligence
- Mathematics
- Machine Learning
- Statistics
- Related fields
Special Considerations:
Certifications or postgraduate diplomas in AI/ML (e.g., from institutions like IIIT-Hyderabad, ISB, or online platforms like Stanford AI, Coursera, edX) could also supplement academic credentials if accompanied by relevant experience.
Key Skills:
-Experience with LLMs, LangChain/CrewAI, Vector Databases (e.g., Pinecone, Weaviate).
-Data Science, Machine Learning, TensorFlow, PyTorch, Deep Learning
-Expertise in Python
If you are a highly motivated professional with expertise in data science, a passion for mathematical problem-solving, and a drive to create impactful solutions, we’d love to have you on our team!