Job Title: Data Scientist - LLM Data Trainer
Job Type: Full-time
Location: Remote
Job Summary:
We are on a lookout for talented Data Scientists to join our team as LLM Data Trainers. This role focuses on ensuring the highest quality for AI-generated datasets across various programming domains. Your expertise will directly contribute to refining AI models for improved accuracy and performance.
Key Responsibilities:
- Review AI-generated queries for accuracy, clarity, and relevance across multiple programming languages and frameworks.
- Validate and correct misleading or incorrect AI-generated responses.
- Ensure grammatical accuracy, logical structure, and coherence in queries.
- Categorize queries based on difficulty level and topic area.
- Provide constructive feedback to refine AI query generation processes.
- Collaborate with data scientists, machine learning engineers, and AI trainers to improve dataset quality.
- Maintain consistency in annotation standards and validation methodologies.
Required Skills and Qualifications:
- Proficiency in OpenAI APIs and experience with Azure-Samples and OpenAI Cookbook.
- Hands-on experience with Hugging Face Transformers, particularly PyTorch-based NLP models.
- Expertise in data science with a focus on machine learning.
- Exceptional written and verbal communication skills, with attention to detail.
- Ability to validate and enhance language model datasets effectively.
Preferred Qualifications:
- Experience working with a remote team.
- Background in multiple programming languages and frameworks.