We are looking for a proactive Data Scientist to lead and execute data-driven initiatives, using analytical techniques and AI to solve complex business problems. In this role, you will independently handle data projects, develop predictive models, and provide actionable insights that contribute to organizational growth and innovation.
Key Responsibilities
- Data Processing & Integration – Clean, transform, and integrate data to prepare it for analysis and modeling.
- Pipeline & Infrastructure Management – Maintain data pipelines and optimize the flow of information for model deployment.
- Model & Data Reliability – Validate data accuracy and model output to ensure consistency and reliability across projects.
- Predictive Modeling – Build and refine models to address specific business challenges, ensuring they are optimized and validated.
- Data Exploration & Analysis – Explore structured and unstructured data to uncover patterns and generate insights.
- Hypothesis Testing – Apply A/B testing to validate hypotheses and measure results.
- Visualization & Reporting – Create compelling visualizations that effectively communicate data insights to stakeholders.
- Feature Engineering & Model Optimization – Implement feature extraction and improve model performance through iterative refinements.
- Continuous Learning – Stay up to date with emerging trends in data science, machine learning, and AI.
Qualifications & Experience
- Experience applying software engineering methodologies and best practices including coding standards, code reviews, build processes, testing, and security
- Experience with AI solutions on cloud platforms is a plus.
Technical Proficiency
- Programming: Python (pandas, NumPy, scikit-learn), R, Java, Scala
- Machine Learning: Regression, classification, clustering, neural networks, time-series, and deep learning
- Generative AI: Experience with transformer models (PyTorch, TensorFlow)
- Databases: SQL (Oracle, SAP), NoSQL (MongoDB), Data Warehousing
- Big Data: Hadoop, Spark, Kafka
- Cloud Platforms: Azure, AWS, GCP
- Data Visualization: Power BI, Tableau, Matplotlib
- MLOps & DevOps: CI/CD, Docker, Kubernetes