Role and responsibilities:
- Design, implement, and deploy predictive and prescriptive models using Python and Data-bricks.
- Define and implement best practices for data ingestion, cleaning, and feature engineering.
- Optimize Spark jobs and workflows for scalability and cost-efficiency.
- Lead A/B testing and statistical validation of models.
- Implement robust pipelines for model deployment and monitoring.
- Work closely with business stakeholders to identify opportunities and deliver data-driven so-lutions.
- Coach junior team members and contribute to knowledge-sharing sessions.
- Stay updated on emerging trends in AI/ML and propose innovative solutions.
Technical Skills & Expertise:
- Advanced proficiency in Python (pandas, NumPy, scikit-learn, PySpark).
- Strong experience with Databricks and distributed data processing using Apache Spark.
- Solid understanding of SQL for data manipulation and query optimization.
- Expertise in building, tuning, and deploying ML models (classification, regression, clustering).
- Familiarity with MLflow for experiment tracking and model lifecycle management.
- Knowledge of deep learning frameworks (TensorFlow, PyTorch) is a plus.
- Hands-on experience with Azure Data Lake, AWS S3, or similar cloud storage.
- Understanding of Delta Lake and data versioning.
- Strong skills in data storytelling using tools like Power BI or Plotly.
- Experience with CI/CD pipelines for ML models and containerization (Docker).
- Ability to optimize Spark jobs and handle large-scale datasets efficiently.
Qualifications/Requirements:
- Education: Master’s or Ph.D. in Computer Science, Statistics, Mathematics, or related field.
- 5+ years in data science roles, with at least 2 years in a senior or lead capacity.
- Proven track record of delivering end-to-end ML solutions in production.
- Strong problem-solving and analytical thinking.
- Excellent communication skills for cross-functional collaboration.
- Languages: Fluent in English (written and spoken).
Desired Characteristics:
- Strong communication (both verbally and in writing) and interpersonal skills
- Fast learner, energetic, enthusiastic and ability to multi-task
- Business-focused, customer- & service-minded
- Flexible to work across time zones and with cross-functional teams.
- Results-oriented & work ethic to always deliver on-time and in-scope.
- Able to work under pressure and manage stressful situations confidently & effectively.
- Open-minded, positive attitude
- Self-motivated to work independently.
- Good team player, actively participating and contributing toward achieving our mission suc-cessfully.
Required skills:
- Python
- Apache Spark
- cross-functional
- Scikit-Learn
- AWS S3
- Docker
- ML Models
- Databricks
- MLFlow
- PySpark
- CI/CD
- Spark
- Delta lake
- Power BI
- SQL
- pytorch
- Deep Learning
- Analytical
- Azure Data Lake
- TensorFlow
- Pandas
- Production
- Ploty
- NumPy
- Problem Solving
Languages: English (Proficient)
Ready for your next career move? Explore opportunities at Co-Workertech.com
Join our LinkedIn groups for updates on upcoming opportunities! Connect, collaborate, and thrive with industry leaders:
- Co-Worker Technology
- Co-Worker Renewable Energy Industry Jobs
Follow us to stay updated on the latest news, insights, and exciting announcements from our company.