Position: Backend Developer/Data Scientist for Pedigrees 360
Job Summary:
We are looking for an experienced and versatile Backend Developer/Data Scientist to join our team and enhance the Pedigrees 360 platform. This role requires a deep understanding of both traditional and Graph Databases (specifically Neo4j), experience with modern Lakehouse technology and Medallion Architecture, as well as strong data science capabilities. This position will focus on developing scalable backend systems, working with complex data models (including horse pedigrees), and building machine learning models to drive intelligent insights for the platform.
Key Responsibilities:
Backend Development:
- Design and build scalable backend systems that integrate with both traditional relational databases and Graph databases (e.g., Neo4j).
- Develop and optimize RESTful APIs for seamless data interaction between backend systems, frontend applications, and external data sources.
- Build and manage data processing pipelines, ensuring the smooth flow of information across systems, especially within a Lakehouse architecture.
Data Science:
- Develop machine learning models for analyzing horse pedigrees, breeding patterns, and predicting breeding outcomes.
- Utilize Graph algorithms in Neo4j to uncover hidden relationships within pedigree data and enhance prediction models.
- Apply data science techniques to clean, transform, and engineer features from large datasets for model training.
- Implement data-driven decision-making processes that directly impact the platform’s features and functionality.
Data Architecture & Integration:
- Leverage Lakehouse technology and Medallion Architecture to build efficient data lakes and transform raw data into meaningful insights at scale.
- Work closely with the data engineering team to ensure data consistency, availability, and accessibility across systems.
- Optimize and manage both OLTP and OLAP systems, ensuring performance and scalability for large volumes of horse racing and breeding data.
Collaboration & Optimization:
- Work with the product, data engineering, and UI/UX teams to ensure seamless integration of backend systems and data science models into the Pedigrees 360 platform.
- Collaborate on designing and improving features based on data insights, customer feedback, and business needs
Required Skills & Qualifications:
Backend Development:
- Strong experience with backend programming languages such as Python, C#, or Node.js.
- Expertise in working with both traditional relational databases (e.g., SQL Server, PostgreSQL) and Graph Databases (preferably Neo4j).
- Experience with Neo4j queries and graph-based data models, utilizing graph algorithms to uncover relationships and patterns in data.
- Strong experience in building and optimizing RESTful APIs.
- Familiarity with cloud platforms, particularly Azure, for hosting, processing, and managing data at scale.
- Knowledge of containerization (e.g., Docker, Kubernetes) and version control (e.g., Git).
Data Science & Machine Learning:
- Strong proficiency in Python and data science libraries (Pandas, NumPy, Scikit-learn, TensorFlow, PyTorch).
- Experience building and deploying machine learning models for regression, classification, and prediction tasks.
- Knowledge of Graph-based algorithms and their application to machine learning.
- Proficient in data wrangling, feature engineering, and model validation techniques.
Data Architecture:
- Experience with Lakehouse technologies (e.g., Delta Lake, Databricks, Apache Iceberg) for managing data lakes and integrating them with traditional data warehouses.
- Knowledge of Medallion Architecture for data processing pipelines, ensuring data is organized, clean, and processed efficiently.
- Strong understanding of ETL processes and big data technologies.
Experience:
- Proven experience working with large datasets, graph models, and machine learning in a production environment.
Why Pedigrees 360?:
- Be part of a forward-thinking team working on cutting-edge technologies to revolutionize horse pedigree analysis.
- Opportunity to impact and optimize breeding strategies with machine learning and graph databases.
- Flexible, collaborative, and dynamic work environment.
If you're passionate about backend development, data science, and pushing the boundaries of technology in the world of horse racing and breeding, we want to hear from you!