Personal details

Pawan K. - Remote data engineer

Based in: 🇬🇧 United Kingdom
Timezone: Edinburgh (UTC+1)

About

Data & Analytics Engineer with manufacturing domain experience. I build end-to-end data pipelines, ML models, and deployed dashboards. 3 live applications, 36+ automated tests, CI/CD on every project. MSc Data Analytics (Aston University). Microsoft PL-300 certified.

Work Experience

Operational Team Leader
Copernus Fresh Fish, Hull | Apr 2025 - Present
SQL Server 2012
MES traceability
KPI monitoring
  • Operate SI (Integreater) ERP daily for production data management: batch traceability reports, yield KPI monitoring, weight variance reconciliation, and audit-ready compliance records
  • Own end-to-end data integrity from batch activation through weight capture, metal detection logging, label verification, and despatch reconciliation across 11+ operatives
  • Identified a mismatch between planned and actual batch despatch dates; proposed a live SQL Server priority dashboard to management as the fix
Cover Team Lead (Curing & Processing)
Cranswick Convenience Foods, Hull | Oct 2024 - Apr 2025
Vectorization
Navan
  • Managed batch-level traceability data for M&S and Tesco premium lines: Nav Code and Vector tag validation, allergen cross-referencing, full audit trail
  • Monitored production metrics on Metapress systems: throughput rates, equipment utilisation, QC pass/fail ratios

Projects

Apex Data Migration
2025
Python
Docker
XGBoost
Power BI
Duck Creek
dbt
Polars
Built 10-task Prefect pipeline with parallel execution, retry logic, and monitoring dashboard simulating production database migration. Trained XGBoost CPU-spike predictor: 93% accuracy, ROC-AUC 0.97, zero data leakage across 96,470 orders. Deployed Isolation Forest for anomaly detection; 94 automated dbt quality checks caught a live LLM label-corruption bug before Power BI load.
UK Crime Analytics Pipeline
2025
Python
PostgreSQL
Neon
GitHub Actions
dbt
Streamlit
End-to-end pipeline: 99,675 crime records ingested from Police UK API across 10 cities into PostgreSQL with idempotent upserts. dbt transforms into 4 mart models; Streamlit dashboard deployed live on Streamlit Cloud + Neon PostgreSQL at zero infrastructure cost. 3 GitHub Actions workflows: CI (lint + dbt test), weekly scheduled ingestion, daily health monitoring.
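
For readers unfamiliar with the term, an idempotent upsert means re-running the same ingestion never creates duplicate rows. The sketch below is illustrative only and is not the project's code: the table name `crimes`, its columns, and the helper `upsert_crimes` are hypothetical, and it uses Python's built-in `sqlite3` so it runs without a database server. PostgreSQL accepts the same `INSERT ... ON CONFLICT ... DO UPDATE` form.

```python
import sqlite3


def upsert_crimes(conn, records):
    """Insert records, updating rows whose crime_id already exists.

    Hypothetical helper: running it twice with the same records
    leaves the table in the same state (idempotent).
    """
    conn.executemany(
        """
        INSERT INTO crimes (crime_id, city, category, month)
        VALUES (:crime_id, :city, :category, :month)
        ON CONFLICT (crime_id) DO UPDATE SET
            city = excluded.city,
            category = excluded.category,
            month = excluded.month
        """,
        records,
    )
    conn.commit()


# In-memory database standing in for the real PostgreSQL instance.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE crimes ("
    "crime_id TEXT PRIMARY KEY, city TEXT, category TEXT, month TEXT)"
)

# Made-up sample rows, not real Police UK data.
batch = [
    {"crime_id": "a1", "city": "Hull", "category": "burglary", "month": "2025-01"},
    {"crime_id": "b2", "city": "Leeds", "category": "robbery", "month": "2025-01"},
]

upsert_crimes(conn, batch)
upsert_crimes(conn, batch)  # second run must not duplicate rows

row_count = conn.execute("SELECT COUNT(*) FROM crimes").fetchone()[0]
```

Keying the conflict on a stable identifier is what makes a weekly scheduled re-ingestion safe to repeat: overlapping date ranges update existing rows instead of appending copies.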

Education

Aston University
Master's degree・Data Analytics
Jan 2023 - Mar 2024
Amity University
Bachelor's degree・Computer Applications
Aug 2019 - Aug 2022