Hi! I am an experienced Data Scientist and Python programmer who works with pandas, NumPy, SciPy, Matplotlib, PyDeck, SQLAlchemy, Snowflake, and Selenium, among other libraries, on a daily basis. I also have experience with AWS, both through the console and programmatically with boto3 and awswrangler. I have a Ph.D. in Mathematical Optimization and love to solve challenging problems. Let me know how I can help you!
Port Polygons Project: Built port polygons by applying advanced clustering algorithms to binned AIS data. Created port anchorage and berth polygons, allowing a complete understanding of a port call. Main tools: Python, pandas, SciPy, NumPy, scikit-learn, DBSCAN, GeoPandas, h3, NetworkX, JupyterLab, awswrangler, Snowflake, PostgreSQL.
Visualizations: Created interactive visualizations of port polygons, port activity, and vessel activity. Visualizations were often used to support business decisions and sales requests. Main tools: pandas, PyDeck, Plotly, Matplotlib, JupyterLab.
Job Speed-Ups and AWS ECS Cost Reduction: Sped up analytics jobs by identifying their bottlenecks. Used multiprocessing to speed up parts of the code that could not be vectorized. Significantly reduced AWS ECS costs by cutting running time.
Scraper Project: Developed a web scraper to fetch metadata for vessels, ports, berths, anchorages, and berth calls. Main tools: Selenium, BeautifulSoup4, lxml, asyncio, aiohttp, awswrangler, Snowflake, Airflow, Docker.
Ballast and Laden Cutoffs Project: Applied unsupervised machine learning algorithms to calculate a vessel's ballast and laden cutoffs. Applied KDE and Gaussian Mixture models to reported draft measurements to estimate their probability distribution when the vessel is in ballast versus holding cargo. Applied the Kolmogorov-Smirnov test to select the "best" pairs of distributions. Main tools: pandas, scikit-learn, GaussianMixture, KernelDensity, SciPy.
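
To give a flavor of the ballast and laden cutoff idea, here is a minimal sketch rather than the actual project code. It assumes a two-component Gaussian mixture fitted to synthetic draft values, takes the cutoff as the draft at which the laden component becomes at least as likely as the ballast one, and uses a two-sample Kolmogorov-Smirnov statistic as one possible measure of how well the two groups separate. The function name `estimate_draft_cutoff`, the synthetic data, and the way the KS test is used are all assumptions made for illustration.

```python
import numpy as np
from scipy.stats import ks_2samp
from sklearn.mixture import GaussianMixture


def estimate_draft_cutoff(drafts, random_state=0):
    """Hypothetical helper: fit a 2-component mixture to draft readings.

    Returns (cutoff, ballast_mean, laden_mean, ks_statistic).
    """
    values = np.asarray(drafts, dtype=float)
    x = values.reshape(-1, 1)
    gmm = GaussianMixture(n_components=2, random_state=random_state).fit(x)

    # The component with the smaller mean is treated as "ballast".
    means = gmm.means_.ravel()
    ballast, laden = np.argsort(means)

    # Scan drafts between the two means and take the first point where
    # the laden posterior is at least as large as the ballast posterior.
    grid = np.linspace(means[ballast], means[laden], 1001).reshape(-1, 1)
    posterior = gmm.predict_proba(grid)
    idx = int(np.argmax(posterior[:, laden] >= posterior[:, ballast]))
    cutoff = float(grid[idx, 0])

    # Assumed use of the KS test: compare the samples on either side of
    # the cutoff; a statistic near 1 means the groups barely overlap.
    low, high = values[values < cutoff], values[values >= cutoff]
    ks_statistic = float(ks_2samp(low, high).statistic)

    return cutoff, float(means[ballast]), float(means[laden]), ks_statistic


# Synthetic drafts in meters (made up): ballast near 7 m, laden near 12 m.
rng = np.random.default_rng(0)
drafts = np.concatenate([rng.normal(7.0, 0.4, 500), rng.normal(12.0, 0.5, 500)])
cutoff, ballast_mean, laden_mean, ks_statistic = estimate_draft_cutoff(drafts)
print(f"cutoff={cutoff:.2f} m, ballast={ballast_mean:.2f} m, laden={laden_mean:.2f} m")
```

Real reported drafts are much noisier than this synthetic sample, so in practice the mixture fit would likely need outlier filtering and a per-vessel or per-class fit.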