Personal details

hugo A. - Remote data scientist

hugo A.

Based in: 🇧🇷 Brazil
Timezone: Brasilia (UTC-3)

Summary

My mission is to be a catalyst for the growth of the companies I have the opportunity to work with.

Examples of Value-adding initiatives I led in companies I worked for:

# Led the R&D and industrialization of Machine Learning classifiers aimed at predicting the occasion of consumption (i.e., intent) around an order. (Employer: AbInbev)

# Leveraged NLP text processing techniques and transformer models to build text classifiers for key X (forme Twitter) pages, opening the way for the creation of new marketing consulting products(Parrot Analytics)

# Developed Time Series forecasting models to predict the next quarter of sales for dozens of different SKUs, allowing management to fine-tune its sales strategy and adjust the production capacity to minimize the loss of raw materials. (Abbot Laboratories)

Main Skills:

# Tireless problem solver, Lifelong learner

# AWS Certified: Machine Learning Specialty, Data Analytics Specialty, Cloud Developer Associate, and Solutions

Architect Associate.

# AZURE Certified: DP-100 - Designing and Implementing a Data Science Solution on Azure

# GCP Certified: Professional Machine Learning Engineer and Professional Data Engineer

# Main tech-stack:

---- Python, Scikit-learn, Tensorflow, Pytorch, Spark MLLib, SQL, NoSQL, database architecture and management

---- Git and Agile/Scrum methodologies

---- CI/CD: Experience with CircleCI, Kubeflow and Azure DevOps

---- BI tools( Power BI, tableau)

---- Linux and Bash scripting

# Language proficiency levels:

Portuguese(Native), English(Fluent), French(Fluent), Spanish(advanced), German(advanced)

Work Experience

Senior Data Scientist
Anthropic AI | Dec 2023 - Present
Python
SQL
Pandas
Data Analysis
Statistical Analysis
Scikit-learn
Large Language Models

- Perform A/B testing on alpha/beta versions of Large Language Models

- Statistical Data analysis using Python and packages like scikit-learn, statsmodels, xgboost, etc.

Senior Data Scientist
Leega | May 2023 - Mar 2024
Python
SQL
Google BigQuery
Docker
ETL
Google Cloud Platform
Apache Spark
Kubernetes
TensorFlow
Airflow
Scikit-learn

Leega is a Data Analytics Consulting Company. As such, I have been assigned to the following clients:

AB-Inbev Belgium (Data Squad):

Worked as Tech Lead, providing technical leadership and guidance to business stakeholders. Also got

my hands dirty performing ETL jobs with SQL and Python. Worked on the orchestration of Jobs using

Airflow dags and Kubernetes (GKE).

(Tech-stack: GCP + data-engineering toolkit)

AB-InBev Global (Consumer Data Intelligence Team):

Working as a Machine learning engineer, performing BigData analysis with Pyspark, orchestration of

jobs with Airflow and Kubernetes. Some of the ML related activities I worked in:

- customer segmentation of our client base (R&D)

Hugo Albuquerque Cosme da Silva - page 1- Code refactoring of ML pipelines related to the enrichment of order attributes(age, gender, city, state)

(MLOps)

- Development of an End-to-End ML pipeline to classify order occasion(Partying, relaxing, etc). (R&D +

MLOps)

(Tech-stack: GCP + Vertex AI)

Education

IU International University of Applied Sciences
Master's degree・Artificial Intelligence
Jun 2022 - Jun 2024
Udacity
nanodegree・AWS Machine Learning Engineer
Jun 2023 - Sep 2023

Personal Projects

Resume Parsing using custom Spacy NERIconOpenNewWindows
2024
Python
Flask
Docker
NLP (Natural Language Processing)
AWS (Amazon Web Services)
Developed a custom NER model using spacy to extract entities like technical skills, Job title, college, degree, companies worked at, email, etc. The application code was packaged into a docker application that was pushed to Amazon Container Registry and deployed using AWS Fargate, which is a managed container service.

Certifications & Awards

AWS Certified Machine Learning – Specialty
Amazon Web Services | Aug 2023
Microsoft Certified: Azure Data Scientist Associate
Microsoft | Jul 2023