Personal details

Lucas H. - Remote data engineer

Lucas H.

Machine Learning Engineer
Based in: đŸ‡§đŸ‡· Brazil
Timezone: Brasilia (UTC-3)

Summary

Master and Bachelor in Computer Science | Machine Learning Engineer.

An enthusiast for entrepreneurship and the business world, with experience in a range of projects in the technology industry and in the area of startups.

I have experience with web projects and in the last few years, I am focused on Data Analysis / Science / Engineer. During these projects, I worked with teams, managing tasks and developing features using agile methodologies like Kanban and Scrum.

My master's research was focused on the Artificial Intelligence (AI) area, using and comparing machine learning techniques in the identification of writers' manuscripts. With my work, I made four international publications.

During graduation, I participated in extension and research projects. I was one of the founding members of HackerSpace MaringĂĄ (a community laboratory) and also reactivated the academic center of the course, in which I was president for two years.

“Success is the sum of small efforts, repeated day-in, and day-out.” - Robert Collier

Work Experience

Data Specialist
americanas s.a. | Jun 2021 - Present
Google Cloud Platform
Apache Spark
Apache Airflow
As a Senior Data Scientist, I work to make a hybrid between Data Science and Engineer, in order to be able to orchestrate data sources and later model them generating value for the business. At Americanas S.A. I work in the Seller Analytics team, analyzing data from the group's Marketplace (Americanas.com, Submarino, Shoptime). More specifically, I work on the team responsible for freight policy, creating data models, optimizing investments in freight costs, and building data pipelines for our models (MLOps). I use the following technologies: - Google Cloud Platform - GCP; - BigQuery; - Vertex AI; - CloudRun; - Dataproc; - Apache AirFlow; - dbt; - Python; - FastAPI; - PySpark; - APIs / Functions; - Docker; - Data Studio;
Data Engineer
Eleflow Big Data | Jan 2020 - Jul 2021
Python
SQL
Azure
NoSQL
Spark streaming
Databricks
As a Data Engineer, I worked on two different fronts in Americanas Store S.A. (LASA). First, in a project related to LGPD (Brazilian General Data Protection Law), in it, our team was responsible for organizing and loading into the production of the main Data Lake with all customers base from LASA. After we finished the step of putting the system into production, I changed to another project in LASA. In this second project, I worked together with the Data Management of Americanas Store. In it, the main product was the Single Sale Base (BUV), in which I was responsible for ingesting data from the PDVs (Points of Sales), from all physical stores in Brazil to this database. I worked too with batch and stream processing, improvement in internal process, creation and migration of pipelines, development, and deployment of releases on DevOps (CI/CD), and others. I used the followed technologies: - Azure; - Databricks; - Data Factory; - SQL Server; - CosmoDB; - Azure Functions; - Event Hub; - Stream Analytics.

Education

State University of Maringa
Master's degree・Computer Science
Jan 2016 - Dec 2018

Personal Projects

2017
Python
O Home Team is a web platform that aims to find a better way to find shared homes. :) Home Team is a startup to help people find the right roommate to share a home with. By matching tastes and interests, Home Team tries to match people that won't only share a living space, but also create memories, build a home and make bonds for life.