Personal details

Anil M. - Remote

Timezone: Eastern Time (US & Canada) (UTC-4)

Summary

Real-time streaming enthusiast, aspiring to build and optimize real-time data collection and processing tools that enable data-driven decision making. I currently focus on building large-scale real-time data processing tools. My research has spanned domains such as distributed computing, cloud computing and the Internet of Things. Exposure to Hadoop and Spark has given me a strong foothold in distributed file systems, the MapReduce paradigm and in-memory computation principles. My learnings are captured in the book “Guide to High Performance Distributed Computing”, which I co-authored with my research advisor.

Work Experience

Data Engineer, Infrastructure
Spotify | Jan 2017 - Present
- Build large-scale real-time infrastructure on Google Cloud Platform
- Develop best practices for continuous integration and delivery of real-time pipelines
- Drive optimization, testing and tooling to improve data quality
- Developer advocate for Google Data Processing solutions
- Collaborate with cross-functional feature teams
Graduate Research Assistant
Georgia Institute of Technology | Aug 2015 - Dec 2016
Worked with the Data Driven Education team as part of the Vertically Integrated Program of the Center for 21st Century Universities. Focused on building a data warehouse solution that enables analytics on data from Georgia Tech's Massive Open Online Courses (MOOCs). Wrote pipelines that ingest data from multiple MOOC platforms and leverage efficient data storage and delivery practices to detect interesting student learning trends.
Other responsibilities:
- Taught a specialized group of undergraduate students the principles of data engineering through practice.