Passion in data product/system development
Keep learning & doing
4+ years of data experiences in internet companies across UK/HK/TW
Mobile : +886-963335868 (TW)
SKYPE : yennanliu
Mail : f339339@gmail.com
Github : github.com/yennanliu
Website : yennanliu.github.io
** Scripts **
•Python:
ETL: Airflow, Luigi, Digdag
Spark : Pyspark
ML: scikit-learn, Tensorflow, Keras
Analysis : Pandas, NumPy, SciPy
Visualization : Matplotlib, Seaborn, ggplot, Folium
Web : Flask, Django
Web_scraping: BeautifulSoup, Selenium, Urli
•Spark : SparkSQL, SparkRDD, Mlib, SparkStream
•Scala : Spark
•Shell : bash script, Linux system
•Javascript : D3.JS, Highcharts, Mapbox
** Big data **
•Spark
•Hadoop
•Pig
•Hive
•Hbase
•Impala
** Cloud **
•AWS : EMR, S3, Redshift, DynamoDB, Lambda, ECS, ECR
•GCP : Bigquery, Dataflow, CloudSQL, Bigtable
** DB **
SQL : PostgreSQL, MySQL, MongoDB, Bigquery,Snowflake, SQLite, Oracle
NoSQL : MongoDB
Kv: Redis
Others : Elasticsearch
** Dev-Op **
•CI/CD: Jenkins, Travis
•Git
•Github, Gitlab
•Docker
** Characteristics **
• Problem-solving: data engineering + data science skills
• Cooperation: Work with teams (backend/prod/biz) across the company and alone
• Passion: Learning things fast, willing to boost productivity growth via data and product mindset