Personal details

yennaniu - Remote

yennaniu

Timezone: Taipei (UTC+8)

Summary

Passion in data product/system development

  • Big data
  • Data storage (database, data warehouse)
  • Data infrastructure
  • ETL
  • ML application
  • Data as service (DaaS)

Keep learning & doing

4+ years of data experiences in internet companies across UK/HK/TW

Mobile : +886-963335868 (TW)
SKYPE : yennanliu
Mail : f339339@gmail.com
Github : github.com/yennanliu
Website : yennanliu.github.io

** Scripts **

•Python:
ETL: Airflow, Luigi, Digdag
Spark : Pyspark
ML: scikit-learn, Tensorflow, Keras
Analysis : Pandas, NumPy, SciPy
Visualization : Matplotlib, Seaborn, ggplot, Folium
Web : Flask, Django
Web_scraping: BeautifulSoup, Selenium, Urli
•Spark : SparkSQL, SparkRDD, Mlib, SparkStream

•Scala : Spark
•Shell : bash script, Linux system
•Javascript : D3.JS, Highcharts, Mapbox

** Big data **

•Spark
•Hadoop
•Pig
•Hive
•Hbase
•Impala

** Cloud **

•AWS : EMR, S3, Redshift, DynamoDB, Lambda, ECS, ECR
•GCP : Bigquery, Dataflow, CloudSQL, Bigtable

** DB **

SQL : PostgreSQL, MySQL, MongoDB, Bigquery,Snowflake, SQLite, Oracle
NoSQL : MongoDB
Kv: Redis
Others : Elasticsearch

** Dev-Op **

•CI/CD: Jenkins, Travis
•Git
•Github, Gitlab
•Docker

** Characteristics **

• Problem-solving: data engineering + data science skills
• Cooperation: Work with teams (backend/prod/biz) across the company and alone
• Passion: Learning things fast, willing to boost productivity growth via data and product mindset