Staff Data Engineer (Commercial Products)
Airbnb · Full-time
Feb 2021 - Present · 1 yr 7 mos
San Francisco Bay Area
- Decreased the landing time of Airbnb's unit economics dataset from 3 days to 1.5 days. Built a long-term roadmap to further increase the quality of this critical dataset.
- Improved the quality of Airbnb's online systems so that their data can be easily consumed by Spark pipelines that compute metrics offline (see the sketch below).
- Led a team of 6 engineers to deliver on the MIDAS initiative, increasing data quality across the Commercial Products org.
- Upleveled smart pricing at Airbnb by improving the feature engineering and reducing the latency of the data used to train the smart pricing model.
- Skills: Big Data · Scala · Apache Spark · SQL · Machine Learning · Apache Superset · Apache Airflow · Mentoring · Team Leadership · Data Visualization · Data Analysis · Java · Python · Linux · Git · Googling
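A minimal sketch of the kind of Spark batch job the offline-metrics bullet above describes; the table and column names (events.bookings, metrics.daily_unit_economics, gross_booking_value) are hypothetical, not Airbnb's actual schema:

```scala
// Hypothetical sketch, not Airbnb's code: roll up online booking events
// into a daily offline metrics table that downstream pipelines can consume.
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._

object DailyUnitEconomics {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("daily-unit-economics").getOrCreate()
    val ds = args(0) // partition date passed in by the scheduler (e.g. Airflow)

    val bookings = spark.table("events.bookings")   // hypothetical online event log
      .where(col("ds") === ds)

    val daily = bookings
      .groupBy(col("ds"), col("listing_id"))
      .agg(
        count("*").as("n_bookings"),
        sum("gross_booking_value").as("gbv")
      )

    // Sketch only: a production job would overwrite just the ds partition.
    daily.write.mode("overwrite")
      .saveAsTable("metrics.daily_unit_economics")  // hypothetical offline metrics table
  }
}
```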
- Built a machine learning feedback system that allowed security engineers to label corporate user behavior as risky or not risky (see the sketch below).
- Built Asset Inventory, a graph database solution that maps all of Netflix's cloud infrastructure.
- Skills: Big Data · Scala · Apache Spark · Cybersecurity · SQL · Machine Learning · Apache Airflow · Team Leadership · Data Visualization · Data Analysis · Java · Python · JavaScript · HTML · Cascading Style Sheets (CSS) · Node.js · React.js · D3.js · Linux · Git · PostgreSQL · REST APIs · Spring Framework · Googling
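To illustrate the feedback loop in the first bullet, here is a hypothetical sketch (table names invented, not Netflix's actual code) of the batch step that joins engineer-supplied risky/not-risky labels back onto behavioral feature rows to produce the next training set:

```scala
// Hypothetical sketch: merge analyst labels with behavioral features
// to build a training set for the next iteration of the risk model.
import org.apache.spark.sql.SparkSession

object RiskLabelFeedback {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("risk-label-feedback").getOrCreate()

    val features = spark.table("security.user_behavior_features") // hypothetical: one row of model features per event_id
    val labels   = spark.table("security.analyst_labels")         // hypothetical: (event_id, is_risky) labels from security engineers

    val trainingSet = features.join(labels, Seq("event_id"))      // keep only events an engineer has labeled

    trainingSet.write.mode("overwrite")
      .saveAsTable("security.risk_training_set")                  // consumed by the model-training job
  }
}
```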
- Built a pipeline that measures the cloud infrastructure impact of A/B tests, saving Netflix millions by enabling smarter A/B test rollout decisions (see the sketch below).
- Skills: Big Data · Apache Spark · Cybersecurity · SQL · Machine Learning · Apache Airflow · Data Visualization · Data Analysis · Java · Python · Linux · Git · Googling
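A hypothetical sketch of the core join and aggregation such a pipeline might perform (schemas invented for illustration): attribute daily cloud cost to A/B test cells so treatment vs. control spend can be compared before a rollout decision.

```scala
// Hypothetical sketch: attribute infrastructure cost to A/B test cells.
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._

object AbTestInfraCost {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("ab-test-infra-cost").getOrCreate()

    val allocations = spark.table("abtest.allocations") // hypothetical: (account_id, test_id, cell)
    val costs       = spark.table("infra.daily_cost")   // hypothetical: (account_id, ds, usd_cost)

    val costByCell = costs
      .join(allocations, Seq("account_id"))
      .groupBy(col("test_id"), col("cell"))
      .agg(sum("usd_cost").as("total_usd_cost"))        // compare treatment vs. control spend per test

    costByCell.write.mode("overwrite")
      .saveAsTable("abtest.infra_cost_by_cell")         // feeds rollout-decision dashboards
  }
}
```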
Data Engineer
Facebook
Aug 2016 - May 2018 · 1 yr 10 mos
Menlo Park, California
- Managed a 10 PB+ Hive data warehouse
- Consolidated and conformed growth metrics across WhatsApp, Instagram, Messenger, and Facebook into a single, company-wide view.
- Optimized machine learning feature set generation pipelines (200+ TB/day), cutting latency from 4 days to 1 day while also dropping their compute costs 4x.
- Reduced core notification data set latencies from 36 hours to < 8 hours.
- Migrated 50% of notifications pipelines from Hive to Spark, Presto, or real-time streaming (see the migration sketch below).
- Cut compute costs for notifications pipelines by 40% over the course of 9 months.
- Skills: Big Data · Apache Spark · SQL · Machine Learning · Data Visualization · Data Analysis · Java · Python · JavaScript · React.js · Hadoop · MapReduce · Linux · Git · Googling
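A hypothetical sketch of what one such Hive-to-Spark migration looks like (invented notifications schema, not Facebook's actual pipeline): the same daily aggregation the old Hive query produced, expressed as a Spark job against the existing Hive warehouse.

```scala
// Hypothetical sketch: a daily notifications rollup rewritten as a Spark job.
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._

object NotificationSendsDaily {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("notification-sends-daily")
      .enableHiveSupport()                         // read/write the existing Hive warehouse
      .getOrCreate()

    val sends = spark.table("notifications.sends") // hypothetical raw sends table with a boolean `clicked` column
      .where(col("ds") === args(0))                // process one date partition per run

    val daily = sends
      .groupBy(col("ds"), col("channel"))
      .agg(
        count("*").as("n_sent"),
        sum(when(col("clicked"), 1).otherwise(0)).as("n_clicked")
      )

    daily.write.mode("overwrite")
      .insertInto("notifications.sends_daily")     // same downstream table the old Hive job wrote
  }
}
```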