Skip to content

Instantly share code, notes, and snippets.

@peijiehu
Created August 20, 2017 05:43
Show Gist options
  • Save peijiehu/31baf4c181a9a8e40bd1a7c2508f55f3 to your computer and use it in GitHub Desktop.
Save peijiehu/31baf4c181a9a8e40bd1a7c2508f55f3 to your computer and use it in GitHub Desktop.

Data jiujitsu: When building a data product, start small and verify that the users like the idea before investing too much to get a perfect product. This is reiterated in “Rules of Machine Learning: Best Practices for ML Engineering” by Martin Zinkevich.

Hadoop joins: If your offline flow is taking a long time to converge, it might be that you are doing massive joins.

Use hybrid: When building an online recommendation service, consider using a hybrid solution to precompute some parts of the computation in order to speed up your service. We have other successful systems at LinkedIn that follow a similar approach, including our Ads ML system.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment