Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save wolfram77/10964cd26f11f7a7299e7b74a0be7e7e to your computer and use it in GitHub Desktop.
Save wolfram77/10964cd26f11f7a7299e7b74a0be7e7e to your computer and use it in GitHub Desktop.
Scaling PageRank to 100 Billion Pages : NOTES

Highlighted notes on:
Scaling PageRank to 100 Billion Pages.

Author:
Stergios Stergiou

Publication:
WWW '20: Proceedings of The Web Conference 2020, April 2020. Pages 2761–2767.
https://doi.org/10.1145/3366423.3380035

Distributed graph frameworks formulate tasks as sequences of supersteps within which communication is performed asynchronously by sending messages over the graph edges. PageRank's communication pattern is identical across supersteps since each vertex sends messages to all its edges. We exploit this pattern to develop a new communication paradigm that allows us to exchange messages that include only edge payloads, dramatically reducing bandwidth requirements. Experiments on a web graph of 38 billion vertices and 3.1 trillion edges yield execution times of 34.4 seconds per iteration, suggesting more than an order of magnitude improvement over the state-of-the-art.

Display the source blob
Display the rendered blob
Raw
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment