@kwikadi
Created November 26, 2014 18:10
The Wiki Project: Mapping Wikipedia links into a graph #wiki #ideabin
Create a graph of all links in Wikipedia.
Inspired by the '4 links' concept.
Using pure C++
1. Download wiki dumps (~10 GB)
2. Rip out all links (Simple regex)
3. Create graph
4. Become happy
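Steps 2 and 3 could be sketched roughly like this. It assumes each page's text is already available as a string of wikitext in which internal links look like [[Target]] or [[Target|label]]; reading the dump's XML is not shown. The names LinkGraph, extract_links and add_page are made up for illustration. Namespaced targets (Category:, File: and so on) are kept as-is, and nested links inside image captions are not handled.

```cpp
#include <map>
#include <regex>
#include <set>
#include <string>
#include <vector>

// Adjacency list keyed by page title: page -> set of pages it links to.
using LinkGraph = std::map<std::string, std::set<std::string>>;

// Return the targets of [[Target]], [[Target|label]] and [[Target#Section]]
// links. The "#Section" anchor and the "|label" part are dropped.
std::vector<std::string> extract_links(const std::string& wikitext) {
    static const std::regex link_re(
        R"(\[\[([^\]\|#]+)(?:#[^\]\|]*)?(?:\|[^\]]*)?\]\])");
    std::vector<std::string> targets;
    for (std::sregex_iterator it(wikitext.begin(), wikitext.end(), link_re), end;
         it != end; ++it) {
        targets.push_back((*it)[1].str());
    }
    return targets;
}

// Add one page and its outgoing links to the graph.
// Duplicate links on the same page collapse into a single edge.
void add_page(LinkGraph& graph, const std::string& title,
              const std::string& wikitext) {
    std::set<std::string>& out = graph[title];
    for (const std::string& target : extract_links(wikitext)) {
        out.insert(target);
    }
}
```

Calling add_page(graph, title, text) once per page fills the graph. std::regex keeps the sketch in pure C++, but it may be too slow for a dump of this size, in which case a hand-written scan for "[[" would be the obvious replacement.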
ISSUES:
Parallel processing
Containing graph in memory
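One possible way to approach the memory issue (the parallel-processing issue is not addressed here) is to stop storing titles on every edge: give each title a dense 32-bit id, then keep the edges in compressed sparse row (CSR) form, which costs 4 bytes per edge plus one offset per page. This is a sketch under those assumptions, not a measured solution; TitleIndex, CsrGraph and build_csr are hypothetical names, and it assumes the number of distinct titles fits in 32 bits.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Maps each distinct title to a dense 32-bit id, assigned in first-seen order,
// so edges can store integers instead of strings.
class TitleIndex {
public:
    std::uint32_t id_of(const std::string& title) {
        auto it = ids_.find(title);
        if (it != ids_.end()) return it->second;
        auto id = static_cast<std::uint32_t>(ids_.size());
        ids_.emplace(title, id);
        return id;
    }
    std::size_t size() const { return ids_.size(); }

private:
    std::unordered_map<std::string, std::uint32_t> ids_;
};

// (source id, target id)
using Edge = std::pair<std::uint32_t, std::uint32_t>;

// Compressed sparse row layout: the targets of node v are
// targets[offsets[v]] .. targets[offsets[v + 1] - 1].
struct CsrGraph {
    std::vector<std::uint64_t> offsets;
    std::vector<std::uint32_t> targets;
};

// Build the CSR arrays from an unsorted edge list with a counting sort.
CsrGraph build_csr(std::size_t node_count, const std::vector<Edge>& edges) {
    CsrGraph g;
    g.offsets.assign(node_count + 1, 0);
    // Count the out-degree of each node, shifted by one slot.
    for (const Edge& e : edges) ++g.offsets[e.first + 1];
    // Prefix sum turns the counts into start offsets.
    for (std::size_t v = 0; v < node_count; ++v) g.offsets[v + 1] += g.offsets[v];
    g.targets.resize(edges.size());
    // cursor[v] is the next free slot in node v's range.
    std::vector<std::uint64_t> cursor(g.offsets.begin(), g.offsets.end() - 1);
    for (const Edge& e : edges) g.targets[cursor[e.first]++] = e.second;
    return g;
}
```

The edge list itself still has to exist while the CSR arrays are built, so peak memory is higher than the final structure; writing the edges to disk first and building in two passes would be one way around that.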