Skip to content

Instantly share code, notes, and snippets.

View BillLucky's full-sized avatar
🎯
Focusing

BillLucky BillLucky

🎯
Focusing
View GitHub Profile
@BillLucky
BillLucky / setup.md
Created January 22, 2017 03:17 — forked from xrstf/setup.md
Nutch 2.3 + ElasticSearch 1.4 + HBase 0.94 Setup

Info

This guide sets up a non-clustered Nutch crawler, which stores its data via HBase. We will not learn how to setup Hadoop et al., but just the bare minimum to crawl and index websites on a single machine.

Terms

  • Nutch - the crawler (fetches and parses websites)
  • HBase - filesystem storage for Nutch (Hadoop component, basically)
@BillLucky
BillLucky / 0_reuse_code.js
Created July 9, 2016 08:01
Here are some things you can do with Gists in GistBox.
// Use Gists to store code you would like to remember later on
console.log(window); // log the "window" object to the console