Skip to content

Instantly share code, notes, and snippets.

@Areks
Areks / setup.md
Last active August 29, 2015 14:14 — forked from xrstf/setup.md

Info

This guide sets up a non-clustered Nutch crawler, which stores its data via HBase. We will not learn how to setup Hadoop et al., but just the bare minimum to crawl and index websites on a single machine.

Terms

  • Nutch - the crawler (fetches and parses websites)
  • HBase - filesystem storage for Nutch (Hadoop component, basically)
@Areks
Areks / dabblet.css
Created February 6, 2015 12:33 — forked from anonymous/dabblet.css
The first commented line is your dabblet’s title
/**
* The first commented line is your dabblet’s title
*/
background: #000;
@Areks
Areks / dabblet.css
Created February 6, 2015 12:36
The first commented line is your dabblet’s title
/**
* The first commented line is your dabblet’s title
*/
background: #000;