@dmpeters
Created February 25, 2013 00:33
Hacker News - February Meetup - elasticsearch Notes

Hacker News - February Meetup

elasticsearch notes

What is elasticsearch?

  • a way to search... things
  • a way to search data in terms of natural language and much more...
  • distributed JSON API
  • fancy cluster

What is lucene?

  • software library providing full-text search
  • elasticsearch is a wrapper around lucene

Modeling Data

  • Data is stored in an index
  • each index can store multiple types, similar to tables in a SQL database
  • specifying attributes for a type is optional
  • all data is sent as JSON

Creating a Schema

  • connect to server
  • PUT data (json format)
  • GET data (query in json format)
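The PUT/GET steps above can be sketched as plain HTTP requests. This is a minimal sketch that only builds the requests and does not send them. The server address, the index name "meetups", the type name "talk", and the helper names are all assumptions for illustration. The index/type/id URL layout matches elasticsearch at the time of these notes; later releases removed types. The search endpoint accepts the JSON query via GET or POST; POST is used here because some HTTP clients drop GET bodies.

```python
import json
import urllib.request

BASE_URL = "http://localhost:9200"  # assumption: a local node on the default port


def build_index_request(index, doc_type, doc_id, document):
    """Build (but do not send) a PUT that stores one JSON document."""
    url = f"{BASE_URL}/{index}/{doc_type}/{doc_id}"
    body = json.dumps(document).encode("utf-8")
    return urllib.request.Request(
        url, data=body, method="PUT",
        headers={"Content-Type": "application/json"},
    )


def build_search_request(index, doc_type, field, text):
    """Build (but do not send) a search whose query is itself JSON."""
    url = f"{BASE_URL}/{index}/{doc_type}/_search"
    query = {"query": {"match": {field: text}}}
    body = json.dumps(query).encode("utf-8")
    return urllib.request.Request(
        url, data=body, method="POST",
        headers={"Content-Type": "application/json"},
    )


# Hypothetical document and query, for illustration only.
put_req = build_index_request("meetups", "talk", 1, {"title": "elasticsearch notes"})
search_req = build_search_request("meetups", "talk", "title", "elasticsearch")
```

Passing either request to `urllib.request.urlopen` would send it, assuming a node is actually listening at `BASE_URL`.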

What is Analysis?

  • Analysis is the process whereby text is transformed into tokens.
  • [examples]
  • 9 or 10 tokenizers
    • write your own
    • e.g. NGram; Snowball and Stop Word are related analyzers/token filters
  • [filters]
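To make the idea of analysis concrete, here is a toy sketch in plain Python: a tokenizer, a stop-word filter, and a character n-gram helper. It is not how elasticsearch or lucene implement these steps. The stop-word list and all function names are assumptions chosen for illustration.

```python
import re

STOP_WORDS = {"a", "an", "and", "is", "of", "the", "to"}  # illustrative list only


def tokenize(text):
    """Split text into lowercase alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def remove_stop_words(tokens):
    """Filter step: drop very common words that carry little meaning."""
    return [t for t in tokens if t not in STOP_WORDS]


def ngrams(token, n):
    """Character n-grams of a single token, useful for partial matching."""
    return [token[i:i + n] for i in range(len(token) - n + 1)]


def analyze(text):
    """Tokenize, then filter: the two stages named in the notes."""
    return remove_stop_words(tokenize(text))
```

For example, `analyze("The Quick Brown Fox is searching")` yields `["quick", "brown", "fox", "searching"]`, and `ngrams("fox", 2)` yields `["fo", "ox"]`.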

Boosting

  • specify that something is more important than other things, based on score
  • boost is a multiplier on the score
  • have to tweak constants for accuracy
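The "multiplier on the score" idea can be sketched with a toy scoring function. Real lucene scoring is considerably more involved; the field names, scores, and boost values below are made up for illustration.

```python
def boosted_score(field_scores, boosts):
    """Sum per-field scores, each multiplied by its boost (default 1.0)."""
    return sum(
        score * boosts.get(field, 1.0)
        for field, score in field_scores.items()
    )


# Hypothetical per-field match scores for two documents.
doc_a = {"title": 0.5, "body": 0.25}
doc_b = {"title": 0.25, "body": 0.75}

no_boost = {}
title_boost = {"title": 4.0}  # a constant you would tune by hand
```

Without a boost, `doc_b` scores higher (1.0 versus 0.75). With the title boosted by 4, `doc_a` overtakes it (2.25 versus 1.75), which is why the constants need tweaking.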

Faceting

  • report counts of common terms in documents
  • e.g. the left-side filter rail on LinkedIn and Amazon
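Counting common terms, as described above, can be sketched in a few lines of plain Python. This only illustrates the concept and is not elasticsearch's facet API. The documents, the "company" field, and the function name are hypothetical.

```python
from collections import Counter


def term_facet(documents, field, size=10):
    """Count how often each value of `field` occurs; return the top `size`."""
    counts = Counter(doc[field] for doc in documents if field in doc)
    return counts.most_common(size)


# Hypothetical search results, for illustration only.
results = [
    {"name": "Ann", "company": "Acme"},
    {"name": "Bob", "company": "Globex"},
    {"name": "Cy", "company": "Acme"},
    {"name": "Di"},  # documents missing the field are skipped
]
```

Here `term_facet(results, "company")` gives `[("Acme", 2), ("Globex", 1)]`, the kind of counts shown next to each filter in a side rail.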

Clustering

  • Yes it has clustering and it just works
  • queries run across all shards of the cluster
  • no built-in security
  • proxying through nginx in front works for security
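"Queries run across all shards" can be pictured as scatter-gather: ask every shard, then merge the hits. The sketch below is a toy in-memory model and not elasticsearch's actual implementation. Shards are plain dicts, the score is a simple term count, and all names are assumptions for illustration.

```python
import heapq


def search_shard(shard, term):
    """Return (score, doc_id) hits for one shard; score is the term count."""
    hits = []
    for doc_id, text in shard.items():
        score = text.lower().split().count(term)
        if score:
            hits.append((score, doc_id))
    return hits


def search_cluster(shards, term, size=10):
    """Scatter the query to every shard, then gather the best hits overall."""
    all_hits = [hit for shard in shards for hit in search_shard(shard, term)]
    return heapq.nlargest(size, all_hits)


# Three hypothetical shards, each holding a slice of the documents.
shards = [
    {"a": "search search engine"},
    {"b": "search cluster"},
    {"c": "nothing here"},
]
```

With this data, `search_cluster(shards, "search")` returns `[(2, "a"), (1, "b")]`: the caller sees one merged result list and never needs to know which shard held which document.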

Ops side

  • easy
  • just need java
  • .deb package available