Greg Leclercq ggreg

ggreg / elevator.md
Created December 6, 2012 15:01 — forked from oleiade/elevator.md
Elevator plan

First Article (Addressing the problem)

Rationale

Here at Botify:

  • batch processing of terabytes of web server logs requires storing temporary data
  • bulk writes and reads involve gigabytes of data that would not fit in a server's main memory (at the beginning, we preferred to improve performance on a single host)
  • need for persistence
ggreg / celery_workflow_nested.py
Created December 5, 2012 16:16
celery workflow example with nested canvas primitives
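from celery import group, task  # Celery 3.0-era imports

# LogFetcher and the aggregate task are not shown in this excerpt.

@task
def fetch(pattern, src):
    # Register the data fetched from src and return the id of the result.
    fetcher = LogFetcher(pattern)
    return fetcher.register(src, fetcher.fetch(src)).id


@task
def convert_tsv_blocks(tsv_id_list):
    # Build one aggregate subtask per TSV id and launch them as a group.
    return group([aggregate.s(tsv_id) for tsv_id in tsv_id_list])()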