Skip to content

Instantly share code, notes, and snippets.

@karanth
Created February 17, 2014 15:42
Show Gist options
  • Save karanth/9052903 to your computer and use it in GitHub Desktop.
Save karanth/9052903 to your computer and use it in GitHub Desktop.

Data to the compute vs Compute to the data Data Streaming vs MapReduce

MapReduce model "Program once, deploy at scale" Allows programmers without background in parallel/distributed computing to use the distributed systems efficiently. batch processing

Streaming model No random access to data Limited memory, single CPU single pass, multiple pass within a window Low-latency

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment