One problem in distributed computing (eg. Map Reduce) is stragglers. A straggler is a computation that is going slower than others which holds up everyone. Stragglers may happen because of slow IO (say a bad controller) or from a temporary CPU spike. The solution is to run multiple of the same computations and when one is done kill all the rest.
Created
June 23, 2017 05:01
-
-
Save peijiehu/4946cf6837d4bb45cd5951e4f54ffa85 to your computer and use it in GitHub Desktop.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment