Skip to content

Instantly share code, notes, and snippets.

@gaborgsomogyi
Last active April 26, 2019 08:48
Show Gist options
  • Save gaborgsomogyi/2d0bc2049f842274d91022fae3bccf07 to your computer and use it in GitHub Desktop.
Save gaborgsomogyi/2d0bc2049f842274d91022fae3bccf07 to your computer and use it in GitHub Desktop.
ContainerId string format is changed if RM restarts with work-preserving recovery enabled.
It used to be such format:
container_{clusterTimestamp}_{appId}_{attemptId}_{containerId}
e.g.: container_1410901177871_0001_01_000005.
It is now changed to:
container_e{epoch}_{clusterTimestamp}_{appId}_{attemptId}_{containerId}
e.g.: container_e17_1410901177871_0001_01_000005.
Here, the additional epoch number is a monotonically increasing integer which starts from 0
and is increased by 1 each time RM restarts. If epoch number is 0, it is omitted and
the ContainerId string format stays the same as before.
https://hadoop.apache.org/docs/current/api/org/apache/hadoop/yarn/api/records/ContainerId.html
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment