Skip to content

Instantly share code, notes, and snippets.

@dbenson
Created February 24, 2011 16:28
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save dbenson/842386 to your computer and use it in GitHub Desktop.
Save dbenson/842386 to your computer and use it in GitHub Desktop.
Cluster partition and loss of data -- 104
[2011-02-24 02:16:58,287][INFO ][cluster.service ] [dm-essearchp104.bldrprod.local-ElasticSearch] removed {[dm-essearchp101.bldrprod.local-essearcherserver][7r2eKgQZRNOsV4740Yed2Q][inet[/10.15.104.1:9301]]{client=true, data=false},}, reason: zen-disco-node_failed([dm-essearchp101.bldrprod.local-essearcherserver][7r2eKgQZRNOsV4740Yed2Q][inet[/10.15.104.1:9301]]{client=true, data=false}), reason failed to ping, tried [3] times, each with maximum [30s] timeout
[2011-02-24 02:16:58,289][INFO ][cluster.service ] [dm-essearchp104.bldrprod.local-ElasticSearch] removed {[dm-essearchp101.bldrprod.local-ESIndexer][Q-mYKiOfTXaeXlxTj1bm_g][inet[/10.15.104.1:9302]]{client=true, data=false},}, reason: zen-disco-node_failed([dm-essearchp101.bldrprod.local-ESIndexer][Q-mYKiOfTXaeXlxTj1bm_g][inet[/10.15.104.1:9302]]{client=true, data=false}), reason failed to ping, tried [3] times, each with maximum [30s] timeout
[2011-02-24 02:16:58,587][INFO ][cluster.service ] [dm-essearchp104.bldrprod.local-ElasticSearch] removed {[dm-essearchp101.bldrprod.local-ElasticSearch][UZ5LMbXdR4i1yY-EHbzxaQ][inet[/10.15.104.1:9300]],}, reason: zen-disco-node_failed([dm-essearchp101.bldrprod.local-ElasticSearch][UZ5LMbXdR4i1yY-EHbzxaQ][inet[/10.15.104.1:9300]]), reason failed to ping, tried [3] times, each with maximum [30s] timeout
[2011-02-24 02:20:42,269][WARN ][transport ] [dm-essearchp104.bldrprod.local-ElasticSearch] Received response for a request that has timed out, action [discovery/zen/fd/ping], node [[dm-essearchp103.bldrprod.local-ESIndexer][M2tK7q1US2WjO6kK7IPNlw][inet[/10.15.104.3:9302]]{client=true, data=false}], id [43707246]
[2011-02-24 02:20:42,281][WARN ][transport ] [dm-essearchp104.bldrprod.local-ElasticSearch] Received response for a request that has timed out, action [discovery/zen/fd/ping], node [[dm-essearchp103.bldrprod.local-essearcherserver][-xiqLgsxTm2Dxl-oEQTvUA][inet[/10.15.104.3:9301]]{client=true, data=false}], id [43707247]
[2011-02-24 02:20:42,489][WARN ][transport ] [dm-essearchp104.bldrprod.local-ElasticSearch] Received response for a request that has timed out, action [discovery/zen/fd/ping], node [[dm-essearchp102.bldrprod.local-ElasticSearch][WEv2yxs-S2ihrtbrdoyyvw][inet[/10.15.104.2:9300]]], id [43707251]
[2011-02-24 02:20:42,961][WARN ][transport ] [dm-essearchp104.bldrprod.local-ElasticSearch] Received response for a request that has timed out, action [discovery/zen/fd/ping], node [[dm-essearchp102.bldrprod.local-essearcherserver][6QttpJ6DR227OYRnKCxnhQ][inet[/10.15.104.2:9301]]{client=true, data=false}], id [43707249]
[2011-02-24 02:20:42,999][WARN ][transport ] [dm-essearchp104.bldrprod.local-ElasticSearch] Received response for a request that has timed out, action [discovery/zen/fd/ping], node [[dm-essearchp102.bldrprod.local-ESIndexer][WOSuCf1ASd6ukY_wppPJsw][inet[/10.15.104.2:9302]]{client=true, data=false}], id [43707252]
[2011-02-24 02:20:43,093][WARN ][transport ] [dm-essearchp104.bldrprod.local-ElasticSearch] Received response for a request that has timed out, action [discovery/zen/fd/ping], node [[dm-essearchp103.bldrprod.local-ElasticSearch][fRPn3o62TPmtCLUHPeOxNg][inet[/10.15.104.3:9300]]], id [43707248]
[2011-02-24 02:23:37,015][WARN ][indices.cluster ] [dm-essearchp104.bldrprod.local-ElasticSearch] [djcnr_20110110233048][0] failed to start shard
org.elasticsearch.index.shard.recovery.RecoveryFailedException: Index Shard [djcnr_20110110233048][0]: Recovery failed from [dm-essearchp103.bldrprod.local-ElasticSearch][fRPn3o62TPmtCLUHPeOxNg][inet[/10.15.104.3:9300]] into [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]]
at org.elasticsearch.index.shard.recovery.RecoveryTarget.doRecovery(RecoveryTarget.java:242)
at org.elasticsearch.index.shard.recovery.RecoveryTarget.access$100(RecoveryTarget.java:60)
at org.elasticsearch.index.shard.recovery.RecoveryTarget$2.run(RecoveryTarget.java:145)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:619)
Caused by: org.elasticsearch.transport.RemoteTransportException: [dm-essearchp103.bldrprod.local-ElasticSearch][inet[/10.15.104.3:9300]][index/shard/recovery/startRecovery]
Caused by: org.elasticsearch.index.engine.RecoveryEngineException: [djcnr_20110110233048]0] Phase[1] Execution failed
at org.elasticsearch.index.engine.robin.RobinEngine.recover(RobinEngine.java:533)
at org.elasticsearch.index.shard.service.InternalIndexShard.recover(InternalIndexShard.java:386)
at org.elasticsearch.index.shard.recovery.RecoverySource.recover(RecoverySource.java:107)
at org.elasticsearch.index.shard.recovery.RecoverySource.access$1500(RecoverySource.java:60)
at org.elasticsearch.index.shard.recovery.RecoverySource$StartRecoveryTransportRequestHandler$1.run(RecoverySource.java:288)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:619)
Caused by: org.elasticsearch.index.shard.recovery.RecoverFilesRecoveryException: [djcnr_20110110233048]0] Failed to transfer [109] files with total size of [76mb]
at org.elasticsearch.index.shard.recovery.RecoverySource$1.phase1(RecoverySource.java:204)
at org.elasticsearch.index.engine.robin.RobinEngine.recover(RobinEngine.java:529)
... 7 more
Caused by: org.apache.lucene.store.AlreadyClosedException: this Directory is closed
at org.apache.lucene.store.Directory.ensureOpen(Directory.java:232)
at org.apache.lucene.store.FSDirectory.openInput(FSDirectory.java:346)
at org.elasticsearch.index.store.support.AbstractStore$StoreDirectory.openInput(AbstractStore.java:252)
at org.elasticsearch.index.shard.recovery.RecoverySource$1$1.run(RecoverySource.java:159)
... 3 more
[2011-02-24 02:23:37,020][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] sending failed shard for [djcnr_20110110233048][0], node[K7mlyvnETFu2bqY1mjn6dw], [R], s[INITIALIZING], reason [Failed to start shard, message [RecoveryFailedException[Index Shard [djcnr_20110110233048][0]: Recovery failed from [dm-essearchp103.bldrprod.local-ElasticSearch][fRPn3o62TPmtCLUHPeOxNg][inet[/10.15.104.3:9300]] into [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]]]; nested: RemoteTransportException[[dm-essearchp103.bldrprod.local-ElasticSearch][inet[/10.15.104.3:9300]][index/shard/recovery/startRecovery]]; nested: RecoveryEngineException[[djcnr_20110110233048]0] Phase[1] Execution failed]; nested: RecoverFilesRecoveryException[[djcnr_20110110233048]0] Failed to transfer [109] files with total size of [76mb]]; nested: AlreadyClosedException[this Directory is closed]; ]]
[2011-02-24 02:23:37,023][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [djcnr_20110110233048][0], node[K7mlyvnETFu2bqY1mjn6dw], [R], s[INITIALIZING], reason [Failed to start shard, message [RecoveryFailedException[Index Shard [djcnr_20110110233048][0]: Recovery failed from [dm-essearchp103.bldrprod.local-ElasticSearch][fRPn3o62TPmtCLUHPeOxNg][inet[/10.15.104.3:9300]] into [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]]]; nested: RemoteTransportException[[dm-essearchp103.bldrprod.local-ElasticSearch][inet[/10.15.104.3:9300]][index/shard/recovery/startRecovery]]; nested: RecoveryEngineException[[djcnr_20110110233048]0] Phase[1] Execution failed]; nested: RecoverFilesRecoveryException[[djcnr_20110110233048]0] Failed to transfer [109] files with total size of [76mb]]; nested: AlreadyClosedException[this Directory is closed]; ]]
[2011-02-24 02:23:41,805][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [genericnews2_20110110233049][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,815][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [rochdale_20110110233056][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,823][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [index16_20110110233053][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,823][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [rbcvideo_20110110233056][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,823][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-reports1_20110110233051][3], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,823][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-ft_20110110233050][4], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,823][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [spider01_20110110233057][0], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [index28_20110110233055][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [moodys_20110110233055][0], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [index21_20110110233053][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [genericnews_20110110233049][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [index29_20110110233055][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-ft_20110110233050][2], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [genericreports_20110110233050][0], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,812][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [smereports_20110110233056][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [index23_20110110233054][0], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-reports1_20110110233051][1], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-reports1_20110110233051][6], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [sec_20110110233056][0], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-reports1_20110110233051][2], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [goldman_20110110233050][0], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,825][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-reports2_20110110233051][3], node[WEv2yxs-S2ihrtbrdoyyvw], [R], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-ft_20110110233050][0], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
[2011-02-24 02:23:41,824][WARN ][cluster.action.shard ] [dm-essearchp104.bldrprod.local-ElasticSearch] received shard failed for [idol-maestro-report1_20110110233050][2], node[WEv2yxs-S2ihrtbrdoyyvw], [P], s[STARTED], reason [master [dm-essearchp104.bldrprod.local-ElasticSearch][K7mlyvnETFu2bqY1mjn6dw][inet[/10.15.104.4:9300]] marked shard as started, but shard have not been created, mark shard as failed]
...
[2011-02-24 03:01:56,347][INFO ][cluster.service ] [dm-essearchp104.bldrprod.local-ElasticSearch] removed {[dm-essearchp102.bldrprod.local-essearcherserver][6QttpJ6DR227OYRnKCxnhQ][inet[/10.15.104.2:9301]]{client=true, data=false},}, reason: zen-disco-node_failed([dm-essearchp102.bldrprod.local-essearcherserver][6QttpJ6DR227OYRnKCxnhQ][inet[/10.15.104.2:9301]]{client=true, data=false}), reason transport disconnected (with verified connect)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment