Skip to content

Instantly share code, notes, and snippets.

@rmetzger
Created May 14, 2020 14:11
Show Gist options
  • Save rmetzger/0e2e62c3b3773f9b5b27f3c56f48c013 to your computer and use it in GitHub Desktop.
Save rmetzger/0e2e62c3b3773f9b5b27f3c56f48c013 to your computer and use it in GitHub Desktop.
Flink Error behavor
=============== INITIAL REPORT ===============================
2020-05-14 10:12:42,660 INFO org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer [] - Started Kinesis producer instance for region 'eu-central-1'
2020-05-14 10:12:42,660 DEBUG org.apache.flink.streaming.api.operators.BackendRestorerProcedure [] - Creating operator state backend for StreamSource_cbc357ccb763df2852fee8c4fc7d55f2_(1/1) with empty state.
2020-05-14 10:12:42,823 INFO org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer [] - Closing producer
2020-05-14 10:12:42,823 INFO org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer [] - Flushing outstanding 2 records
2020-05-14 10:12:42,826 ERROR org.apache.flink.streaming.runtime.tasks.StreamTask [] - Error during disposal of stream operator.
org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.DaemonException: The child process has been shutdown and can no longer accept messages.
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.add(Daemon.java:176) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer.flush(KinesisProducer.java:785) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer.flush(KinesisProducer.java:805) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer.flushSync(FlinkKinesisProducer.java:404) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer.close(FlinkKinesisProducer.java:305) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.api.common.functions.util.FunctionUtils.closeFunction(FunctionUtils.java:43) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.AbstractUdfStreamOperator.dispose(AbstractUdfStreamOperator.java:117) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.StreamTask.disposeAllOperators(StreamTask.java:642) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.StreamTask.cleanUpInvoke(StreamTask.java:580) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:494) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:721) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.taskmanager.Task.run(Task.java:545) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_252]
2020-05-14 10:12:42,834 WARN org.apache.flink.runtime.taskmanager.Task [] - Source: Custom Source -> Sink: Unnamed (1/1) (4a49aea047aeb3e67cf79c788df0e558) switched from RUNNING to FAILED.
org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.DaemonException: The child process has been shutdown and can no longer accept messages.
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.add(Daemon.java:176) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer.addUserRecord(KinesisProducer.java:536) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer.invoke(FlinkKinesisProducer.java:293) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.streaming.api.operators.StreamSink.processElement(StreamSink.java:56) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$CopyingChainingOutput.pushToOperator(OperatorChain.java:713) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$CopyingChainingOutput.collect(OperatorChain.java:688) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$CopyingChainingOutput.collect(OperatorChain.java:668) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.CountingOutput.collect(CountingOutput.java:52) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.CountingOutput.collect(CountingOutput.java:30) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.StreamSourceContexts$NonTimestampContext.collect(StreamSourceContexts.java:104) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at de.robertmetzger.StreamingJob$1.run(StreamingJob.java:82) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.streaming.api.operators.StreamSource.run(StreamSource.java:100) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.StreamSource.run(StreamSource.java:63) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.SourceStreamTask$LegacySourceFunctionThread.run(SourceStreamTask.java:209) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
2020-05-14 10:12:42,836 INFO org.apache.flink.runtime.taskmanager.Task [] - Freeing task resources for Source: Custom Source -> Sink: Unnamed (1/1) (4a49aea047aeb3e67cf79c788df0e558).
2020-05-14 10:12:42,836 DEBUG org.apache.flink.runtime.taskmanager.Task [] - Release task Source: Custom Source -> Sink: Unnamed (1/1) network resources (state: FAILED).
2020-05-14 10:12:42,840 INFO org.apache.flink.runtime.taskmanager.Task [] - Ensuring all FileSystem streams are closed for task Source: Custom Source -> Sink: Unnamed (1/1) (4a49aea047aeb3e67cf79c788df0e558) [FAILED]
2020-05-14 10:12:42,859 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Un-registering task and sending final execution state FAILED to JobManager for task Source: Custom Source -> Sink: Unnamed (1/1) 4a49aea047aeb3e67cf79c788df0e558.
2020-05-14 10:12:43,095 DEBUG org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Free slot with allocation id 45ef7d95f3115e2b0aea185da581976e because: Stopping JobMaster for job Produce something into Kinesis(659a1a5023b72e73ca68b76d44051015).
2020-05-14 10:12:43,096 DEBUG org.apache.flink.runtime.taskexecutor.slot.TaskSlotTableImpl [] - Free slot TaskSlot(index:0, state:ACTIVE, resource profile: ResourceProfile{cpuCores=1.0000000000000000, taskHeapMemory=11.200mb (11744048 bytes), taskOffHeapMemory=0 bytes, managedMemory=220.800mb (231525584 bytes), networkMemory=64.000mb (67108864 bytes)}, allocationId: 45ef7d95f3115e2b0aea185da581976e, jobId: 659a1a5023b72e73ca68b76d44051015).
org.apache.flink.util.FlinkException: Stopping JobMaster for job Produce something into Kinesis(659a1a5023b72e73ca68b76d44051015).
at org.apache.flink.runtime.jobmaster.JobMaster.onStop(JobMaster.java:347) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStop(RpcEndpoint.java:216) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StartedState.terminate(AkkaRpcActor.java:514) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:176) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.Actor$class.aroundReceive(Actor.scala:517) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.ActorCell.invoke(ActorCell.scala:561) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.run(Mailbox.scala:225) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.exec(Mailbox.scala:235) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
2020-05-14 10:12:43,124 INFO org.apache.flink.runtime.taskexecutor.JobLeaderService [] - Remove job 659a1a5023b72e73ca68b76d44051015 from job leader monitoring.
2020-05-14 10:12:43,124 DEBUG org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Close JobManager connection for job 659a1a5023b72e73ca68b76d44051015.
org.apache.flink.util.FlinkException: TaskExecutor akka.tcp://flink@172.31.8.191:6122/user/rpc/taskmanager_0 has no more allocated slots for job 659a1a5023b72e73ca68b76d44051015.
at org.apache.flink.runtime.taskexecutor.TaskExecutor.closeJobManagerConnectionIfNoAllocatedResources(TaskExecutor.java:1591) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.taskexecutor.TaskExecutor.freeSlotInternal(TaskExecutor.java:1569) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.taskexecutor.TaskExecutor.freeSlot(TaskExecutor.java:936) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_252]
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) ~[?:1.8.0_252]
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:1.8.0_252]
at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_252]
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:284) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:199) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:152) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.Actor$class.aroundReceive(Actor.scala:517) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.ActorCell.invoke(ActorCell.scala:561) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.run(Mailbox.scala:225) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.exec(Mailbox.scala:235) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
2020-05-14 10:12:43,134 DEBUG org.apache.flink.runtime.state.TaskExecutorLocalStateStoresManager [] - Releasing local state under allocation id 45ef7d95f3115e2b0aea185da581976e.
2020-05-14 10:12:43,145 DEBUG org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Close JobManager connection for job 659a1a5023b72e73ca68b76d44051015.
org.apache.flink.util.FlinkException: Stopping JobMaster for job Produce something into Kinesis(659a1a5023b72e73ca68b76d44051015).
at org.apache.flink.runtime.jobmaster.JobMaster.onStop(JobMaster.java:347) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStop(RpcEndpoint.java:216) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StartedState.terminate(AkkaRpcActor.java:514) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:176) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.Actor$class.aroundReceive(Actor.scala:517) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.actor.ActorCell.invoke(ActorCell.scala:561) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.run(Mailbox.scala:225) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.Mailbox.exec(Mailbox.scala:235) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
2020-05-14 10:12:43,146 INFO org.apache.flink.runtime.taskexecutor.JobLeaderService [] - Cannot reconnect to job 659a1a5023b72e73ca68b76d44051015 because it is not registered.
2020-05-14 10:12:43,771 ERROR org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer [] - Error in child process
java.lang.RuntimeException: Error running child process
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.fatalError(Daemon.java:533) [blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.fatalError(Daemon.java:513) [blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.access$200(Daemon.java:63) [blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon$1.run(Daemon.java:135) [blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_252]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_252]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_252]
Caused by: java.lang.NullPointerException: You must specify a value for roleArn and roleSessionName
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.STSAssumeRoleSessionCredentialsProvider$Builder.<init>(STSAssumeRoleSessionCredentialsProvider.java:359) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.securitytoken.internal.STSProfileCredentialsService.getAssumeRoleCredentialsProvider(STSProfileCredentialsService.java:31) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.profile.internal.securitytoken.STSProfileCredentialsServiceProvider.getProfileCredentialsProvider(STSProfileCredentialsServiceProvider.java:39) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.profile.internal.securitytoken.STSProfileCredentialsServiceProvider.getCredentials(STSProfileCredentialsServiceProvider.java:71) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.WebIdentityTokenCredentialsProvider.getCredentials(WebIdentityTokenCredentialsProvider.java:72) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.makeSetCredentialsMessage(Daemon.java:565) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.startChildProcess(Daemon.java:436) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.access$100(Daemon.java:63) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon$1.run(Daemon.java:133) ~[blob_p-afc90c8bc62cad672112e2c1e958318a7526b7a6-58db67a9931ba82572c9b831aeb48677:?]
... 3 more
2020-05-14 10:12:47,560 DEBUG org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Received file upload request for file taskmanager.log
2020-05-14 10:12:47,562 DEBUG org.apache.flink.runtime.blob.BlobClient [] - PUT BLOB stream to /172.31.8.191:42616.
============================ including 7e23ceb72adca341e70fd923b293a892d9519894 ===========
2020-05-14 14:00:43,628 INFO org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer [] - Started Kinesis producer instance for region 'eu-central-1'
2020-05-14 14:00:43,668 INFO org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer [] - Closing producer
2020-05-14 14:00:43,669 INFO org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer [] - Flushing outstanding 2 records
2020-05-14 14:00:43,670 ERROR org.apache.flink.streaming.runtime.tasks.StreamTask [] - Error during disposal of stream operator.
org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.DaemonException: The child process has been shutdown and can no longer accept messages.
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.add(Daemon.java:176) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer.flush(KinesisProducer.java:785) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer.flush(KinesisProducer.java:805) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer.flushSync(FlinkKinesisProducer.java:412) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer.close(FlinkKinesisProducer.java:313) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.api.common.functions.util.FunctionUtils.closeFunction(FunctionUtils.java:43) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.AbstractUdfStreamOperator.dispose(AbstractUdfStreamOperator.java:117) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.StreamTask.disposeAllOperators(StreamTask.java:697) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.StreamTask.cleanUpInvoke(StreamTask.java:629) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:537) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:713) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.runtime.taskmanager.Task.run(Task.java:539) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_252]
2020-05-14 14:00:43,671 WARN org.apache.flink.runtime.taskmanager.Task [] - Source: Custom Source -> (Sink: Print to Std. Out, Sink: Unnamed) (1/1) (f0324728ebf789876cddb8a885425e78) switched from RUNNING to FAILED.
org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.DaemonException: The child process has been shutdown and can no longer accept messages.
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.add(Daemon.java:176) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer.addUserRecord(KinesisProducer.java:536) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.streaming.connectors.kinesis.FlinkKinesisProducer.invoke(FlinkKinesisProducer.java:301) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.streaming.api.operators.StreamSink.processElement(StreamSink.java:56) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$CopyingChainingOutput.pushToOperator(OperatorChain.java:715) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$CopyingChainingOutput.collect(OperatorChain.java:690) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$CopyingChainingOutput.collect(OperatorChain.java:670) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$BroadcastingOutputCollector.collect(OperatorChain.java:785) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.OperatorChain$BroadcastingOutputCollector.collect(OperatorChain.java:738) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.CountingOutput.collect(CountingOutput.java:52) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.CountingOutput.collect(CountingOutput.java:30) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.StreamSourceContexts$NonTimestampContext.collect(StreamSourceContexts.java:104) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at de.robertmetzger.StreamingJob$1.run(StreamingJob.java:82) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.streaming.api.operators.StreamSource.run(StreamSource.java:100) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.api.operators.StreamSource.run(StreamSource.java:63) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
at org.apache.flink.streaming.runtime.tasks.SourceStreamTask$LegacySourceFunctionThread.run(SourceStreamTask.java:209) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
2020-05-14 14:00:43,671 INFO org.apache.flink.runtime.taskmanager.Task [] - Freeing task resources for Source: Custom Source -> (Sink: Print to Std. Out, Sink: Unnamed) (1/1) (f0324728ebf789876cddb8a885425e78).
2020-05-14 14:00:43,671 INFO org.apache.flink.runtime.taskmanager.Task [] - Ensuring all FileSystem streams are closed for task Source: Custom Source -> (Sink: Print to Std. Out, Sink: Unnamed) (1/1) (f0324728ebf789876cddb8a885425e78) [FAILED]
2020-05-14 14:00:43,672 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Un-registering task and sending final execution state FAILED to JobManager for task Source: Custom Source -> (Sink: Print to Std. Out, Sink: Unnamed) (1/1) f0324728ebf789876cddb8a885425e78.
2020-05-14 14:00:43,737 INFO org.apache.flink.runtime.taskexecutor.slot.TaskSlotTableImpl [] - Free slot TaskSlot(index:0, state:ACTIVE, resource profile: ResourceProfile{cpuCores=1.0000000000000000, taskHeapMemory=11.200mb (11744048 bytes), taskOffHeapMemory=0 bytes, managedMemory=220.800mb (231525584 bytes), networkMemory=64.000mb (67108864 bytes)}, allocationId: 1b0224b2dcb2de6efe080b0a17e391fa, jobId: 698e6ee649eae85d5913199d32ef99db).
2020-05-14 14:00:43,737 INFO org.apache.flink.runtime.taskexecutor.DefaultJobLeaderService [] - Remove job 698e6ee649eae85d5913199d32ef99db from job leader monitoring.
2020-05-14 14:00:43,738 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Close JobManager connection for job 698e6ee649eae85d5913199d32ef99db.
2020-05-14 14:00:44,664 ERROR org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.KinesisProducer [] - Error in child process
java.lang.RuntimeException: Error running child process
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.fatalError(Daemon.java:533) [blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.fatalError(Daemon.java:513) [blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.access$200(Daemon.java:63) [blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon$1.run(Daemon.java:135) [blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_252]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_252]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_252]
Caused by: java.lang.NullPointerException: You must specify a value for roleArn and roleSessionName
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.STSAssumeRoleSessionCredentialsProvider$Builder.<init>(STSAssumeRoleSessionCredentialsProvider.java:359) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.securitytoken.internal.STSProfileCredentialsService.getAssumeRoleCredentialsProvider(STSProfileCredentialsService.java:31) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.profile.internal.securitytoken.STSProfileCredentialsServiceProvider.getProfileCredentialsProvider(STSProfileCredentialsServiceProvider.java:39) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.profile.internal.securitytoken.STSProfileCredentialsServiceProvider.getCredentials(STSProfileCredentialsServiceProvider.java:71) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.auth.WebIdentityTokenCredentialsProvider.getCredentials(WebIdentityTokenCredentialsProvider.java:72) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.makeSetCredentialsMessage(Daemon.java:565) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.startChildProcess(Daemon.java:436) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon.access$100(Daemon.java:63) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
at org.apache.flink.kinesis.shaded.com.amazonaws.services.kinesis.producer.Daemon$1.run(Daemon.java:133) ~[blob_p-007201db1e4eb2fd5d8043eaba083a9036979cae-375ca44b18953f5d6aed401e96a873f5:?]
... 3 more
@datability-io
Copy link

Hi, @rmetzger,

I'm seeing the same error message as in this gist. I'm just wondering if you have a way to fix this? Thanks.

Thomas

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment