2025-12-09 23:57:09,949 - INFO - Starting Adhoc Job Coordinator with the following parameters:
Job name: None
Job file: k8s/safety/accnts/bot_defense/bns_monitoring/model_performance/bot_defense_metrics_daily.py
Is analysis request: False
Remote: True
Start datetime: 2025-12-08 00:00:00
End datetime: None
2025-12-09 23:57:09,950 - INFO - container initialized
2025-12-09 23:57:09,950 - INFO - initializing job file coordinator...
2025-12-09 23:57:09,950 - INFO - about to execute coordinator...
An error was encountered:
An error occurred while calling o344.showString.
: org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 28.0 failed 4 times, most recent failure: Lost task 0.3 in stage 28.0 (TID 1008) (ip-10-103-62-216.ec2.internal executor 128): org.apache.spark.sql.execution.datasources.FileDownloadException: Failed to download file path: s3://rbx.usr/nonmasked/dw_nonpii/ts_ops_forecast_organic_demand/training_end_date=2024-07-10/content_type=abuse_voice/part-00003-2a25150e-682e-4171-aaa9-835415b41c8d.c000.snappy.parquet, range: 0-25583, partition values: [2024-07-10,abuse_voice], isDataPresent: false, eTag: 44825b6f4ed7c2697fe5f8196f434aa5-1
	at org.apache.spark.sql.execution.datasources.AsyncFileDownloader.next(AsyncFileDownloader.scala:142)
	at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.getNextFile(FileScanRDD.scala:291)
	at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.nextIterator(FileScanRDD.scala:218)
	at org.apache.spark.sq