tobilg / custom_s3_endpoint_in_spark.md
Last active May 22, 2023 14:57
Description of how to use a custom S3 endpoint (like Rados Gateway for Ceph)

Custom S3 endpoints with Spark

To use a custom S3 endpoint with the latest Spark distribution, you need to add an external package (hadoop-aws). Custom endpoints can then be configured according to the docs.

Use the hadoop-aws package

bin/spark-shell --packages org.apache.hadoop:hadoop-aws:2.7.2
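
As a minimal sketch, the endpoint can also be supplied at launch time instead of in code. Spark copies every `spark.hadoop.*` property into the Hadoop configuration, so `spark.hadoop.fs.s3a.endpoint` ends up as `fs.s3a.endpoint` for the S3A filesystem. The endpoint URL below is a hypothetical placeholder (not from the original), and the variable names are illustrative only. The snippet prints the assembled command for review rather than running it.

```shell
# Hypothetical placeholder: replace with the URL of your own gateway.
S3_ENDPOINT="http://rgw.example.internal:7480"

# Same package coordinates as the command above.
HADOOP_AWS_PACKAGE="org.apache.hadoop:hadoop-aws:2.7.2"

# spark.hadoop.* properties are passed through to the Hadoop configuration,
# so this sets fs.s3a.endpoint without touching the SparkContext in code.
ENDPOINT_CONF="spark.hadoop.fs.s3a.endpoint=${S3_ENDPOINT}"

SPARK_CMD="bin/spark-shell --packages ${HADOOP_AWS_PACKAGE} --conf ${ENDPOINT_CONF}"

# Print the command; run it from the root of the Spark distribution.
echo "${SPARK_CMD}"
```

Credentials are left out here on purpose: passing them as command-line flags would expose them in the process list and shell history.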

SparkContext configuration