@kasramp
Created April 20, 2020 19:10
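import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;

// Reads the books table over JDBC in parallel and appends it as JSON files under books_json.
// propertyLoader is a field of the enclosing class (not shown in this gist).
private void transformToJson() {
    SparkSession spark = SparkSession.builder()
            .config("spark.master", "local")
            .appName("Java Spark SQL basic example")
            .getOrCreate();
    String query = "SELECT id, title, description, author, year, edition, publisher FROM books order by id";
    Dataset<Row> jdbcDF = spark.read().format("jdbc")
            .option("url", propertyLoader.getJdbcConnectionString())
            .option("user", propertyLoader.getDatabaseUserName())
            .option("password", propertyLoader.getDatabasePassword())
            // numPartitions, partitionColumn, lowerBound and upperBound must be set together.
            .option("numPartitions", "10")
            .option("partitionColumn", "id")
            .option("lowerBound", "0")
            // The bounds only set the partition stride; they do not filter rows. With an upper
            // bound this large, nearly all ids are likely to fall into the first partition.
            .option("upperBound", Long.MAX_VALUE)
            .option("driver", "com.mysql.jdbc.Driver")
            // A subquery passed as dbtable must be parenthesized and aliased.
            .option("dbtable", String.format("(%s) AS tmp", query))
            .load();
    jdbcDF.write().format("json").mode("append").save("books_json");
    spark.close();
}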
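
The partitioning options above (numPartitions, partitionColumn, lowerBound, upperBound) decide how the id range is split into parallel queries. The sketch below is a simplified model of that split, not Spark's actual code: the class PartitionStrideSketch and its predicates method are hypothetical names introduced only for illustration, and real Spark versions may compute the stride slightly differently. It may help show why an upper bound of Long.MAX_VALUE tends to put almost every row in the first partition.

```java
import java.util.ArrayList;
import java.util.List;

public class PartitionStrideSketch {

    // Hypothetical helper: builds one WHERE predicate per partition,
    // approximating how a bounded numeric column is split into ranges.
    public static List<String> predicates(String column, long lower, long upper, int numPartitions) {
        if (numPartitions < 2) {
            // A single partition needs no range predicates, so this sketch rejects it.
            throw new IllegalArgumentException("numPartitions must be at least 2");
        }
        // Divide before subtracting so a very large upper bound cannot overflow.
        long stride = upper / numPartitions - lower / numPartitions;
        List<String> result = new ArrayList<>();
        long current = lower;
        for (int i = 0; i < numPartitions; i++) {
            // The first partition has no lower limit and the last has no upper limit,
            // so rows outside [lower, upper) are still read.
            String lowerClause = (i == 0) ? null : column + " >= " + current;
            current += stride;
            String upperClause = (i == numPartitions - 1) ? null : column + " < " + current;
            if (lowerClause == null) {
                result.add(upperClause + " OR " + column + " IS NULL");
            } else if (upperClause == null) {
                result.add(lowerClause);
            } else {
                result.add(lowerClause + " AND " + upperClause);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Same bounds as the gist: the first range alone spans roughly 9.2e17 ids.
        predicates("id", 0L, Long.MAX_VALUE, 10).forEach(System.out::println);
    }
}
```

Under this model, choosing bounds close to the real minimum and maximum id (for example from a SELECT MIN(id), MAX(id) query run beforehand) would spread the rows more evenly across the ten partitions.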