Skip to content

Instantly share code, notes, and snippets.

@kittipatkampa
Created July 31, 2019 21:29
Show Gist options
  • Save kittipatkampa/774a18b1f0717fcc56b231125959bf00 to your computer and use it in GitHub Desktop.
Save kittipatkampa/774a18b1f0717fcc56b231125959bf00 to your computer and use it in GitHub Desktop.
Converting pyspark dataframe into RDD and back to DataFrame can resolve StackOverflow Error.
train_df = spark.createDataFrame(train_df.rdd, schema=train_df.schema)
model_pred = pipeline_pred.fit(train_df)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment