Skip to content

Instantly share code, notes, and snippets.

Created August 8, 2017 14:20
Show Gist options
  • Save anonymous/ac5ec1c4deb52ae555215222ac730526 to your computer and use it in GitHub Desktop.
Save anonymous/ac5ec1c4deb52ae555215222ac730526 to your computer and use it in GitHub Desktop.
private def assemblePipeline(): Pipeline = {
val stages = Array(
cleanerPipeline(),
tokenizePipeline(),
vectorizePipeline(),
ldaPipeline(),
logisticRegressionPipeline()
).flatten
new Pipeline().setStages(stages)
}
override def fit(dataset: Dataset[_]): PipelineModel = {
logger.info(s"Preparing to fit quora question pipeline with params:\n${explainParams()}")
val model = assemblePipeline().fit(dataset)
logMetrics(model, dataset)
logTopics(dataset.sparkSession, model)
model
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment