Skip to content

Instantly share code, notes, and snippets.

@nivdul
Last active August 29, 2015 14:19
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save nivdul/0ff01e13ba05135df09d to your computer and use it in GitHub Desktop.
Save nivdul/0ff01e13ba05135df09d to your computer and use it in GitHub Desktop.
Mean and Variance
import org.apache.spark.mllib.stat.MultivariateStatisticalSummary;
import org.apache.spark.mllib.stat.Statistics;
private MultivariateStatisticalSummary summary;
public ExtractFeature(JavaRDD<Vector> data) {
this.summary = Statistics.colStats(data.rdd());
}
// return (mean_acc_x, mean_acc_y, mean_acc_z)
public double[] computeAvgAcc() {
return this.summary.mean().toArray();
}
// return (var_acc_x, var_acc_y, var_acc_z)
public double[] computeVariance() {
return this.summary.variance().toArray();
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment