Skip to content

Instantly share code, notes, and snippets.

View petersen-poul's full-sized avatar

Poul Petersen petersen-poul

View GitHub Profile
{
"name": "Assign Field Types by Field Name",
"description": "Sometimes, the automatic field detection does not assign field types correctly. This is especially a problem with fields that have a lot of missing values since the detection process only takes a peek at the data to determine if a field should be numeric, categorical, etc. This script allows you to alter the field types for a source based on the name of each field. Just put a partial match for the name in the list for the type you want to assign, and it will change all the fields whose name contain that string.",
"inputs": [
{
"name": "source",
"description": "Source to update.",
"type": "source-id"
},
{
@petersen-poul
petersen-poul / one-click-cluster-labels.json
Created March 22, 2019 19:16
one-click-cluster-labels
{
"name": "One Click Cluster Labels",
"description": "Given a cluster as input, assigns the cluster label to every instance in the dataset used to train the cluster.",
"inputs": [
{
"name": "cluster",
"description": "Cluster to use for labeling",
"type": "cluster-id"
}
],
{
"name": "Apply Date Format by Field Name",
"description": "Allows applying a custom date format to a source by matching field names",
"inputs": [
{
"name": "source",
"description": "Source to update",
"type": "source-id"
}
],
{
"name": "Lat/Long Distance between a pair of points",
"description": "Extends a dataset with the distance in meters between pairs of lat/long fields.",
"inputs": [
{
"name": "dataset-in",
"type": "dataset-id",
"description": "Dataset for extending with distance calculation."
},
{