Skip to content

Instantly share code, notes, and snippets.

@PMeinshausen
Created September 29, 2014 03:09
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save PMeinshausen/319268fc56f12f5abdbc to your computer and use it in GitHub Desktop.
Save PMeinshausen/319268fc56f12f5abdbc to your computer and use it in GitHub Desktop.
--Syntax (version 1.0)
CREATE TABLE model_table_name ( PARTITION KEY(token) ) AS
SELECT token, SUM( category_1 ) AS category_1, ... ,
SUM( category_n ) AS category_n FROM
NaiveBayesText(
ON input_table
TEXT_COLUMN( text_column )
CATEGORY_COLUMN( category_column )
CATEGORIES( category_1, ... , category_n )
[DELIMITER('delimiter_regular_expression')]
[PUNCTUATION('punctuation_regular_expression'))]
GROUP BY token;
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment