Skip to content

Instantly share code, notes, and snippets.

Scott Hoover githoov

Block or report user

Report or block githoov

Hide content and notifications from this user.

Learn more about blocking users

Contact Support about this user’s behavior.

Learn more about reporting abuse

Report abuse
View GitHub Profile
@githoov
githoov / redshift_metadata.md
Last active Sep 14, 2018
Redshift Metadata
View redshift_metadata.md

Redshift Tables Used

pg_table_def, stl_query, stl_querytext, stl_tr_conflict, stl_explain, stl_alert_event_log, stl_ddltext, stl_scan, stl_save, stl_hashjoin, stl_hash, stl_plan_info, stl_return, and information_schema.table_constraints.

Queries to Extract Features

  • execution time
select (endtime - starttime) as execution_time_in_ms 
from stl_query 
where query = QUERY_ID;
View index.html
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Tree Example</title>
<style>
.node {
View data.json
{"links": [{"type": "", "target": "Bulletproof Arcade Limited", "source": "PlayFab, Inc."}, {"type": "", "target": "Fluffy Fairy Games GmbH", "source": "PlayFab, Inc."}, {"type": "", "target": "Hubspot, Inc.", "source": "Amplitude Analytics"}, {"type": "", "target": "Bee Square S.L.", "source": "PlayFab, Inc."}, {"type": "", "target": "Starbreeze", "source": "PlayFab, Inc."}, {"type": "", "target": "Seismic Games", "source": "PlayFab, Inc."}, {"type": "", "target": "BoomTV Inc", "source": "PlayFab, Inc."}, {"type": "", "target": "Integrated Direct Marketing", "source": "ThinkSmart Data Services LLC."}, {"type": "", "target": "Integrated Direct Marketing", "source": "Square, Inc."}, {"type": "", "target": "Turner Broadcasting System, Inc.", "source": "Turner Broadcasting System, Inc."}, {"type": "", "target": "Cimpress USA Incorporated", "source": "Cimpress USA Incorporated"}, {"type": "", "target": "Slice & Co", "source": "Kustomer, Inc."}, {"type": "", "target": "Datascan Technologies LLC", "source": "Datasc
View index.html
</style>
<body>
<script src="//d3js.org/d3.v3.min.js"></script>
<script>
// http://blog.thomsonreuters.com/index.php/mobile-patent-suits-graphic-of-the-day/
var links = [
{
"target":"PlayFab, Inc.",
"source":"Bulletproof Arcade Limited"
View flare.json
{
"name": "snowhouse",
"children": [
{
"name": "snowhouse_import",
"children": [
{"children": [{"size": 14336, "name": "ACCOUNT_ETL"}, {"size": 0, "name": "ACCOUNT_INVITATION_ETL"}, {"size": 0, "name": "ACCOUNT_INVITATION_RAW"}, {"size": 2108928, "name": "ACCOUNT_RAW"}, {"size": 11019776, "name": "AUTHN_EVENT_ETL"}, {"size": 984576, "name": "AUTHN_EVENT_RAW"}, {"size": 20779008, "name": "CLIENT_TELEMETRY"}, {"size": 9216, "name": "CONFIGURATION_ETL"}, {"size": 36352, "name": "CONFIGURATION_ID_TABLE_ETL"}, {"size": 36974592, "name": "CONFIGURATION_ID_TABLE_RAW"}, {"size": 13059072, "name": "CONFIGURATION_RAW"}, {"size": 3584, "name": "CONFIGURATION_STRING_ETL"}, {"size": 525824, "name": "CONFIGURATION_STRING_RAW"}, {"size": 348672, "name": "CONSTRAINT_ETL"}, {"size": 758784, "name": "CONSTRAINT_RAW"}, {"size": 112128, "name": "COUNTING_NODE_ETL"}, {"size": 196286464, "name": "COUNTING_NODE_RAW"}, {"size": 70144, "name": "DATABASE_ETL"}, {"size": 48640, "name": "DATABASE_RAW"}, {"size": 35044864,
View this.md
column_name table_name
foo accounts
bar accounts
baz users
View this.md
column_name table_name ...
foo accounts ...
bar accounts ...
baz users ...
@githoov
githoov / nlp.md
Last active Apr 20, 2017
NLP Blog
View nlp.md

Introduction

I recently had the opportunity to showcase Snowflake at JOIN, Looker's first user conference. I used my time to highlight a few Snowflake features that I find particularly useful, as someone who does analytics. The presentation demonstrated simultaneous workloads that share the same data as well as analytically intensive SQL patterns against large-scale, semi-structured data.

I thought I'd refactor my presentation as a series of blog entries to share some useful insights and interesting patterns with a broader audience. I'm going to step through how to incorporate sentiment analysis as well as tweet similarity into an interactive model using both Looker and Snowflake. (Note: if you haven't read our previous blog on sentiment analysis using Naïve Bayes, I highly recommend you do so.)

Part 1 - Simultaneous Workloads, Shared Data

N/A

Part 2 - Tweet Similarity

Overview

You can’t perform that action at this time.