Skip to content

Instantly share code, notes, and snippets.

Avatar

João Antonio Ferreira joao-parana

View GitHub Profile
View Workflow01UDAF.scala
// UserDefinedAggregateFunction is the contract to define
// user-defined aggregate functions (UDAFs)
class MyCountUDAF extends UserDefinedAggregateFunction {
// Este método abaixo define pode ser invocado apenas assim: inputSchema(0)
// Isto é feito via inversão de dependência pelo Spark
// o retorno é um objeto StructField assim:
// StructField("id", LongType, true, {})
// o objeto StructField é do pacote org.apache.spark.sql.types
override def inputSchema: StructType = {
new StructType().add("id", LongType, nullable = true)
View theme.md

<img src="

View diagrama-W1.svg
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
View W1.scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.SparkSession.Builder
import org.apache.spark.SparkContext
import org.apache.log4j.{Level, Logger}
// A sparkSession é provida pelo proprio Spark Shell
// O nivel de log também já é configurado pela Spark Shell
def boolean_udf_wrapper(a:String, b:String, t:Any): Boolean = { true }
def string_udf_wrapper(a:String, b:String, t:Any): String = { "••••" }
import org.apache.spark.sql.functions.expr
import org.apache.spark.sql.functions.sum
View Workflow01UDFs.scala
def happyEmployees(salary: Int) => salary > 2200
def smartTextCase(name: String) => name.toUpperCase()
View kubernetes-dnd.sh
# Usando o Kubernetes com Docker in Docker (DIND)
sudo mkdir -p /usr/local
cd /usr/local
sudo mkdir dind-cluster
cd dind-cluster/
sudo chmod o+w .
ls -lat .. | head
# wget https://cdn.rawgit.com/kubernetes-sigs/kubeadm-dind-cluster/master/fixed/dind-cluster-v1.10.sh
curl -O https://cdn.rawgit.com/kubernetes-sigs/kubeadm-dind-cluster/master/fixed/dind-cluster-v1.10.sh
@joao-parana
joao-parana / .block
Created Mar 28, 2018 — forked from mbostock/.block
Chord Diagram
View .block
license: gpl-3.0
height: 960
border: no
@joao-parana
joao-parana / map-pushdow.sc
Last active Apr 20, 2018
Using CatalystExtension Points in Spark
View map-pushdow.sc
// Este script é para rodar no Ammonite.
// Crie o arquivo catalyst_04.sc com este conteúdo
// Dentro da shell REPL do Ammonitem, você deve invocar assim:
// import $file.catalyst_04, catalyst_04._
//
// Mas antes execute estes tres comandos abaixo
// import coursier.MavenRepository
// interp.repositories() ++= Seq(MavenRepository("file:/Users/admin/.m2/repository"))
// import $ivy.`org.apache.spark::spark-sql:2.3.0`
@joao-parana
joao-parana / index.html
Last active Dec 19, 2017
SOMA 02 - data binding
View index.html
<!DOCTYPE html>
<html>
<head>
<meta name="description" content="data-binding">
</head>
<body>
<script src="https://d3js.org/d3.v4.min.js" charset=" utf-8"></script>
<script>
var selector = d3.select('body')
.selectAll('span')
@joao-parana
joao-parana / index.html
Last active Dec 19, 2017
Exemplo simples com select
View index.html
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="description" content="SOMA 01 - Hello World">
<title>SOMA 01 - Hello World</title>
</head>
<body>
<div id="page"></div>
<script src="https://d3js.org/d3.v4.min.js" charset="utf-8"></script>