Skip to content

Instantly share code, notes, and snippets.

View ibzib's full-sized avatar

Kyle Weaver ibzib

View GitHub Profile
55 results - 12 files
website/CONTRIBUTE.md:
192
193: ```
194: {{< highlight java >}}
195 // This is java
website/www/site/content/en/documentation/io/built-in/hadoop.md:
35
@ibzib
ibzib / spark-pwc-error.txt
Created May 13, 2019 18:22
Beam portable word count fails on Spark with `pre_optimize=all` set
ERROR:root:java.lang.IllegalArgumentException: PCollectionNodes [PCollectionNode{id=ref_PCollection_PCollection_4, PCollection=unique_name: "57read/Read/Reshuffle/ReshufflePerKey/Map(reify_timestamps).None"
coder_id: "ref_Coder_TupleCoder_5_length_prefixed"
is_bounded: BOUNDED
windowing_strategy_id: "ref_Windowing_Windowing_2"
}, PCollectionNode{id=ref_PCollection_PCollection_24, PCollection=unique_name: "46write/Write/WriteImpl/WindowInto(WindowIntoFn).None"
coder_id: "ref_Coder_TupleCoder_4_length_prefixed"
is_bounded: BOUNDED
windowing_strategy_id: "ref_Windowing_Windowing_1"
}, PCollectionNode{id=ref_PCollection_PCollection_12, PCollection=unique_name: "13pair_with_one.None"
coder_id: "ref_Coder_TupleCoder_11"
@ibzib
ibzib / Fork.java
Last active May 9, 2019 20:36
Beam pipeline to surface re-computation bug in Spark portable runner
PCollection<String> a = pipeline.apply(Impulse.create())
.apply("a", ParDo.of(new DoFn<byte[], String>() {
@ProcessElement
public void processElement(ProcessContext context) {
System.out.println("A");
context.output("a");
}
}));
PCollection<String> ab = a.apply("b", ParDo.of(new DoFn<String, String>() {
@ProcessElement